FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB3649, 592 aa
  1>>>pF1KB3649 592 - 592 aa - 592 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.0445+/-0.000948; mu= 8.3135+/- 0.057
 mean_var=200.8339+/-40.273, 0's: 0 Z-trim(112.9): 11 B-trim: 548 in 1/51
 Lambda= 0.090501
 statistics sampled from 13613 (13620) to 13613 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.747), E-opt: 0.2 (0.418), width: 16
 Scan time: 2.690

The best scores are:                                      opt  bits E(32554)
CCDS4447.1 CANX gene_id:821|Hs108|chr5             ( 592) 4150 554.6 1.3e-157
CCDS3751.1 CLGN gene_id:1047|Hs108|chr4            ( 610) 2442 331.6 1.8e-90
CCDS12288.1 CALR gene_id:811|Hs108|chr19           ( 417)  472  74.2 3.7e-13

>>CCDS4447.1 CANX gene_id:821|Hs108|chr5                 (592 aa)
 initn: 4150 init1: 4150 opt: 4150 Z-score: 2943.1 bits: 554.6 E(32554): 1.3e-157
Smith-Waterman score: 4150; 100.0% identity (100.0% similar) in 592 aa overlap (1-592:1-592)

               10        20        30        40        50        60
pF1KB3 MEGKWLLCMLLVLGTAIVEAHDGHDDDVIDIEDDLDDVIEEVEDSKPDTTAPPSSPKVTY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MEGKWLLCMLLVLGTAIVEAHDGHDDDVIDIEDDLDDVIEEVEDSKPDTTAPPSSPKVTY
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB3 KAPVPTGEVYFADSFDRGTLSGWILSKAKKDDTDDEIAKYDGKWEVEEMKESKLPGDKGL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 KAPVPTGEVYFADSFDRGTLSGWILSKAKKDDTDDEIAKYDGKWEVEEMKESKLPGDKGL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB3 VLMSRAKHHAISAKLNKPFLFDTKPLIVQYEVNFQNGIECGGAYVKLLSKTPELNLDQFH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 VLMSRAKHHAISAKLNKPFLFDTKPLIVQYEVNFQNGIECGGAYVKLLSKTPELNLDQFH
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB3 DKTPYTIMFGPDKCGEDYKLHFIFRHKNPKTGIYEEKHAKRPDADLKTYFTDKKTHLYTL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 
DKTPYTIMFGPDKCGEDYKLHFIFRHKNPKTGIYEEKHAKRPDADLKTYFTDKKTHLYTL 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB3 ILNPDNSFEILVDQSVVNSGNLLNDMTPPVNPSREIEDPEDRKPEDWDERPKIPDPEAVK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 ILNPDNSFEILVDQSVVNSGNLLNDMTPPVNPSREIEDPEDRKPEDWDERPKIPDPEAVK 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB3 PDDWDEDAPAKIPDEEATKPEGWLDDEPEYVPDPDAEKPEDWDEDMDGEWEAPQIANPRC :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 PDDWDEDAPAKIPDEEATKPEGWLDDEPEYVPDPDAEKPEDWDEDMDGEWEAPQIANPRC 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB3 ESAPGCGVWQRPVIDNPNYKGKWKPPMIDNPSYQGIWKPRKIPNPDFFEDLEPFRMTPFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 ESAPGCGVWQRPVIDNPNYKGKWKPPMIDNPSYQGIWKPRKIPNPDFFEDLEPFRMTPFS 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB3 AIGLELWSMTSDIFFDNFIICADRRIVDDWANDGWGLKKAADGAAEPGVVGQMIEAAEER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 AIGLELWSMTSDIFFDNFIICADRRIVDDWANDGWGLKKAADGAAEPGVVGQMIEAAEER 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB3 PWLWVVYILTVALPVFLVILFCCSGKKQTSGMEYKKTDAPQPDVKEEEEEKEEEKDKGDE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 PWLWVVYILTVALPVFLVILFCCSGKKQTSGMEYKKTDAPQPDVKEEEEEKEEEKDKGDE 490 500 510 520 530 540 550 560 570 580 590 pF1KB3 EEEGEEKLEEKQKSDAEEDGGTVSQEEEDRKPKAEEDEILNRSPRNRKPRRE :::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 EEEGEEKLEEKQKSDAEEDGGTVSQEEEDRKPKAEEDEILNRSPRNRKPRRE 550 560 570 580 590 >>CCDS3751.1 CLGN gene_id:1047|Hs108|chr4 (610 aa) initn: 1793 init1: 1670 opt: 2442 Z-score: 1737.7 bits: 331.6 E(32554): 1.8e-90 Smith-Waterman score: 2446; 58.7% identity (79.6% similar) in 593 aa overlap (5-579:7-585) 10 20 30 40 50 pF1KB3 MEGKWLLCMLLVLGTAIVEAHDGHDDDVIDIEDDLDDVIEEVEDSKPDTTAPPSSPKV :: :. :.. . .: : ::: .:... ::. :.. : .. 
CCDS37 MHFQAFWL-CLGLLFISINAEFMD---DDVET--EDFEENSEEI-----DVNESELSSEI 10 20 30 40 60 70 80 90 100 110 pF1KB3 TYKAPVPTGEVYFADSFDRGTLSGWILSKAKKDDTDDEIAKYDGKWEVEEMKESKLPGDK ::.: : ::::::..:: : :.::.:::::::: :.::. :::.::.::.::...:::. CCDS37 KYKTPQPIGEVYFAETFDSGRLAGWVLSKAKKDDMDEEISIYDGRWEIEELKENQVPGDR 50 60 70 80 90 100 120 130 140 150 160 170 pF1KB3 GLVLMSRAKHHAISAKLNKPFLFDTKPLIVQYEVNFQNGIECGGAYVKLLSKTPELNLDQ :::: :::::::::: : :::.: ::::::::::::.::.:::::.:::. : .: :.. CCDS37 GLVLKSRAKHHAISAVLAKPFIFADKPLIVQYEVNFQDGIDCGGAYIKLLADTDDLILEN 110 120 130 140 150 160 180 190 200 210 220 230 pF1KB3 FHDKTPYTIMFGPDKCGEDYKLHFIFRHKNPKTGIYEEKHAKRPDADLKTYFTDKKTHLY :.::: : :::::::::::::::::::::.::::..:::::: ::.::: .:::.::::: CCDS37 FYDKTSYIIMFGPDKCGEDYKLHFIFRHKHPKTGVFEEKHAKPPDVDLKKFFTDRKTHLY 170 180 190 200 210 220 240 250 260 270 280 290 pF1KB3 TLILNPDNSFEILVDQSVVNSGNLLNDMTPPVNPSREIEDPEDRKPEDWDERPKIPDPEA ::..:::..::.::::.:::.:.::.:..::..: .:::::.:.:::.:::: ::::: : CCDS37 TLVMNPDDTFEVLVDQTVVNKGSLLEDVVPPIKPPKEIEDPNDKKPEEWDERAKIPDPSA 230 240 250 260 270 280 300 310 320 330 340 350 pF1KB3 VKPDDWDEDAPAKIPDEEATKPEGWLDDEPEYVPDPDAEKPEDWDEDMDGEWEAPQIANP :::.::::. ::.: : ..:: :::::::...:::.::::.::.:: ::::::::: :: CCDS37 VKPEDWDESEPAQIEDSSVVKPAGWLDDEPKFIPDPNAEKPDDWNEDTDGEWEAPQILNP 290 300 310 320 330 340 360 370 380 390 400 410 pF1KB3 RCESAPGCGVWQRPVIDNPNYKGKWKPPMIDNPSYQGIWKPRKIPNPDFFEDLEPFRMTP :. ::: :. :.::::.::: :.::..:::.:::::.::::::::.::: .:: .: CCDS37 ACR--IGCGEWKPPMIDNPKYKGVWRPPLVDNPNYQGIWSPRKIPNPDYFEDDHPFLLTS 350 360 370 380 390 400 420 430 440 450 460 470 pF1KB3 FSAIGLELWSMTSDIFFDNFIICADRRIVDDWANDGWGLKKAADGAAEPGVVGQMIEAAE :::.:::::::::::.:::::::......: :: ::: : .: .:::. :.. ::: CCDS37 FSALGLELWSMTSDIYFDNFIICSEKEVADHWAADGWRWKIMIANANKPGVLKQLMAAAE 410 420 430 440 450 460 480 490 500 510 520 530 pF1KB3 ERPWLWVVYILTVALPVFLVILFC--CSGKKQTSGMEYKKTDAPQPDVK-----EEEEEK .::::..:..:...:. :. :: . ::. . 
:::::: :..: ::.::: CCDS37 GHPWLWLIYLVTAGVPIALITSFCWPRKVKKKHKDTEYKKTDICIPQTKGVLEQEEKEEK 470 480 490 500 510 520 540 550 560 570 580 pF1KB3 ---------EEEKDKGDEEE-EGEEKLEEKQKSDAEEDGGTVSQEEEDRKPKA-EEDEIL :::: ..: : : ::. : ..::. :: .::: ... :. :::. CCDS37 AALEKPMDLEEEKKQNDGEMLEKEEESEPEEKSE-EEIEIIEGQEESNQSNKSGSEDEMK 530 540 550 560 570 580 590 pF1KB3 NRSPRNRKPRRE CCDS37 EADESTGSGDGPIKSVRKRRVRKD 590 600 610 >>CCDS12288.1 CALR gene_id:811|Hs108|chr19 (417 aa) initn: 494 init1: 326 opt: 472 Z-score: 349.7 bits: 74.2 E(32554): 3.7e-13 Smith-Waterman score: 917; 36.5% identity (57.8% similar) in 490 aa overlap (69-555:21-408) 40 50 60 70 80 90 pF1KB3 IEEVEDSKPDTTAPPSSPKVTYKAPVPTGEVYFADSF--DRGTLSGWILSKAKKDDTDDE ::: ..: : : :: :: :.: . CCDS12 MLLSVPLLLGLLGLAVAEPAVYFKEQFLDGDGWTSRWIESKHKSDF--GK 10 20 30 40 100 110 120 130 140 150 pF1KB3 IAKYDGKWEVEEMKESKLPGDKGLVLMSRAKHHAISAKLNKPFLFDTKPLIVQYEVNFQN .. .::. .: : :::: . :. .:.::.. .:: . :.::. :. .. CCDS12 FVLSSGKFYGDEEK------DKGLQTSQDARFYALSASF-EPFSNKGQTLVVQFTVKHEQ 50 60 70 80 90 100 160 170 180 190 200 210 pF1KB3 GIECGGAYVKLLSKTPELNLDQFHDKTPYTIMFGPDKCGEDYK-LHFIFRHKNPKTGIYE .:.:::.::::. .. :. ..: . :.:::::: :: : .: :: .:. .. : . CCDS12 NIDCGGGYVKLFPNS--LDQTDMHGDSEYNIMFGPDICGPGTKKVHVIFNYKGKNVLINK 110 120 130 140 150 220 230 240 250 260 270 pF1KB3 EKHAKRPDADLKTYFTDKKTHLYTLILNPDNSFEILVDQSVVNSGNLLNDMTPPVNPSRE . . : :. :::::::. :::..:. .:.: :.::.: .: : .. CCDS12 DIRCK----------DDEFTHLYTLIVRPDNTYEVKIDNSQVESGSLEDDWD--FLPPKK 160 170 180 190 200 280 290 300 310 320 330 pF1KB3 IEDPEDRKPEDWDERPKIPDPEAVKPDDWDEDAPAKIPDEEATKPEGWLDDEPEYVPDPD :.::. :::::::: :: :: ::.:::. 
::..:::: CCDS12 IKDPDASKPEDWDERAKIDDPTDSKPEDWDK---------------------PEHIPDPD 210 220 230 240 340 350 360 370 380 390 pF1KB3 AEKPEDWDEDMDGEWEAPQIANPRCESAPGCGVWQRPVIDNPNYKGKWKPPMIDNPSYQG :.:::::::.:::::: : :::.::.:::.::: .::::.:.: CCDS12 AKKPEDWDEEMDGEWE------P-------------PVIQNPEYKGEWKPRQIDNPDYKG 250 260 270 280 400 410 420 430 440 450 pF1KB3 IWKPRKIPNPDFFEDLEPFRMTPFSAIGLELWSMTSDIFFDNFIICADRRIVDDWANDGW : .: ::.. : . . :...::.::.. : .::::.: :. .....:. : CCDS12 TWIHPEIDNPEYSPDPSIYAYDNFGVLGLDLWQVKSGTIFDNFLITNDEAYAEEFGNETW 290 300 310 320 330 340 460 470 480 490 500 510 pF1KB3 GLKKAADGAAEPGVVGQMIEAAEERPWLWVVYILTVALPVFLVILFCCSGKKQTSGMEYK :. :::. :: . .:. : :.. . : CCDS12 GVTKAAEK--------QMKDKQDEEQRL----------------------KEEEEDKKRK 350 360 370 520 530 540 550 560 570 pF1KB3 KTDAPQPDVKEEEEEKEEEKDKGDEEEEGEEKLEEKQKSDAEEDGGTVSQEEEDRKPKAE . .:: :.::...:: ::.:: :: :: .. : CCDS12 E--------EEEAEDKEDDEDK-DEDEEDEEDKEEDEEEDVPGQAKDEL 380 390 400 410 580 590 pF1KB3 EDEILNRSPRNRKPRRE 592 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 13:28:28 2016 done: Sat Nov 5 13:28:29 2016 Total Scan time: 2.690 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]