FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB3649, 592 aa
1>>>pF1KB3649 592 - 592 aa - 592 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.0445+/-0.000948; mu= 8.3135+/- 0.057
mean_var=200.8339+/-40.273, 0's: 0 Z-trim(112.9): 11 B-trim: 548 in 1/51
Lambda= 0.090501
statistics sampled from 13613 (13620) to 13613 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.747), E-opt: 0.2 (0.418), width: 16
Scan time: 2.690
The best scores are: opt bits E(32554)
CCDS4447.1 CANX gene_id:821|Hs108|chr5 ( 592) 4150 554.6 1.3e-157
CCDS3751.1 CLGN gene_id:1047|Hs108|chr4 ( 610) 2442 331.6 1.8e-90
CCDS12288.1 CALR gene_id:811|Hs108|chr19 ( 417) 472 74.2 3.7e-13
>>CCDS4447.1 CANX gene_id:821|Hs108|chr5 (592 aa)
initn: 4150 init1: 4150 opt: 4150 Z-score: 2943.1 bits: 554.6 E(32554): 1.3e-157
Smith-Waterman score: 4150; 100.0% identity (100.0% similar) in 592 aa overlap (1-592:1-592)
10 20 30 40 50 60
pF1KB3 MEGKWLLCMLLVLGTAIVEAHDGHDDDVIDIEDDLDDVIEEVEDSKPDTTAPPSSPKVTY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MEGKWLLCMLLVLGTAIVEAHDGHDDDVIDIEDDLDDVIEEVEDSKPDTTAPPSSPKVTY
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB3 KAPVPTGEVYFADSFDRGTLSGWILSKAKKDDTDDEIAKYDGKWEVEEMKESKLPGDKGL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 KAPVPTGEVYFADSFDRGTLSGWILSKAKKDDTDDEIAKYDGKWEVEEMKESKLPGDKGL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB3 VLMSRAKHHAISAKLNKPFLFDTKPLIVQYEVNFQNGIECGGAYVKLLSKTPELNLDQFH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 VLMSRAKHHAISAKLNKPFLFDTKPLIVQYEVNFQNGIECGGAYVKLLSKTPELNLDQFH
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB3 DKTPYTIMFGPDKCGEDYKLHFIFRHKNPKTGIYEEKHAKRPDADLKTYFTDKKTHLYTL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 DKTPYTIMFGPDKCGEDYKLHFIFRHKNPKTGIYEEKHAKRPDADLKTYFTDKKTHLYTL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB3 ILNPDNSFEILVDQSVVNSGNLLNDMTPPVNPSREIEDPEDRKPEDWDERPKIPDPEAVK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 ILNPDNSFEILVDQSVVNSGNLLNDMTPPVNPSREIEDPEDRKPEDWDERPKIPDPEAVK
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB3 PDDWDEDAPAKIPDEEATKPEGWLDDEPEYVPDPDAEKPEDWDEDMDGEWEAPQIANPRC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 PDDWDEDAPAKIPDEEATKPEGWLDDEPEYVPDPDAEKPEDWDEDMDGEWEAPQIANPRC
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB3 ESAPGCGVWQRPVIDNPNYKGKWKPPMIDNPSYQGIWKPRKIPNPDFFEDLEPFRMTPFS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 ESAPGCGVWQRPVIDNPNYKGKWKPPMIDNPSYQGIWKPRKIPNPDFFEDLEPFRMTPFS
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB3 AIGLELWSMTSDIFFDNFIICADRRIVDDWANDGWGLKKAADGAAEPGVVGQMIEAAEER
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 AIGLELWSMTSDIFFDNFIICADRRIVDDWANDGWGLKKAADGAAEPGVVGQMIEAAEER
430 440 450 460 470 480
490 500 510 520 530 540
pF1KB3 PWLWVVYILTVALPVFLVILFCCSGKKQTSGMEYKKTDAPQPDVKEEEEEKEEEKDKGDE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 PWLWVVYILTVALPVFLVILFCCSGKKQTSGMEYKKTDAPQPDVKEEEEEKEEEKDKGDE
490 500 510 520 530 540
550 560 570 580 590
pF1KB3 EEEGEEKLEEKQKSDAEEDGGTVSQEEEDRKPKAEEDEILNRSPRNRKPRRE
::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 EEEGEEKLEEKQKSDAEEDGGTVSQEEEDRKPKAEEDEILNRSPRNRKPRRE
550 560 570 580 590
>>CCDS3751.1 CLGN gene_id:1047|Hs108|chr4 (610 aa)
initn: 1793 init1: 1670 opt: 2442 Z-score: 1737.7 bits: 331.6 E(32554): 1.8e-90
Smith-Waterman score: 2446; 58.7% identity (79.6% similar) in 593 aa overlap (5-579:7-585)
10 20 30 40 50
pF1KB3 MEGKWLLCMLLVLGTAIVEAHDGHDDDVIDIEDDLDDVIEEVEDSKPDTTAPPSSPKV
:: :. :.. . .: : ::: .:... ::. :.. : ..
CCDS37 MHFQAFWL-CLGLLFISINAEFMD---DDVET--EDFEENSEEI-----DVNESELSSEI
10 20 30 40
60 70 80 90 100 110
pF1KB3 TYKAPVPTGEVYFADSFDRGTLSGWILSKAKKDDTDDEIAKYDGKWEVEEMKESKLPGDK
::.: : ::::::..:: : :.::.:::::::: :.::. :::.::.::.::...:::.
CCDS37 KYKTPQPIGEVYFAETFDSGRLAGWVLSKAKKDDMDEEISIYDGRWEIEELKENQVPGDR
50 60 70 80 90 100
120 130 140 150 160 170
pF1KB3 GLVLMSRAKHHAISAKLNKPFLFDTKPLIVQYEVNFQNGIECGGAYVKLLSKTPELNLDQ
:::: :::::::::: : :::.: ::::::::::::.::.:::::.:::. : .: :..
CCDS37 GLVLKSRAKHHAISAVLAKPFIFADKPLIVQYEVNFQDGIDCGGAYIKLLADTDDLILEN
110 120 130 140 150 160
180 190 200 210 220 230
pF1KB3 FHDKTPYTIMFGPDKCGEDYKLHFIFRHKNPKTGIYEEKHAKRPDADLKTYFTDKKTHLY
:.::: : :::::::::::::::::::::.::::..:::::: ::.::: .:::.:::::
CCDS37 FYDKTSYIIMFGPDKCGEDYKLHFIFRHKHPKTGVFEEKHAKPPDVDLKKFFTDRKTHLY
170 180 190 200 210 220
240 250 260 270 280 290
pF1KB3 TLILNPDNSFEILVDQSVVNSGNLLNDMTPPVNPSREIEDPEDRKPEDWDERPKIPDPEA
::..:::..::.::::.:::.:.::.:..::..: .:::::.:.:::.:::: ::::: :
CCDS37 TLVMNPDDTFEVLVDQTVVNKGSLLEDVVPPIKPPKEIEDPNDKKPEEWDERAKIPDPSA
230 240 250 260 270 280
300 310 320 330 340 350
pF1KB3 VKPDDWDEDAPAKIPDEEATKPEGWLDDEPEYVPDPDAEKPEDWDEDMDGEWEAPQIANP
:::.::::. ::.: : ..:: :::::::...:::.::::.::.:: ::::::::: ::
CCDS37 VKPEDWDESEPAQIEDSSVVKPAGWLDDEPKFIPDPNAEKPDDWNEDTDGEWEAPQILNP
290 300 310 320 330 340
360 370 380 390 400 410
pF1KB3 RCESAPGCGVWQRPVIDNPNYKGKWKPPMIDNPSYQGIWKPRKIPNPDFFEDLEPFRMTP
:. ::: :. :.::::.::: :.::..:::.:::::.::::::::.::: .:: .:
CCDS37 ACR--IGCGEWKPPMIDNPKYKGVWRPPLVDNPNYQGIWSPRKIPNPDYFEDDHPFLLTS
350 360 370 380 390 400
420 430 440 450 460 470
pF1KB3 FSAIGLELWSMTSDIFFDNFIICADRRIVDDWANDGWGLKKAADGAAEPGVVGQMIEAAE
:::.:::::::::::.:::::::......: :: ::: : .: .:::. :.. :::
CCDS37 FSALGLELWSMTSDIYFDNFIICSEKEVADHWAADGWRWKIMIANANKPGVLKQLMAAAE
410 420 430 440 450 460
480 490 500 510 520 530
pF1KB3 ERPWLWVVYILTVALPVFLVILFC--CSGKKQTSGMEYKKTDAPQPDVK-----EEEEEK
.::::..:..:...:. :. :: . ::. . :::::: :..: ::.:::
CCDS37 GHPWLWLIYLVTAGVPIALITSFCWPRKVKKKHKDTEYKKTDICIPQTKGVLEQEEKEEK
470 480 490 500 510 520
540 550 560 570 580
pF1KB3 ---------EEEKDKGDEEE-EGEEKLEEKQKSDAEEDGGTVSQEEEDRKPKA-EEDEIL
:::: ..: : : ::. : ..::. :: .::: ... :. :::.
CCDS37 AALEKPMDLEEEKKQNDGEMLEKEEESEPEEKSE-EEIEIIEGQEESNQSNKSGSEDEMK
530 540 550 560 570 580
590
pF1KB3 NRSPRNRKPRRE
CCDS37 EADESTGSGDGPIKSVRKRRVRKD
590 600 610
>>CCDS12288.1 CALR gene_id:811|Hs108|chr19 (417 aa)
initn: 494 init1: 326 opt: 472 Z-score: 349.7 bits: 74.2 E(32554): 3.7e-13
Smith-Waterman score: 917; 36.5% identity (57.8% similar) in 490 aa overlap (69-555:21-408)
40 50 60 70 80 90
pF1KB3 IEEVEDSKPDTTAPPSSPKVTYKAPVPTGEVYFADSF--DRGTLSGWILSKAKKDDTDDE
::: ..: : : :: :: :.: .
CCDS12 MLLSVPLLLGLLGLAVAEPAVYFKEQFLDGDGWTSRWIESKHKSDF--GK
10 20 30 40
100 110 120 130 140 150
pF1KB3 IAKYDGKWEVEEMKESKLPGDKGLVLMSRAKHHAISAKLNKPFLFDTKPLIVQYEVNFQN
.. .::. .: : :::: . :. .:.::.. .:: . :.::. :. ..
CCDS12 FVLSSGKFYGDEEK------DKGLQTSQDARFYALSASF-EPFSNKGQTLVVQFTVKHEQ
50 60 70 80 90 100
160 170 180 190 200 210
pF1KB3 GIECGGAYVKLLSKTPELNLDQFHDKTPYTIMFGPDKCGEDYK-LHFIFRHKNPKTGIYE
.:.:::.::::. .. :. ..: . :.:::::: :: : .: :: .:. .. : .
CCDS12 NIDCGGGYVKLFPNS--LDQTDMHGDSEYNIMFGPDICGPGTKKVHVIFNYKGKNVLINK
110 120 130 140 150
220 230 240 250 260 270
pF1KB3 EKHAKRPDADLKTYFTDKKTHLYTLILNPDNSFEILVDQSVVNSGNLLNDMTPPVNPSRE
. . : :. :::::::. :::..:. .:.: :.::.: .: : ..
CCDS12 DIRCK----------DDEFTHLYTLIVRPDNTYEVKIDNSQVESGSLEDDWD--FLPPKK
160 170 180 190 200
280 290 300 310 320 330
pF1KB3 IEDPEDRKPEDWDERPKIPDPEAVKPDDWDEDAPAKIPDEEATKPEGWLDDEPEYVPDPD
:.::. :::::::: :: :: ::.:::. ::..::::
CCDS12 IKDPDASKPEDWDERAKIDDPTDSKPEDWDK---------------------PEHIPDPD
210 220 230 240
340 350 360 370 380 390
pF1KB3 AEKPEDWDEDMDGEWEAPQIANPRCESAPGCGVWQRPVIDNPNYKGKWKPPMIDNPSYQG
:.:::::::.:::::: : :::.::.:::.::: .::::.:.:
CCDS12 AKKPEDWDEEMDGEWE------P-------------PVIQNPEYKGEWKPRQIDNPDYKG
250 260 270 280
400 410 420 430 440 450
pF1KB3 IWKPRKIPNPDFFEDLEPFRMTPFSAIGLELWSMTSDIFFDNFIICADRRIVDDWANDGW
: .: ::.. : . . :...::.::.. : .::::.: :. .....:. :
CCDS12 TWIHPEIDNPEYSPDPSIYAYDNFGVLGLDLWQVKSGTIFDNFLITNDEAYAEEFGNETW
290 300 310 320 330 340
460 470 480 490 500 510
pF1KB3 GLKKAADGAAEPGVVGQMIEAAEERPWLWVVYILTVALPVFLVILFCCSGKKQTSGMEYK
:. :::. :: . .:. : :.. . :
CCDS12 GVTKAAEK--------QMKDKQDEEQRL----------------------KEEEEDKKRK
350 360 370
520 530 540 550 560 570
pF1KB3 KTDAPQPDVKEEEEEKEEEKDKGDEEEEGEEKLEEKQKSDAEEDGGTVSQEEEDRKPKAE
. .:: :.::...:: ::.:: :: :: .. :
CCDS12 E--------EEEAEDKEDDEDK-DEDEEDEEDKEEDEEEDVPGQAKDEL
380 390 400 410
580 590
pF1KB3 EDEILNRSPRNRKPRRE
592 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 13:28:28 2016 done: Sat Nov 5 13:28:29 2016
Total Scan time: 2.690 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]