FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0143, 310 aa
1>>>pF1KE0143 310 - 310 aa - 310 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.6410+/-0.000765; mu= 14.7736+/- 0.046
mean_var=82.7092+/-16.248, 0's: 0 Z-trim(109.9): 28 B-trim: 0 in 0/50
Lambda= 0.141025
statistics sampled from 11173 (11197) to 11173 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.708), E-opt: 0.2 (0.344), width: 16
Scan time: 2.800
The best scores are: opt bits E(32554)
CCDS78409.1 AUH gene_id:549|Hs108|chr9 ( 310) 2003 416.9 1e-116
CCDS6689.1 AUH gene_id:549|Hs108|chr9 ( 339) 1084 229.9 2.1e-60
CCDS55600.1 ECHDC2 gene_id:55268|Hs108|chr1 ( 292) 753 162.5 3.5e-40
CCDS571.1 ECHDC2 gene_id:55268|Hs108|chr1 ( 261) 698 151.3 7.5e-37
CCDS72794.1 ECHDC2 gene_id:55268|Hs108|chr1 ( 244) 462 103.3 2e-22
CCDS7681.1 ECHS1 gene_id:1892|Hs108|chr10 ( 290) 365 83.6 2e-16
>>CCDS78409.1 AUH gene_id:549|Hs108|chr9 (310 aa)
initn: 2003 init1: 2003 opt: 2003 Z-score: 2208.9 bits: 416.9 E(32554): 1e-116
Smith-Waterman score: 2003; 100.0% identity (100.0% similar) in 310 aa overlap (1-310:1-310)
10 20 30 40 50 60
pF1KE0 MAAAVAAAPGALGSLHAGGARLVAACSAWLCPGLRLPGSLAGRRAGPAIWAQGWVPAAGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 MAAAVAAAPGALGSLHAGGARLVAACSAWLCPGLRLPGSLAGRRAGPAIWAQGWVPAAGG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 PAPKRGYSSEMKTEDELRVRHLEEENRGIVVLGINRAYGKNSLSKNLIKMLSKAVDALKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 PAPKRGYSSEMKTEDELRVRHLEEENRGIVVLGINRAYGKNSLSKNLIKMLSKAVDALKS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 DKKVRTIIIRSEVPGIFCAANLPVPTIAAIDGLALGGGLELALACDIRVAASSAKMGLVE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 DKKVRTIIIRSEVPGIFCAANLPVPTIAAIDGLALGGGLELALACDIRVAASSAKMGLVE
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 TKLAIIPGGGGTQRLPRAIGMSLAKELIFSARVLDGKEAKAVGLISHVLEQNQEGDAAYR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 TKLAIIPGGGGTQRLPRAIGMSLAKELIFSARVLDGKEAKAVGLISHVLEQNQEGDAAYR
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE0 KALDLAREFLPQGPVAMRVAKLAINQGMEVDLVTGLAIEEACYAQTIPTKDRLEGLLAFK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 KALDLAREFLPQGPVAMRVAKLAINQGMEVDLVTGLAIEEACYAQTIPTKDRLEGLLAFK
250 260 270 280 290 300
310
pF1KE0 EKRPPRYKGE
::::::::::
CCDS78 EKRPPRYKGE
310
>>CCDS6689.1 AUH gene_id:549|Hs108|chr9 (339 aa)
initn: 1989 init1: 1084 opt: 1084 Z-score: 1197.9 bits: 229.9 E(32554): 2.1e-60
Smith-Waterman score: 1935; 91.4% identity (91.4% similar) in 339 aa overlap (1-310:1-339)
10 20 30 40 50 60
pF1KE0 MAAAVAAAPGALGSLHAGGARLVAACSAWLCPGLRLPGSLAGRRAGPAIWAQGWVPAAGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 MAAAVAAAPGALGSLHAGGARLVAACSAWLCPGLRLPGSLAGRRAGPAIWAQGWVPAAGG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 PAPKRGYSSEMKTEDELRVRHLEEENRGIVVLGINRAYGKNSLSKNLIKMLSKAVDALKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 PAPKRGYSSEMKTEDELRVRHLEEENRGIVVLGINRAYGKNSLSKNLIKMLSKAVDALKS
70 80 90 100 110 120
130 140 150
pF1KE0 DKKVRTIIIRSEVPGIFCA-----------------------------ANLPVPTIAAID
::::::::::::::::::: ::::::::::::
CCDS66 DKKVRTIIIRSEVPGIFCAGADLKERAKMSSSEVGPFVSKIRAVINDIANLPVPTIAAID
130 140 150 160 170 180
160 170 180 190 200 210
pF1KE0 GLALGGGLELALACDIRVAASSAKMGLVETKLAIIPGGGGTQRLPRAIGMSLAKELIFSA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 GLALGGGLELALACDIRVAASSAKMGLVETKLAIIPGGGGTQRLPRAIGMSLAKELIFSA
190 200 210 220 230 240
220 230 240 250 260 270
pF1KE0 RVLDGKEAKAVGLISHVLEQNQEGDAAYRKALDLAREFLPQGPVAMRVAKLAINQGMEVD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 RVLDGKEAKAVGLISHVLEQNQEGDAAYRKALDLAREFLPQGPVAMRVAKLAINQGMEVD
250 260 270 280 290 300
280 290 300 310
pF1KE0 LVTGLAIEEACYAQTIPTKDRLEGLLAFKEKRPPRYKGE
:::::::::::::::::::::::::::::::::::::::
CCDS66 LVTGLAIEEACYAQTIPTKDRLEGLLAFKEKRPPRYKGE
310 320 330
>>CCDS55600.1 ECHDC2 gene_id:55268|Hs108|chr1 (292 aa)
initn: 871 init1: 738 opt: 753 Z-score: 834.8 bits: 162.5 E(32554): 3.5e-40
Smith-Waterman score: 837; 47.9% identity (75.4% similar) in 280 aa overlap (61-310:13-292)
40 50 60 70 80
pF1KE0 CPGLRLPGSLAGRRAGPAIWAQGWVPAAGGPAPKRGYSSEMKTE-DELRVRHLEEENRGI
: :: .:. . .:..:: : ..::
CCDS55 MLRVLCLLRPWRPLRARGCASDGAAGGSEIQVRALAGPDQGI
10 20 30 40
90 100 110 120 130
pF1KE0 VVLGINRAYGKNSLSKNLIKMLSKAVDALKSDKKVRTIIIRSEVPGIFCA----------
. . .:: ..:.:.. ... : ... :. :..::....:: : :.:::
CCDS55 TEILMNRPSARNALGNVFVSELLETLAQLREDRQVRVLLFRSGVKGVFCAGADLKEREQM
50 60 70 80 90 100
140 150 160 170 180
pF1KE0 -------------------ANLPVPTIAAIDGLALGGGLELALACDIRVAASSAKMGLVE
: .:.:::::.::.:::::::::::::.::::::: :::.:
CCDS55 SEAEVGVFVQRLRGLMNDIAAFPAPTIAAMDGFALGGGLELALACDLRVAASSAVMGLIE
110 120 130 140 150 160
190 200 210 220 230 240
pF1KE0 TKLAIIPGGGGTQRLPRAIGMSLAKELIFSARVLDGKEAKAVGLISHVLEQNQEGDAAYR
: ...::.:::::::: .:..:::::::..: :.: ::...::..:.. ::.::::::.
CCDS55 TTRGLLPGAGGTQRLPRCLGVALAKELIFTGRRLSGTEAHVLGLVNHAVAQNEEGDAAYQ
170 180 190 200 210 220
250 260 270 280 290 300
pF1KE0 KALDLAREFLPQGPVAMRVAKLAINQGMEVDLVTGLAIEEACYAQTIPTKDRLEGLLAFK
.: ::.:.:::.:.:.:..:.::..: :::...:.::: ::::.:::.:::::. ::.
CCDS55 RARALAQEILPQAPIAVRLGKVAIDRGTEVDIASGMAIEGMCYAQNIPTRDRLEGMAAFR
230 240 250 260 270 280
310
pF1KE0 EKRPPRYKGE
::: :.. :.
CCDS55 EKRTPKFVGK
290
>>CCDS571.1 ECHDC2 gene_id:55268|Hs108|chr1 (261 aa)
initn: 753 init1: 584 opt: 698 Z-score: 775.1 bits: 151.3 E(32554): 7.5e-37
Smith-Waterman score: 698; 45.2% identity (76.2% similar) in 252 aa overlap (61-310:13-261)
40 50 60 70 80
pF1KE0 CPGLRLPGSLAGRRAGPAIWAQGWVPAAGGPAPKRGYSSEMKTE-DELRVRHLEEENRGI
: :: .:. . .:..:: : ..::
CCDS57 MLRVLCLLRPWRPLRARGCASDGAAGGSEIQVRALAGPDQGI
10 20 30 40
90 100 110 120 130 140
pF1KE0 VVLGINRAYGKNSLSKNLIKMLSKAVDALKSDKKVRTIIIRSEVPGIFCA-ANLPVPTIA
. . .:: ..:.:.. ... : ... :. :..::....:: : :.::: :.:
CCDS57 TEILMNRPSARNALGNVFVSELLETLAQLREDRQVRVLLFRSGVKGVFCAGADLKEREQM
50 60 70 80 90 100
150 160 170 180 190 200
pF1KE0 AIDGLALGGGLELALACDIRVAASSAKMGLVETKLAIIPGGGGTQRLPRAIGMSLAKELI
. ... .: :: :::: :::.:: ...::.:::::::: .:..::::::
CCDS57 SEAEVGVFVQRLRGLMNDI---ASSAVMGLIETTRGLLPGAGGTQRLPRCLGVALAKELI
110 120 130 140 150
210 220 230 240 250 260
pF1KE0 FSARVLDGKEAKAVGLISHVLEQNQEGDAAYRKALDLAREFLPQGPVAMRVAKLAINQGM
:..: :.: ::...::..:.. ::.::::::..: ::.:.:::.:.:.:..:.::..:
CCDS57 FTGRRLSGTEAHVLGLVNHAVAQNEEGDAAYQRARALAQEILPQAPIAVRLGKVAIDRGT
160 170 180 190 200 210
270 280 290 300 310
pF1KE0 EVDLVTGLAIEEACYAQTIPTKDRLEGLLAFKEKRPPRYKGE
:::...:.::: ::::.:::.:::::. ::.::: :.. :.
CCDS57 EVDIASGMAIEGMCYAQNIPTRDRLEGMAAFREKRTPKFVGK
220 230 240 250 260
>>CCDS72794.1 ECHDC2 gene_id:55268|Hs108|chr1 (244 aa)
initn: 644 init1: 335 opt: 462 Z-score: 516.0 bits: 103.3 E(32554): 2e-22
Smith-Waterman score: 593; 42.1% identity (69.8% similar) in 252 aa overlap (61-310:13-244)
40 50 60 70 80
pF1KE0 CPGLRLPGSLAGRRAGPAIWAQGWVPAAGGPAPKRGYSSEMKTE-DELRVRHLEEENRGI
: :: .:. . .:..:: : ..::
CCDS72 MLRVLCLLRPWRPLRARGCASDGAAGGSEIQVRALAGPDQGI
10 20 30 40
90 100 110 120 130 140
pF1KE0 VVLGINRAYGKNSLSKNLIKMLSKAVDALKSDKKVRTIIIRSEVPGIFCA-ANLPVPTIA
. . .:: ..:.:.. ... : ... :. :..::....:: : :.::: :.:
CCDS72 TEILMNRPSARNALGNVFVSELLETLAQLREDRQVRVLLFRSGVKGVFCAGADLKEREQM
50 60 70 80 90 100
150 160 170 180 190 200
pF1KE0 AIDGLALGGGLELALACDIRVAASSAKMGLVETKLAIIPGGGGTQRLPRAIGMSLAKELI
. ... .: :: :::: :::.:: ...::.:::::::: .:..::::::
CCDS72 SEAEVGVFVQRLRGLMNDI---ASSAVMGLIETTRGLLPGAGGTQRLPRCLGVALAKELI
110 120 130 140 150
210 220 230 240 250 260
pF1KE0 FSARVLDGKEAKAVGLISHVLEQNQEGDAAYRKALDLAREFLPQGPVAMRVAKLAINQGM
:..: :.: ::...::..:.. ::.::::::..: ::.:.:::
CCDS72 FTGRRLSGTEAHVLGLVNHAVAQNEEGDAAYQRARALAQEILPQ----------------
160 170 180 190 200
270 280 290 300 310
pF1KE0 EVDLVTGLAIEEACYAQTIPTKDRLEGLLAFKEKRPPRYKGE
::...:.::: ::::.:::.:::::. ::.::: :.. :.
CCDS72 -VDIASGMAIEGMCYAQNIPTRDRLEGMAAFREKRTPKFVGK
210 220 230 240
>>CCDS7681.1 ECHS1 gene_id:1892|Hs108|chr10 (290 aa)
initn: 305 init1: 254 opt: 365 Z-score: 408.3 bits: 83.6 E(32554): 2e-16
Smith-Waterman score: 365; 36.1% identity (70.5% similar) in 166 aa overlap (145-310:129-290)
120 130 140 150 160 170
pF1KE0 VDALKSDKKVRTIIIRSEVPGIFCAANLPVPTIAAIDGLALGGGLELALACDIRVAASSA
:.:::..: :.::: :::. ::: :. .:
CCDS76 DIKEMQNLSFQDCYSSKFLKHWDHLTQVKKPVIAAVNGYAFGGGCELAMMCDIIYAGEKA
100 110 120 130 140 150
180 190 200 210 220 230
pF1KE0 KMGLVETKLAIIPGGGGTQRLPRAIGMSLAKELIFSARVLDGKEAKAVGLISHVLEQNQE
... : .. :::.:::::: ::.: ::: :..... .....:: .::.:..
CCDS76 QFAQPEILIGTIPGAGGTQRLTRAVGKSLAMEMVLTGDRISAQDAKQAGLVSKICPV---
160 170 180 190 200 210
240 250 260 270 280 290
pF1KE0 GDAAYRKALDLAREFLPQGPVAMRVAKLAINQGMEVDLVTGLAIEEACYAQTIPTKDRLE
.. ..:.. :... .. ... .:: ..: ..:. :. : .:. . .:. : :: :
CCDS76 -ETLVEEAIQCAEKIASNSKIVVAMAKESVNAAFEMTLTEGSKLEKKLFYSTFATDDRKE
220 230 240 250 260 270
300 310
pF1KE0 GLLAFKEKRPPRYKGE
:. :: ::: .: .
CCDS76 GMTAFVEKRKANFKDQ
280 290
310 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 00:29:34 2016 done: Fri Nov 4 00:29:35 2016
Total Scan time: 2.800 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]