FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0184, 286 aa
1>>>pF1KE0184 286 - 286 aa - 286 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.3212+/-0.000806; mu= 15.0256+/- 0.048
mean_var=62.0949+/-12.382, 0's: 0 Z-trim(106.9): 17 B-trim: 0 in 0/50
Lambda= 0.162759
statistics sampled from 9259 (9264) to 9259 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.668), E-opt: 0.2 (0.285), width: 16
Scan time: 2.450
The best scores are:                                   opt  bits E(32554)
CCDS5521.1 GBAS gene_id:2631|Hs108|chr7        ( 286) 1991 475.9 1.5e-134
CCDS13860.1 NIPSNAP1 gene_id:8508|Hs108|chr22  ( 284) 1383 333.1 1.4e-91
CCDS56488.1 GBAS gene_id:2631|Hs108|chr7       ( 247)  966 235.2 3.7e-62
CCDS6761.1 NIPSNAP3B gene_id:55335|Hs108|chr9  ( 247)  253  67.7 9.3e-12
CCDS6760.1 NIPSNAP3A gene_id:25934|Hs108|chr9  ( 247)  244  65.6 4e-11
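
The best-scores table above has a regular layout (identifier, description, library sequence length in parentheses, opt score, bit score, E-value), so it can be read programmatically. The following is a minimal sketch, not part of the FASTA package; the names `HIT_RE` and `parse_hit_line` are hypothetical, and the pattern assumes the column layout seen in this report only.

```python
import re

# Assumed layout of one table line (taken from this report, not from a spec):
#   <id> <description> ( <length>) <opt> <bits> <E-value>
HIT_RE = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>\S+)\s*$"
)

def parse_hit_line(line):
    """Return a dict for one best-scores line, or None if it does not match."""
    m = HIT_RE.match(line.strip())
    if m is None:
        return None
    return {
        "id": m.group("id"),
        "description": m.group("desc"),
        "length": int(m.group("length")),
        "opt": int(m.group("opt")),
        "bits": float(m.group("bits")),
        "evalue": float(m.group("evalue")),
    }

# The header and the five hit lines copied from the table above.
SAMPLE = """\
The best scores are: opt bits E(32554)
CCDS5521.1 GBAS gene_id:2631|Hs108|chr7 ( 286) 1991 475.9 1.5e-134
CCDS13860.1 NIPSNAP1 gene_id:8508|Hs108|chr22 ( 284) 1383 333.1 1.4e-91
CCDS56488.1 GBAS gene_id:2631|Hs108|chr7 ( 247) 966 235.2 3.7e-62
CCDS6761.1 NIPSNAP3B gene_id:55335|Hs108|chr9 ( 247) 253 67.7 9.3e-12
CCDS6760.1 NIPSNAP3A gene_id:25934|Hs108|chr9 ( 247) 244 65.6 4e-11
"""

# Lines that do not match (such as the header) are skipped.
hits = [h for h in map(parse_hit_line, SAMPLE.splitlines()) if h is not None]

for h in hits:
    print(h["id"], h["length"], h["opt"], h["bits"], h["evalue"])
```

Because the pattern separates fields on runs of whitespace, it should accept both single-spaced and column-padded versions of the same lines.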
>>CCDS5521.1 GBAS gene_id:2631|Hs108|chr7 (286 aa)
initn: 1991 init1: 1991 opt: 1991 Z-score: 2529.2 bits: 475.9 E(32554): 1.5e-134
Smith-Waterman score: 1991; 100.0% identity (100.0% similar) in 286 aa overlap (1-286:1-286)
               10        20        30        40        50        60
pF1KE0 MAARVLRARGAAWAGGLLQRAAPCSLLPRLRTWTSSSNRSREDSWLKSLFVRKVDPRKDA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 MAARVLRARGAAWAGGLLQRAAPCSLLPRLRTWTSSSNRSREDSWLKSLFVRKVDPRKDA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 HSNLLAKKETSNLYKLQFHNVKPECLEAYNKICQEVLPKIHEDKHYPCTLVGTWNTWYGE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 HSNLLAKKETSNLYKLQFHNVKPECLEAYNKICQEVLPKIHEDKHYPCTLVGTWNTWYGE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 QDQAVHLWRYEGGYPALTEVMNKLRENKEFLEFRKARSDMLLSRKNQLLLEFSFWNEPVP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 QDQAVHLWRYEGGYPALTEVMNKLRENKEFLEFRKARSDMLLSRKNQLLLEFSFWNEPVP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 RSGPNIYELRSYQLRPGTMIEWGNYWARAIRFRQDGNEAVGGFFSQIGQLYMVHHLWAYR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 RSGPNIYELRSYQLRPGTMIEWGNYWARAIRFRQDGNEAVGGFFSQIGQLYMVHHLWAYR
              190       200       210       220       230       240

              250       260       270       280
pF1KE0 DLQTREDIRNAAWHKHGWEELVYYTVPLIQEMESRIMIPLKTSPLQ
       ::::::::::::::::::::::::::::::::::::::::::::::
CCDS55 DLQTREDIRNAAWHKHGWEELVYYTVPLIQEMESRIMIPLKTSPLQ
              250       260       270       280
>>CCDS13860.1 NIPSNAP1 gene_id:8508|Hs108|chr22 (284 aa)
initn: 1383 init1: 1383 opt: 1383 Z-score: 1757.7 bits: 333.1 E(32554): 1.4e-91
Smith-Waterman score: 1383; 75.5% identity (93.2% similar) in 249 aa overlap (38-286:36-284)
10 20 30 40 50 60
pF1KE0 ARGAAWAGGLLQRAAPCSLLPRLRTWTSSSNRSREDSWLKSLFVRKVDPRKDAHSNLLAK
... : ::..::::.::::::::::.::.:
CCDS13 CSISVTARRLLGGPGPRAGDVASAAAARFYSKDNEGSWFRSLFVHKVDPRKDAHSTLLSK
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 KETSNLYKLQFHNVKPECLEAYNKICQEVLPKIHEDKHYPCTLVGTWNTWYGEQDQAVHL
::::::::.:::::::: :.:::.. . ::::.: :. :::.:::.::::::::::::::
CCDS13 KETSNLYKIQFHNVKPEYLDAYNSLTEAVLPKLHLDEDYPCSLVGNWNTWYGEQDQAVHL
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE0 WRYEGGYPALTEVMNKLRENKEFLEFRKARSDMLLSRKNQLLLEFSFWNEPVPRSGPNIY
::. :::::: . ::::..:::.::::. ::.:::::.::::::::::::: :: :::::
CCDS13 WRFSGGYPALMDCMNKLKNNKEYLEFRRERSQMLLSRRNQLLLEFSFWNEPQPRMGPNIY
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE0 ELRSYQLRPGTMIEWGNYWARAIRFRQDGNEAVGGFFSQIGQLYMVHHLWAYRDLQTRED
:::.:.:.::::::::: :::::..::...:::::::::::.::.:::::::.:::.::.
CCDS13 ELRTYKLKPGTMIEWGNNWARAIKYRQENQEAVGGFFSQIGELYVVHHLWAYKDLQSREE
190 200 210 220 230 240
250 260 270 280
pF1KE0 IRNAAWHKHGWEELVYYTVPLIQEMESRIMIPLKTSPLQ
:::::.:.::.: :::::::...:::::::::: ::::
CCDS13 TRNAAWRKRGWDENVYYTVPLVRHMESRIMIPLKISPLQ
250 260 270 280
>>CCDS56488.1 GBAS gene_id:2631|Hs108|chr7 (247 aa)
initn: 1467 init1: 966 opt: 966 Z-score: 1229.4 bits: 235.2 E(32554): 3.7e-62
Smith-Waterman score: 1420; 79.7% identity (83.6% similar) in 286 aa overlap (1-286:1-247)
10 20 30 40 50 60
pF1KE0 MAARVLRARGAAWAGGLLQRAAPCSLLPRLRTWTSSSNRSREDSWLKSLFVRKVDPRKDA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 MAARVLRARGAAWAGGLLQRAAPCSLLPRLRTWTSSSNRSREDSWLKSLFVRKVDPRKDA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 HSNLLAKKETSNLYKLQFHNVKPECLEAYNKICQEVLPKIHEDKHYPCTLVGTWNTWYGE
::::::::::::::::::. .: ::. . ::. ::: :.:
CCDS56 HSNLLAKKETSNLYKLQFK----RC-------CQR-FTKINT------TLV----LWWGL
70 80 90
130 140 150 160 170 180
pF1KE0 QDQAVHLWRYEGGYPALTEVMNKLRENKEFLEFRKARSDMLLSRKNQLLLEFSFWNEPVP
. :. . :.. ::::::::::::::::::::::::::::::::
CCDS56 GTR---------GMASRTKL--------EFLEFRKARSDMLLSRKNQLLLEFSFWNEPVP
100 110 120 130 140
190 200 210 220 230 240
pF1KE0 RSGPNIYELRSYQLRPGTMIEWGNYWARAIRFRQDGNEAVGGFFSQIGQLYMVHHLWAYR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 RSGPNIYELRSYQLRPGTMIEWGNYWARAIRFRQDGNEAVGGFFSQIGQLYMVHHLWAYR
150 160 170 180 190 200
250 260 270 280
pF1KE0 DLQTREDIRNAAWHKHGWEELVYYTVPLIQEMESRIMIPLKTSPLQ
::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 DLQTREDIRNAAWHKHGWEELVYYTVPLIQEMESRIMIPLKTSPLQ
210 220 230 240
>>CCDS6761.1 NIPSNAP3B gene_id:55335|Hs108|chr9 (247 aa)
initn: 190 init1: 115 opt: 253 Z-score: 324.6 bits: 67.7 E(32554): 9.3e-12
Smith-Waterman score: 253; 26.1% identity (55.4% similar) in 249 aa overlap (47-286:10-247)
20 30 40 50 60 70
pF1KE0 LLQRAAPCSLLPRLRTWTSSSNRSREDSWLKSLFVRKVDPRKDAHSNLLAKKETSNLYKL
:.: : . :. . .. ...:..
CCDS67 MLVLRSGLTKALASRTLAPQVCSSFATGPRQYDGTFYEF
10 20 30
80 90 100 110 120 130
pF1KE0 QFHNVKPECLEAYNKICQEVLPK-IHEDKHYPCTLVGTWNTWYG-EQDQAVHLWRYEGGY
. . .:: ..:. .: : : :: : ::: :.. .: . ... :.:.:.. .
CCDS67 RTYYLKPSNMNAF----MENLKKNIHLRTSYS-ELVGFWSVEFGGRTNKVFHIWKYDN-F
40 50 60 70 80 90
140 150 160 170 180
pF1KE0 PALTEVMNKLRENKEFLEFRKARSDMLLSRKNQLLLEFSF---WN--EPVPRSGPNIYEL
.:: . : . ::. : . :.: .. :... :. : :. : .:::
CCDS67 AHRAEVRKALANCKEWQEQSIIPN---LARIDKQETEITYLIPWSKLEKPPKEG--VYEL
100 110 120 130 140
190 200 210 220 230 240
pF1KE0 RSYQLRPGTMIEWGNYWARAIRFRQD-G-NEAVGGFFSQIGQLYMVHHLWAYRDLQTRED
.:..:: ::. . ::: . . : ...:: : .. :.: :: :: .. ..:
CCDS67 AVFQMKPGGPALWGDAFERAINAHVNLGYTKVVGVFHTEYGELNRVHVLWWNESADSRAA
150 160 170 180 190 200
250 260 270 280
pF1KE0 IRNAAWHKHGWEELVYYTVPLIQEMESRIMIPLKTSPLQ
:. . . : .: . ... ..:: . :::.
CCDS67 GRHKSHEDPRVVAAVRESVNYLVSQQNMLLIPASFSPLK
210 220 230 240
>>CCDS6760.1 NIPSNAP3A gene_id:25934|Hs108|chr9 (247 aa)
initn: 204 init1: 131 opt: 244 Z-score: 313.2 bits: 65.6 E(32554): 4e-11
Smith-Waterman score: 244; 24.5% identity (54.6% similar) in 249 aa overlap (47-286:10-247)
20 30 40 50 60 70
pF1KE0 LLQRAAPCSLLPRLRTWTSSSNRSREDSWLKSLFVRKVDPRKDAHSNLLAKKETSNLYKL
..: : . :. . .. . .:..
CCDS67 MLVLRSALTRALASRTLAPQMCSSFATGPRQYDGIFYEF
10 20 30
80 90 100 110 120 130
pF1KE0 QFHNVKP----ECLEAYNKICQEVLPKIHEDKHYPCTLVGTWNTWYG-EQDQAVHLWRYE
. . .:: : :: ..: . : : . ::: :.. .: ... . :.:.:.
CCDS67 RSYYLKPSKMNEFLENFEKNAH--LRTAHSE------LVGYWSVEFGGRMNTVFHIWKYD
40 50 60 70 80 90
140 150 160 170 180
pF1KE0 GGYPALTEVMNKLRENKEFLEFRKARSDMLLSRKNQLLLEFSFWN--EPVPRSGPNIYEL
. . ::: . : ..::. : . :...... . . : : :. : .:::
CCDS67 N-FAHRTEVRKALAKDKEWQEQFLIPNLALIDKQESEITYLVPWCKLEKPPKEG--VYEL
100 110 120 130 140
190 200 210 220 230 240
pF1KE0 RSYQLRPGTMIEWGNYWARAIRFRQD-G-NEAVGGFFSQIGQLYMVHHLWAYRDLQTRED
..:..:: ::. . ::.. . . : .. :: : .. : : :: :: .. ..:
CCDS67 ATFQMKPGGPALWGDAFKRAVHAHVNLGYTKLVGVFHTEYGALNRVHVLWWNESADSRAA
150 160 170 180 190 200
250 260 270 280
pF1KE0 IRNAAWHKHGWEELVYYTVPLIQEMESRIMIPLKTSPLQ
:. . . : .: . ... ..:: . :::.
CCDS67 GRHKSHEDPRVVAAVRESVNYLVSQQNMLLIPTSFSPLK
210 220 230 240
286 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 22:18:12 2016 done: Thu Nov 3 22:18:13 2016
Total Scan time: 2.450 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]
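
Each alignment record above carries a `Smith-Waterman score:` line giving the score, percent identity, percent similarity, overlap length, and the query:library coordinate ranges. A minimal sketch of how those fields might be extracted follows; `SW_RE` and `parse_sw_line` are hypothetical names, and the pattern is inferred from the lines in this report rather than from a format specification.

```python
import re

# Pattern inferred from lines such as:
#   Smith-Waterman score: 1383; 75.5% identity (93.2% similar) in 249 aa overlap (38-286:36-284)
SW_RE = re.compile(
    r"Smith-Waterman score:\s*(?P<score>\d+);\s*"
    r"(?P<ident>[\d.]+)% identity \((?P<sim>[\d.]+)% similar\) in "
    r"(?P<overlap>\d+) aa overlap "
    r"\((?P<qs>\d+)-(?P<qe>\d+):(?P<ss>\d+)-(?P<se>\d+)\)"
)

def parse_sw_line(line):
    """Return the fields of a Smith-Waterman summary line, or None."""
    m = SW_RE.search(line)
    if m is None:
        return None
    return {
        "score": int(m.group("score")),
        "percent_identity": float(m.group("ident")),
        "percent_similar": float(m.group("sim")),
        "overlap": int(m.group("overlap")),          # alignment columns, gaps included
        "query_range": (int(m.group("qs")), int(m.group("qe"))),
        "library_range": (int(m.group("ss")), int(m.group("se"))),
    }

# Two summary lines copied from the report above.
line_nipsnap1 = ("Smith-Waterman score: 1383; 75.5% identity (93.2% similar) "
                 "in 249 aa overlap (38-286:36-284)")
line_nipsnap3b = ("Smith-Waterman score: 253; 26.1% identity (55.4% similar) "
                  "in 249 aa overlap (47-286:10-247)")

rec1 = parse_sw_line(line_nipsnap1)
rec2 = parse_sw_line(line_nipsnap3b)
print(rec1)
print(rec2)
```

Note that the overlap length counts alignment columns, so for a gapped alignment such as the NIPSNAP3B hit it is longer than either coordinate range (240 query residues and 238 library residues in a 249-column overlap).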