FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8938, 290 aa
1>>>pF1KB8938 290 - 290 aa - 290 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 9.9088+/-0.000997; mu= 0.1460+/- 0.061
mean_var=457.7721+/-92.865, 0's: 0 Z-trim(117.6): 80 B-trim: 0 in 0/54
Lambda= 0.059945
statistics sampled from 18327 (18404) to 18327 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.834), E-opt: 0.2 (0.565), width: 16
Scan time: 3.020
The best scores are: opt bits E(32554)
CCDS2268.1 HOXD8 gene_id:3234|Hs108|chr2 ( 290) 2069 192.1 4.1e-49
CCDS56148.1 HOXD8 gene_id:3234|Hs108|chr2 ( 289) 2052 190.6 1.1e-48
CCDS11533.1 HOXB8 gene_id:3218|Hs108|chr17 ( 243) 788 81.2 8.3e-16
CCDS8870.1 HOXC8 gene_id:3224|Hs108|chr12 ( 242) 784 80.8 1.1e-15
CCDS56149.1 HOXD8 gene_id:3234|Hs108|chr2 ( 106) 717 74.6 3.6e-14
>>CCDS2268.1 HOXD8 gene_id:3234|Hs108|chr2 (290 aa)
initn: 2069 init1: 2069 opt: 2069 Z-score: 995.1 bits: 192.1 E(32554): 4.1e-49
Smith-Waterman score: 2069; 100.0% identity (100.0% similar) in 290 aa overlap (1-290:1-290)
10 20 30 40 50 60
pF1KB8 MSSYFVNPLYSKYKAAAAAAAAAGEAINPTYYDCHFAPEVGGRHAAAAAALQLYGNSAAG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 MSSYFVNPLYSKYKAAAAAAAAAGEAINPTYYDCHFAPEVGGRHAAAAAALQLYGNSAAG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 FPHAPPQAHAHPHPSPPPSGTGCGGREGRGQEYFHPGGGSPAAAYQAAPPPPPHPPPPPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 FPHAPPQAHAHPHPSPPPSGTGCGGREGRGQEYFHPGGGSPAAAYQAAPPPPPHPPPPPP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB8 PPPCGGIACHGEPAKFYGYDNLQRQPIFTTQQEAELVQYPDCKSSSGNIGEDPDHLNQSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 PPPCGGIACHGEPAKFYGYDNLQRQPIFTTQQEAELVQYPDCKSSSGNIGEDPDHLNQSS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB8 SPSQMFPWMRPQAAPGRRRGRQTYSRFQTLELEKEFLFNPYLTRKRRIEVSHALALTERQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 SPSQMFPWMRPQAAPGRRRGRQTYSRFQTLELEKEFLFNPYLTRKRRIEVSHALALTERQ
190 200 210 220 230 240
250 260 270 280 290
pF1KB8 VKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETKKEAQELEEDRAEGLTN
::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 VKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETKKEAQELEEDRAEGLTN
250 260 270 280 290
>>CCDS56148.1 HOXD8 gene_id:3234|Hs108|chr2 (289 aa)
initn: 1436 init1: 1436 opt: 2052 Z-score: 987.2 bits: 190.6 E(32554): 1.1e-48
Smith-Waterman score: 2052; 99.7% identity (99.7% similar) in 290 aa overlap (1-290:1-289)
10 20 30 40 50 60
pF1KB8 MSSYFVNPLYSKYKAAAAAAAAAGEAINPTYYDCHFAPEVGGRHAAAAAALQLYGNSAAG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 MSSYFVNPLYSKYKAAAAAAAAAGEAINPTYYDCHFAPEVGGRHAAAAAALQLYGNSAAG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 FPHAPPQAHAHPHPSPPPSGTGCGGREGRGQEYFHPGGGSPAAAYQAAPPPPPHPPPPPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 FPHAPPQAHAHPHPSPPPSGTGCGGREGRGQEYFHPGGGSPAAAYQAAPPPPPHPPPPPP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB8 PPPCGGIACHGEPAKFYGYDNLQRQPIFTTQQEAELVQYPDCKSSSGNIGEDPDHLNQSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 PPPCGGIACHGEPAKFYGYDNLQRQPIFTTQQEAELVQYPDCKSSSGNIGEDPDHLNQSS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB8 SPSQMFPWMRPQAAPGRRRGRQTYSRFQTLELEKEFLFNPYLTRKRRIEVSHALALTERQ
::::::::::::: ::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 SPSQMFPWMRPQA-PGRRRGRQTYSRFQTLELEKEFLFNPYLTRKRRIEVSHALALTERQ
190 200 210 220 230
250 260 270 280 290
pF1KB8 VKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETKKEAQELEEDRAEGLTN
::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 VKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETKKEAQELEEDRAEGLTN
240 250 260 270 280
>>CCDS11533.1 HOXB8 gene_id:3218|Hs108|chr17 (243 aa)
initn: 941 init1: 699 opt: 788 Z-score: 397.2 bits: 81.2 E(32554): 8.3e-16
Smith-Waterman score: 878; 52.1% identity (68.8% similar) in 288 aa overlap (1-287:1-235)
10 20 30 40 50 60
pF1KB8 MSSYFVNPLYSKYKAAAAAAAAAGEAINPTYYDCHFAPEVGGRHAAAAAALQLYGNSAAG
::::::: :.::::. ::.. :.:::: :: ..::: .. .:: :..:
CCDS11 MSSYFVNSLFSKYKT--------GESLRPNYYDCGFAQDLGGRPTV------VYGPSSGG
10 20 30 40
70 80 90 100 110 120
pF1KB8 FPHAPPQAHAHPHPSPPPSGTGCGGREGRGQEYFHPGGGSPAAAYQAAPPPPPHPPPPPP
. ::: . ::..: .. .: :: :
CCDS11 ---------SFQHPS-------------QIQEFYHGPSSLSTAPYQQNP-----------
50 60 70
130 140 150 160 170
pF1KB8 PPPCGGIACHGEPAKFYGYDNLQRQPIFTTQQEAELVQYPDCK-SSSGNIGEDPDHLNQS
:. .::::.:..::::: :::: .: .: . .:::: ::: ......::. . .::
CCDS11 ---CA-VACHGDPGNFYGYDPLQRQSLFGAQ-DPDLVQYADCKLAAASGLGEEAEGSEQS
80 90 100 110 120
180 190 200 210 220 230
pF1KB8 SSPSQMFPWMRPQAAPGRRRGRQTYSRFQTLELEKEFLFNPYLTRKRRIEVSHALALTER
::.:.::::::::: :::::::::::.:::::::::::::::::::::::::::.::::
CCDS11 PSPTQLFPWMRPQAAAGRRRGRQTYSRYQTLELEKEFLFNPYLTRKRRIEVSHALGLTER
130 140 150 160 170 180
240 250 260 270 280 290
pF1KB8 QVKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETKKEAQELEEDRAEGLTN
::::::::::::::::::::::: :. : .. : :.. .. : ::
CCDS11 QVKIWFQNRRMKWKKENNKDKFPSSKCEQEELE-KQKLERAPEAADEGDAQKGDKK
190 200 210 220 230 240
>>CCDS8870.1 HOXC8 gene_id:3224|Hs108|chr12 (242 aa)
initn: 885 init1: 646 opt: 784 Z-score: 395.4 bits: 80.8 E(32554): 1.1e-15
Smith-Waterman score: 917; 55.2% identity (68.8% similar) in 288 aa overlap (1-286:1-238)
10 20 30 40 50 60
pF1KB8 MSSYFVNPLYSKYKAAAAAAAAAGEAINPTYYDCHFAPEVGGRHAAAAAALQLYGNSAAG
:::::::::.::::: ::...:.::::.: :: :: . . :.:: :
CCDS88 MSSYFVNPLFSKYKA--------GESLEPAYYDCRFPQSVGRSHALVYGP----GGSAPG
10 20 30 40
70 80 90 100 110
pF1KB8 FPHAPPQAHAHPHPSPPPSGTGCGGREGRGQEYFHPG-GGSPAAAYQAAPPPPPHPPPPP
: :: .: : :..:: : .: ..:: :
CCDS88 FQHA-----SH-HV----------------QDFFHHGTSGISNSGYQQNP----------
50 60 70
120 130 140 150 160 170
pF1KB8 PPPPCGGIACHGEPAKFYGYDNLQRQPIFTTQQEAELVQYPDCKSSSG-NIGEDPDHLNQ
:. ..:::. .:::::. : :: .. .:::: .:::::::::.. : .: ::::
CCDS88 ----CS-LSCHGDASKFYGYEALPRQSLYGAQQEASVVQYPDCKSSANTNSSEGQGHLNQ
80 90 100 110 120 130
180 190 200 210 220 230
pF1KB8 SSSPSQMFPWMRPQAAPGRRRGRQTYSRFQTLELEKEFLFNPYLTRKRRIEVSHALALTE
.:::: :::::::.: :::: :::::::.:::::::::::::::::::::::::::.:::
CCDS88 NSSPSLMFPWMRPHA-PGRRSGRQTYSRYQTLELEKEFLFNPYLTRKRRIEVSHALGLTE
140 150 160 170 180 190
240 250 260 270 280 290
pF1KB8 RQVKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETKKEAQELEEDRAEGLTN
::::::::::::::::::::::.: .:.: : : .: .: ::.. :
CCDS88 RQVKIWFQNRRMKWKKENNKDKLPGARDEEKVEEEGNEEEEKEEEEKEENKD
200 210 220 230 240
>>CCDS56149.1 HOXD8 gene_id:3234|Hs108|chr2 (106 aa)
initn: 717 init1: 717 opt: 717 Z-score: 367.9 bits: 74.6 E(32554): 3.6e-14
Smith-Waterman score: 717; 100.0% identity (100.0% similar) in 106 aa overlap (185-290:1-106)
160 170 180 190 200 210
pF1KB8 ELVQYPDCKSSSGNIGEDPDHLNQSSSPSQMFPWMRPQAAPGRRRGRQTYSRFQTLELEK
::::::::::::::::::::::::::::::
CCDS56 MFPWMRPQAAPGRRRGRQTYSRFQTLELEK
10 20 30
220 230 240 250 260 270
pF1KB8 EFLFNPYLTRKRRIEVSHALALTERQVKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 EFLFNPYLTRKRRIEVSHALALTERQVKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETK
40 50 60 70 80 90
280 290
pF1KB8 KEAQELEEDRAEGLTN
::::::::::::::::
CCDS56 KEAQELEEDRAEGLTN
100
290 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 16:35:44 2016 done: Fri Nov 4 16:35:44 2016
Total Scan time: 3.020 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]