FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8938, 290 aa
  1>>>pF1KB8938 290 - 290 aa - 290 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 9.9088+/-0.000997; mu= 0.1460+/- 0.061
 mean_var=457.7721+/-92.865, 0's: 0 Z-trim(117.6): 80 B-trim: 0 in 0/54
 Lambda= 0.059945
 statistics sampled from 18327 (18404) to 18327 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.834), E-opt: 0.2 (0.565), width: 16
 Scan time: 3.020

The best scores are:                                       opt  bits E(32554)
CCDS2268.1 HOXD8 gene_id:3234|Hs108|chr2           ( 290) 2069 192.1 4.1e-49
CCDS56148.1 HOXD8 gene_id:3234|Hs108|chr2          ( 289) 2052 190.6 1.1e-48
CCDS11533.1 HOXB8 gene_id:3218|Hs108|chr17         ( 243)  788  81.2 8.3e-16
CCDS8870.1 HOXC8 gene_id:3224|Hs108|chr12          ( 242)  784  80.8 1.1e-15
CCDS56149.1 HOXD8 gene_id:3234|Hs108|chr2          ( 106)  717  74.6 3.6e-14

>>CCDS2268.1 HOXD8 gene_id:3234|Hs108|chr2 (290 aa)
 initn: 2069 init1: 2069 opt: 2069 Z-score: 995.1 bits: 192.1 E(32554): 4.1e-49
Smith-Waterman score: 2069; 100.0% identity (100.0% similar) in 290 aa overlap (1-290:1-290)

               10        20        30        40        50        60
pF1KB8 MSSYFVNPLYSKYKAAAAAAAAAGEAINPTYYDCHFAPEVGGRHAAAAAALQLYGNSAAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 MSSYFVNPLYSKYKAAAAAAAAAGEAINPTYYDCHFAPEVGGRHAAAAAALQLYGNSAAG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 FPHAPPQAHAHPHPSPPPSGTGCGGREGRGQEYFHPGGGSPAAAYQAAPPPPPHPPPPPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 FPHAPPQAHAHPHPSPPPSGTGCGGREGRGQEYFHPGGGSPAAAYQAAPPPPPHPPPPPP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 PPPCGGIACHGEPAKFYGYDNLQRQPIFTTQQEAELVQYPDCKSSSGNIGEDPDHLNQSS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 PPPCGGIACHGEPAKFYGYDNLQRQPIFTTQQEAELVQYPDCKSSSGNIGEDPDHLNQSS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 SPSQMFPWMRPQAAPGRRRGRQTYSRFQTLELEKEFLFNPYLTRKRRIEVSHALALTERQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 SPSQMFPWMRPQAAPGRRRGRQTYSRFQTLELEKEFLFNPYLTRKRRIEVSHALALTERQ
              190       200       210       220       230       240

              250       260       270       280       290
pF1KB8 VKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETKKEAQELEEDRAEGLTN
       ::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 VKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETKKEAQELEEDRAEGLTN
              250       260       270       280       290

>>CCDS56148.1 HOXD8 gene_id:3234|Hs108|chr2 (289 aa)
 initn: 1436 init1: 1436 opt: 2052 Z-score: 987.2 bits: 190.6 E(32554): 1.1e-48
Smith-Waterman score: 2052; 99.7% identity (99.7% similar) in 290 aa overlap (1-290:1-289)

               10        20        30        40        50        60
pF1KB8 MSSYFVNPLYSKYKAAAAAAAAAGEAINPTYYDCHFAPEVGGRHAAAAAALQLYGNSAAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 MSSYFVNPLYSKYKAAAAAAAAAGEAINPTYYDCHFAPEVGGRHAAAAAALQLYGNSAAG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 FPHAPPQAHAHPHPSPPPSGTGCGGREGRGQEYFHPGGGSPAAAYQAAPPPPPHPPPPPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 FPHAPPQAHAHPHPSPPPSGTGCGGREGRGQEYFHPGGGSPAAAYQAAPPPPPHPPPPPP
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 PPPCGGIACHGEPAKFYGYDNLQRQPIFTTQQEAELVQYPDCKSSSGNIGEDPDHLNQSS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 PPPCGGIACHGEPAKFYGYDNLQRQPIFTTQQEAELVQYPDCKSSSGNIGEDPDHLNQSS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 SPSQMFPWMRPQAAPGRRRGRQTYSRFQTLELEKEFLFNPYLTRKRRIEVSHALALTERQ
       ::::::::::::: ::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 SPSQMFPWMRPQA-PGRRRGRQTYSRFQTLELEKEFLFNPYLTRKRRIEVSHALALTERQ
              190        200       210       220       230

              250       260       270       280       290
pF1KB8 VKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETKKEAQELEEDRAEGLTN
       ::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 VKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETKKEAQELEEDRAEGLTN
     240       250       260       270       280

>>CCDS11533.1 HOXB8 gene_id:3218|Hs108|chr17 (243 aa)
 initn: 941 init1: 699 opt: 788 Z-score: 397.2 bits: 81.2 E(32554): 8.3e-16
Smith-Waterman score: 878; 52.1% identity (68.8% similar) in 288 aa overlap (1-287:1-235)

               10        20        30        40        50        60
pF1KB8 MSSYFVNPLYSKYKAAAAAAAAAGEAINPTYYDCHFAPEVGGRHAAAAAALQLYGNSAAG
       ::::::: :.::::.        ::.. :.:::: :: ..::: ..      .:: :..:
CCDS11 MSSYFVNSLFSKYKT--------GESLRPNYYDCGFAQDLGGRPTV------VYGPSSGG
               10                20        30              40

               70        80        90       100       110       120
pF1KB8 FPHAPPQAHAHPHPSPPPSGTGCGGREGRGQEYFHPGGGSPAAAYQAAPPPPPHPPPPPP
                .  :::             . ::..:  ..  .: ::  :
CCDS11 ---------SFQHPS-------------QIQEFYHGPSSLSTAPYQQNP-----------
                  50                     60        70

              130       140       150       160        170
pF1KB8 PPPCGGIACHGEPAKFYGYDNLQRQPIFTTQQEAELVQYPDCK-SSSGNIGEDPDHLNQS
          :. .::::.:..::::: :::: .: .: . .:::: ::: ......::. .  .::
CCDS11 ---CA-VACHGDPGNFYGYDPLQRQSLFGAQ-DPDLVQYADCKLAAASGLGEEAEGSEQS
                80        90       100        110       120

     180       190       200       210       220       230
pF1KB8 SSPSQMFPWMRPQAAPGRRRGRQTYSRFQTLELEKEFLFNPYLTRKRRIEVSHALALTER
        ::.:.::::::::: :::::::::::.:::::::::::::::::::::::::::.::::
CCDS11 PSPTQLFPWMRPQAAAGRRRGRQTYSRYQTLELEKEFLFNPYLTRKRRIEVSHALGLTER
      130       140       150       160       170       180

     240       250       260       270       280       290
pF1KB8 QVKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETKKEAQELEEDRAEGLTN
       ::::::::::::::::::::::: :. : .. : :.. ..  :   ::
CCDS11 QVKIWFQNRRMKWKKENNKDKFPSSKCEQEELE-KQKLERAPEAADEGDAQKGDKK
      190       200       210       220        230       240

>>CCDS8870.1 HOXC8 gene_id:3224|Hs108|chr12 (242 aa)
 initn: 885 init1: 646 opt: 784 Z-score: 395.4 bits: 80.8 E(32554): 1.1e-15
Smith-Waterman score: 917; 55.2% identity (68.8% similar) in 288 aa overlap (1-286:1-238)

               10        20        30        40        50        60
pF1KB8 MSSYFVNPLYSKYKAAAAAAAAAGEAINPTYYDCHFAPEVGGRHAAAAAALQLYGNSAAG
       :::::::::.:::::        ::...:.::::.:   ::  :: . .     :.:: :
CCDS88 MSSYFVNPLFSKYKA--------GESLEPAYYDCRFPQSVGRSHALVYGP----GGSAPG
               10                20        30        40

               70        80        90        100       110
pF1KB8 FPHAPPQAHAHPHPSPPPSGTGCGGREGRGQEYFHPG-GGSPAAAYQAAPPPPPHPPPPP
       : ::     .: :                 :..:: : .: .. .::  :
CCDS88 FQHA-----SH-HV----------------QDFFHHGTSGISNSGYQQNP----------
       50                              60        70

     120       130       140       150       160        170
pF1KB8 PPPPCGGIACHGEPAKFYGYDNLQRQPIFTTQQEAELVQYPDCKSSSG-NIGEDPDHLNQ
           :. ..:::. .:::::. : :: .. .:::: .:::::::::.. : .:   ::::
CCDS88 ----CS-LSCHGDASKFYGYEALPRQSLYGAQQEASVVQYPDCKSSANTNSSEGQGHLNQ
              80        90       100       110       120       130

      180       190       200       210       220       230
pF1KB8 SSSPSQMFPWMRPQAAPGRRRGRQTYSRFQTLELEKEFLFNPYLTRKRRIEVSHALALTE
       .:::: :::::::.: :::: :::::::.:::::::::::::::::::::::::::.:::
CCDS88 NSSPSLMFPWMRPHA-PGRRSGRQTYSRYQTLELEKEFLFNPYLTRKRRIEVSHALGLTE
             140        150       160       170       180       190

      240       250       260       270       280       290
pF1KB8 RQVKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETKKEAQELEEDRAEGLTN
       ::::::::::::::::::::::.: .:.: :  :  .: .: ::.. :
CCDS88 RQVKIWFQNRRMKWKKENNKDKLPGARDEEKVEEEGNEEEEKEEEEKEENKD
              200       210       220       230       240

>>CCDS56149.1 HOXD8 gene_id:3234|Hs108|chr2 (106 aa)
 initn: 717 init1: 717 opt: 717 Z-score: 367.9 bits: 74.6 E(32554): 3.6e-14
Smith-Waterman score: 717; 100.0% identity (100.0% similar) in 106 aa overlap (185-290:1-106)

          160       170       180       190       200       210
pF1KB8 ELVQYPDCKSSSGNIGEDPDHLNQSSSPSQMFPWMRPQAAPGRRRGRQTYSRFQTLELEK
                                     ::::::::::::::::::::::::::::::
CCDS56                               MFPWMRPQAAPGRRRGRQTYSRFQTLELEK
                                             10        20        30

          220       230       240       250       260       270
pF1KB8 EFLFNPYLTRKRRIEVSHALALTERQVKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 EFLFNPYLTRKRRIEVSHALALTERQVKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETK
               40        50        60        70        80        90

          280       290
pF1KB8 KEAQELEEDRAEGLTN
       ::::::::::::::::
CCDS56 KEAQELEEDRAEGLTN
              100

290 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 16:35:44 2016 done: Fri Nov 4 16:35:44 2016
 Total Scan time: 3.020 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]