FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8908, 242 aa
  1>>>pF1KB8908 242 - 242 aa - 242 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.3652+/-0.00077; mu= 6.0974+/- 0.047
 mean_var=161.6228+/-32.779, 0's: 0 Z-trim(114.0): 159 B-trim: 0 in 0/51
 Lambda= 0.100884
 statistics sampled from 14405 (14582) to 14405 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.787), E-opt: 0.2 (0.448), width: 16
 Scan time: 2.190

The best scores are:                                 opt  bits E(32554)
CCDS8870.1 HOXC8 gene_id:3224|Hs108|chr12   ( 242) 1678 255.3 2.7e-68
CCDS11533.1 HOXB8 gene_id:3218|Hs108|chr17  ( 243)  997 156.2 1.9e-38
CCDS56148.1 HOXD8 gene_id:3234|Hs108|chr2   ( 289)  796 127.0 1.4e-29
CCDS2268.1 HOXD8 gene_id:3234|Hs108|chr2    ( 290)  784 125.2 4.6e-29
CCDS56149.1 HOXD8 gene_id:3234|Hs108|chr2   ( 106)  536  88.8 1.6e-18
CCDS11531.1 HOXB6 gene_id:3216|Hs108|chr17  ( 224)  452  76.8 1.3e-14
CCDS5408.1 HOXA7 gene_id:3204|Hs108|chr7    ( 230)  435  74.4 7.5e-14
CCDS11532.1 HOXB7 gene_id:3217|Hs108|chr17  ( 217)  417  71.7 4.4e-13
CCDS8871.1 HOXC6 gene_id:3223|Hs108|chr12   ( 235)  417  71.7 4.7e-13
CCDS41792.1 HOXC6 gene_id:3223|Hs108|chr12  ( 153)  410  70.6 6.9e-13
CCDS5407.1 HOXA6 gene_id:3203|Hs108|chr7    ( 233)  408  70.4 1.2e-12

>>CCDS8870.1 HOXC8 gene_id:3224|Hs108|chr12 (242 aa)
 initn: 1678 init1: 1678 opt: 1678 Z-score: 1339.6 bits: 255.3 E(32554): 2.7e-68
Smith-Waterman score: 1678; 100.0% identity (100.0% similar) in 242 aa overlap (1-242:1-242)

               10        20        30        40        50        60
pF1KB8 MSSYFVNPLFSKYKAGESLEPAYYDCRFPQSVGRSHALVYGPGGSAPGFQHASHHVQDFF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 MSSYFVNPLFSKYKAGESLEPAYYDCRFPQSVGRSHALVYGPGGSAPGFQHASHHVQDFF
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 HHGTSGISNSGYQQNPCSLSCHGDASKFYGYEALPRQSLYGAQQEASVVQYPDCKSSANT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 HHGTSGISNSGYQQNPCSLSCHGDASKFYGYEALPRQSLYGAQQEASVVQYPDCKSSANT
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 NSSEGQGHLNQNSSPSLMFPWMRPHAPGRRSGRQTYSRYQTLELEKEFLFNPYLTRKRRI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 NSSEGQGHLNQNSSPSLMFPWMRPHAPGRRSGRQTYSRYQTLELEKEFLFNPYLTRKRRI
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 EVSHALGLTERQVKIWFQNRRMKWKKENNKDKLPGARDEEKVEEEGNEEEEKEEEEKEEN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 EVSHALGLTERQVKIWFQNRRMKWKKENNKDKLPGARDEEKVEEEGNEEEEKEEEEKEEN
              190       200       210       220       230       240

pF1KB8 KD
       ::
CCDS88 KD

>>CCDS11533.1 HOXB8 gene_id:3218|Hs108|chr17 (243 aa)
 initn: 947 init1: 645 opt: 997 Z-score: 803.9 bits: 156.2 E(32554): 1.9e-38
Smith-Waterman score: 997; 62.0% identity (81.6% similar) in 245 aa overlap (1-241:1-239)

10 20 30 40 50
pF1KB8 MSSYFVNPLFSKYKAGESLEPAYYDCRFPQSVGRSHALVYGP--GGSAPGFQHASHHVQD
::::::: ::::::.::::.: :::: : :..: ..:::: ::: ::: :. .:.
CCDS11 MSSYFVNSLFSKYKTGESLRPNYYDCGFAQDLGGRPTVVYGPSSGGS---FQHPSQ-IQE
10 20 30 40 50

60 70 80 90 100 110
pF1KB8 FFHHGTSGISNSGYQQNPCSLSCHGDASKFYGYEALPRQSLYGAQQEASVVQYPDCKSSA
:.: : :..:.. ::::::...:::: ..::::. : ::::.::: . ..::: ::: .:
CCDS11 FYH-GPSSLSTAPYQQNPCAVACHGDPGNFYGYDPLQRQSLFGAQ-DPDLVQYADCKLAA
60 70 80 90 100 110

120 130 140 150 160 170
pF1KB8 NTNSSEGQGHLNQNSSPSLMFPWMRPHAP-GRRSGRQTYSRYQTLELEKEFLFNPYLTRK
.. .: .:. ::. .::::::.: ::: ::::::::::::::::::::::::::
CCDS11 ASGLGEEAEGSEQSPSPTQLFPWMRPQAAAGRRRGRQTYSRYQTLELEKEFLFNPYLTRK
120 130 140 150 160 170

180 190 200 210 220 230
pF1KB8 RRIEVSHALGLTERQVKIWFQNRRMKWKKENNKDKLPGAR-DEEKVEEEGNEEEEKEEEE
:::::::::::::::::::::::::::::::::::.:... ..:..:.. :. . .:
CCDS11 RRIEVSHALGLTERQVKIWFQNRRMKWKKENNKDKFPSSKCEQEELEKQKLERAPEAADE
180 190 200 210 220 230

240
pF1KB8 KEENKD
. .:
CCDS11 GDAQKGDKK
240

>>CCDS56148.1 HOXD8 gene_id:3234|Hs108|chr2 (289 aa)
 initn: 893 init1: 601 opt: 796 Z-score: 644.8 bits: 127.0 E(32554): 1.4e-29
Smith-Waterman score: 866; 54.3% identity (69.1% similar) in 265 aa overlap (15-238:23-285)

10 20 30 40
pF1KB8 MSSYFVNPLFSKYKAGESLEPAYYDCRFPQSVGRSHALVYGP----GGSAPG
:::...:.::::.: :: :: . . :.:: :
CCDS56 MSSYFVNPLYSKYKAAAAAAAAAGEAINPTYYDCHFAPEVGGRHAAAAAALQLYGNSAAG
10 20 30 40 50 60

50 60 70
pF1KB8 FQHASHHV----------------------QDFFHHGTSGISNSGYQQNP----------
: :: .. :..:: : .: ..:: :
CCDS56 FPHAPPQAHAHPHPSPPPSGTGCGGREGRGQEYFHPG-GGSPAAAYQAAPPPPPHPPPPP
70 80 90 100 110

80 90 100 110 120 130
pF1KB8 ----CS-LSCHGDASKFYGYEALPRQSLYGAQQEASVVQYPDCKSSANTNSSEGQGHLNQ
:. ..:::. .:::::. : :: .. .:::: .:::::::::.. : .: ::::
CCDS56 PPPPCGGIACHGEPAKFYGYDNLQRQPIFTTQQEAELVQYPDCKSSSG-NIGEDPDHLNQ
120 130 140 150 160 170

140 150 160 170 180 190
pF1KB8 NSSPSLMFPWMRPHAPGRRSGRQTYSRYQTLELEKEFLFNPYLTRKRRIEVSHALGLTER
.:::: :::::::.::::: :::::::.:::::::::::::::::::::::::::.::::
CCDS56 SSSPSQMFPWMRPQAPGRRRGRQTYSRFQTLELEKEFLFNPYLTRKRRIEVSHALALTER
180 190 200 210 220 230

200 210 220 230 240
pF1KB8 QVKIWFQNRRMKWKKENNKDKLPGARDEEKVEEEGNEEEEKEEEEKEENKD
:::::::::::::::::::::.: .:.: : : .: .: ::.. :
CCDS56 QVKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETKKEAQELEEDRAEGLTN
240 250 260 270 280

>>CCDS2268.1 HOXD8 gene_id:3234|Hs108|chr2 (290 aa)
 initn: 885 init1: 646 opt: 784 Z-score: 635.3 bits: 125.2 E(32554): 4.6e-29
Smith-Waterman score: 854; 54.1% identity (68.8% similar) in 266 aa overlap (15-238:23-286)

10 20 30 40
pF1KB8 MSSYFVNPLFSKYKAGESLEPAYYDCRFPQSVGRSHALVYGP----GGSAPG
:::...:.::::.: :: :: . . :.:: :
CCDS22 MSSYFVNPLYSKYKAAAAAAAAAGEAINPTYYDCHFAPEVGGRHAAAAAALQLYGNSAAG
10 20 30 40 50 60

50 60 70
pF1KB8 FQHASHHV----------------------QDFFHHGTSGISNSGYQQNP----------
: :: .. :..:: : .: ..:: :
CCDS22 FPHAPPQAHAHPHPSPPPSGTGCGGREGRGQEYFHPG-GGSPAAAYQAAPPPPPHPPPPP
70 80 90 100 110

80 90 100 110 120 130
pF1KB8 ----CS-LSCHGDASKFYGYEALPRQSLYGAQQEASVVQYPDCKSSANTNSSEGQGHLNQ
:. ..:::. .:::::. : :: .. .:::: .:::::::::.. : .: ::::
CCDS22 PPPPCGGIACHGEPAKFYGYDNLQRQPIFTTQQEAELVQYPDCKSSSG-NIGEDPDHLNQ
120 130 140 150 160 170

140 150 160 170 180 190
pF1KB8 NSSPSLMFPWMRPHA-PGRRSGRQTYSRYQTLELEKEFLFNPYLTRKRRIEVSHALGLTE
.:::: :::::::.: :::: :::::::.:::::::::::::::::::::::::::.:::
CCDS22 SSSPSQMFPWMRPQAAPGRRRGRQTYSRFQTLELEKEFLFNPYLTRKRRIEVSHALALTE
180 190 200 210 220 230

200 210 220 230 240
pF1KB8 RQVKIWFQNRRMKWKKENNKDKLPGARDEEKVEEEGNEEEEKEEEEKEENKD
::::::::::::::::::::::.: .:.: : : .: .: ::.. :
CCDS22 RQVKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETKKEAQELEEDRAEGLTN
240 250 260 270 280 290

>>CCDS56149.1 HOXD8 gene_id:3234|Hs108|chr2 (106 aa)
 initn: 540 init1: 483 opt: 536 Z-score: 446.1 bits: 88.8 E(32554): 1.6e-18
Smith-Waterman score: 536; 79.4% identity (89.2% similar) in 102 aa overlap (138-238:1-102)

110 120 130 140 150 160
pF1KB8 VVQYPDCKSSANTNSSEGQGHLNQNSSPSLMFPWMRPHA-PGRRSGRQTYSRYQTLELEK
:::::::.: :::: :::::::.:::::::
CCDS56 MFPWMRPQAAPGRRRGRQTYSRFQTLELEK
10 20 30

170 180 190 200 210 220
pF1KB8 EFLFNPYLTRKRRIEVSHALGLTERQVKIWFQNRRMKWKKENNKDKLPGARDEEKVEEEG
::::::::::::::::::::.:::::::::::::::::::::::::.: .:.: : :
CCDS56 EFLFNPYLTRKRRIEVSHALALTERQVKIWFQNRRMKWKKENNKDKFPVSRQEVKDGETK
40 50 60 70 80 90

230 240
pF1KB8 NEEEEKEEEEKEENKD
.: .: ::.. :
CCDS56 KEAQELEEDRAEGLTN
100

>>CCDS11531.1 HOXB6 gene_id:3216|Hs108|chr17 (224 aa)
 initn: 443 init1: 342 opt: 452 Z-score: 375.7 bits: 76.8 E(32554): 1.3e-14
Smith-Waterman score: 452; 39.8% identity (60.2% similar) in 241 aa overlap (1-224:1-224)

10 20 30 40 50
pF1KB8 MSSYFVNPLFSKYKAG--ESL--EPAYYDCRFPQSVGRSHALVYGPG-GSAPGFQHASHH
::::::: : :. ::. . :. . . . : . :::: :. :: .:..
CCDS11 MSSYFVNSTFPVTLASGQESFLGQLPLYSSGYADPL-RHYPAPYGPGPGQDKGFATSSYY
10 20 30 40 50

60 70 80 90 100 110
pF1KB8 VQDFFHHGTSGISNSGY-QQNPCSLSCHGDASKFYGYEALPRQSLYGAQQEASVVQYPDC
...:: . ::. .: : :: : .: ::... :
CCDS11 PP----------AGGGYGRAAPCD---YGPAPAFY-REKESACALSGADEQPPFHPEPRK
60 70 80 90 100

120 130 140 150 160
pF1KB8 KSSANTNSSEGQGHLNQNSSPSLMFPWMR--------PHAPGRRSGRQTYSRYQTLELEK
.. :. .: :. . .. :.: ..:::. .:. : :::::.:::::::::
CCDS11 SDCAQDKSVFGETEEQKCSTP--VYPWMQRMNSCNSSSFGPSGRRGRQTYTRYQTLELEK
110 120 130 140 150 160

170 180 190 200 210 220
pF1KB8 EFLFNPYLTRKRRIEVSHALGLTERQVKIWFQNRRMKWKKEN---NKDKLPGARDEEKVE
:: .: ::::.::::..::: :::::.::::::::::::::. . ..: . ..:::
CCDS11 EFHYNRYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKESKLLSASQLSAEEEEEKQA
170 180 190 200 210 220

230 240
pF1KB8 EEGNEEEEKEEEEKEENKD
:
CCDS11 E

>>CCDS5408.1 HOXA7 gene_id:3204|Hs108|chr7 (230 aa)
 initn: 537 init1: 389 opt: 435 Z-score: 362.1 bits: 74.4 E(32554): 7.5e-14
Smith-Waterman score: 508; 43.9% identity (61.3% similar) in 253 aa overlap (2-236:3-230)

10 20 30 40 50
pF1KB8 MSSYFVNPLFSKYKAGESL----EPAYYDCRFPQSVGRSHALVYGPGGSAPGFQHASHH
:::.:: ::::: :: :: ::. .: : . :: :: :..: ::
CCDS54 MSSSYYVNALFSKYTAGASLFQNAEPT--SCSFAPNSQRSG---YGAGAGA----FAST-
10 20 30 40 50

60 70 80 90 100 110
pF1KB8 VQDFFHHGTSGISNSGYQQNPCSLSCHGDASKFYGYEALPRQSLYGAQQEASVVQYPDCK
: ... :: :.: . : .: .. :: :: : :. .. . :
CCDS54 VPGLYN------VNSPLYQSPFA-SGYGLGADAYG--NLPCASY--DQNIPGLCS--DLA
60 70 80 90

120 130 140 150 160 170
pF1KB8 SSANTNSSEGQGHLNQNSSPSLMFPWMRPHAPGRRSGRQTYSRYQTLELEKEFLFNPYLT
..: ...:: : ... . .:::: .: :. :::::.::::::::::: :: :::
CCDS54 KGACDKTDEGALHGAAEANFRI-YPWMRSSGPDRKRGRQTYTRYQTLELEKEFHFNRYLT
100 110 120 130 140 150

180 190 200 210 220
pF1KB8 RKRRIEVSHALGLTERQVKIWFQNRRMKWKKENNKDKLPGARDE--------------EK
:.::::..::: :::::.::::::::::::::. ::. : : .:
CCDS54 RRRRIEIAHALCLTERQIKIWFQNRRMKWKKEH-KDEGPTAAAAPEGAVPSAAATAAADK
160 170 180 190 200 210

230 240
pF1KB8 VEEEGNEEEEKEEEEKEENKD
..:: ..:::..:::
CCDS54 ADEEDDDEEEEDEEE
220 230

>>CCDS11532.1 HOXB7 gene_id:3217|Hs108|chr17 (217 aa)
 initn: 432 init1: 382 opt: 417 Z-score: 348.3 bits: 71.7 E(32554): 4.4e-13
Smith-Waterman score: 467; 40.7% identity (61.3% similar) in 243 aa overlap (1-230:1-217)

10 20 30 40 50
pF1KB8 MSS-YFVNPLFSKYKAGESLE-----PAYYDCRFPQSVGRSHALVYGPGGSAPGFQHASH
::: :..: ::::: :. :. : .: : .. : :: .::. .: ::
CCDS11 MSSLYYANTLFSKYPASSSVFATGAFPEQTSCAFASNPQRPG---YG-AGSGASFA-AS-
10 20 30 40 50

60 70 80 90 100
pF1KB8 HVQDFFHHG-------TSGISNSGYQQNPCSLSCHGDASKFYGYEALPRQSLYGAQQEAS
.: .. : ..:. .:: .: :.. : . : .:.: :.
CCDS11 -MQGLYPGGGGMAGQSAAGVYAAGYGLEPSSFNMH--CAPF-------EQNLSGVC----
60 70 80 90 100

110 120 130 140 150 160
pF1KB8 VVQYPDCKSSANTNSSEGQGHLNQNSSPSLMFPWMRPHAPGRRSGRQTYSRYQTLELEKE
: ...: . . .. : .:. . .:::: . :. :::::.::::::::::
CCDS11 ----PGDSAKAAGAKEQRDSDLAAESNFRI-YPWMRSSGTDRKRGRQTYTRYQTLELEKE
110 120 130 140 150

170 180 190 200 210 220
pF1KB8 FLFNPYLTRKRRIEVSHALGLTERQVKIWFQNRRMKWKKENNKDKLPGARDEEKVEEEGN
: .: ::::.::::..:.: :::::.::::::::::::::: : ::. ....: : .
CCDS11 FHYNRYLTRRRRIEIAHTLCLTERQIKIWFQNRRMKWKKEN-KTAGPGTTGQDRAEAEEE
160 170 180 190 200 210

230 240
pF1KB8 EEEEKEEEEKEENKD
:::
CCDS11 EEE

>>CCDS8871.1 HOXC6 gene_id:3223|Hs108|chr12 (235 aa)
 initn: 442 init1: 359 opt: 417 Z-score: 347.8 bits: 71.7 E(32554): 4.7e-13
Smith-Waterman score: 431; 36.4% identity (58.6% similar) in 261 aa overlap (1-242:1-235)

10 20 30 40 50
pF1KB8 MSSYFVNPLFSKYKAG-ESLEP-------AYYDCRFPQSVGRSHAL--VYGPGGSAPGFQ
:.:::.:: .: . :: ... : :: : .. : . : .:. .: .
CCDS88 MNSYFTNPSLSCHLAGGQDVLPNVALNSTAYDPVRHFSTYGAAVAQNRIYSTPFYSPQEN
10 20 30 40 50 60

60 70 80 90 100 110
pF1KB8 HASHHVQDFFHHGTSGISNSGYQQNPCSLSCHGDASKFYGYEALPRQSLYGAQQEASVVQ
. . . .: ::: ::.. .: ::. : . ..:..:
CCDS88 VVFSSSRGPYDYG----SNSFYQEKDMLSNC--------------RQNTLGHNTQTSIAQ
70 80 90 100

120 130 140 150 160
pF1KB8 YPDCKSSANTNSSEGQGHLNQNSSPSLMFPWMR---PHA-----PGRRSGRQTYSRYQTL
. .: .:. ..... ..:::. :. :: ::: :::::::
CCDS88 --------DFSSEQGRTAPQDQKASIQIYPWMQRMNSHSGVGYGADRRRGRQIYSRYQTL
110 120 130 140 150

170 180 190 200 210 220
pF1KB8 ELEKEFLFNPYLTRKRRIEVSHALGLTERQVKIWFQNRRMKWKKENN-KDKLPGARDEEK
:::::: :: ::::.::::...:: :::::.::::::::::::::.: . : :.
CCDS88 ELEKEFHFNRYLTRRRRIEIANALCLTERQIKIWFQNRRMKWKKESNLTSTLSGGGGGAT
160 170 180 190 200 210

230 240
pF1KB8 VEEEGNEEEEKEEEEKEENKD
.. :..::..:: :.:..:.
CCDS88 ADSLGGKEEKREETEEEKQKE
220 230

>>CCDS41792.1 HOXC6 gene_id:3223|Hs108|chr12 (153 aa)
 initn: 398 init1: 359 opt: 410 Z-score: 344.8 bits: 70.6 E(32554): 6.9e-13
Smith-Waterman score: 410; 46.8% identity (67.9% similar) in 156 aa overlap (96-242:6-153)

70 80 90 100 110 120
pF1KB8 GISNSGYQQNPCSLSCHGDASKFYGYEALPRQSLYGAQQEASVVQYPDCKSSANTNSSEG
::. : . ..:..: : .: . .. .
CCDS41 MLSNCRQNTLGHNTQTSIAQ--DFSSEQGRTAPQD
10 20 30

130 140 150 160 170
pF1KB8 QGHLNQNSSPSLMFPWMR---PHA-----PGRRSGRQTYSRYQTLELEKEFLFNPYLTRK
: : ..:::. :. :: ::: ::::::::::::: :: ::::.
CCDS41 QKASIQ------IYPWMQRMNSHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLTRR
40 50 60 70 80

180 190 200 210 220 230
pF1KB8 RRIEVSHALGLTERQVKIWFQNRRMKWKKENN-KDKLPGARDEEKVEEEGNEEEEKEEEE
::::...:: :::::.::::::::::::::.: . : :. .. :..::..:: :
CCDS41 RRIEIANALCLTERQIKIWFQNRRMKWKKESNLTSTLSGGGGGATADSLGGKEEKREETE
90 100 110 120 130 140

240
pF1KB8 KEENKD
.:..:.
CCDS41 EEKQKE
150

242 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 16:26:45 2016 done: Fri Nov 4 16:26:46 2016
 Total Scan time: 2.190 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]