FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8903, 235 aa
  1>>>pF1KB8903 235 - 235 aa - 235 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.4777+/-0.000707; mu= 14.2573+/- 0.043
 mean_var=86.3327+/-17.726, 0's: 0 Z-trim(110.3): 173 B-trim: 603 in 1/53
 Lambda= 0.138034
 statistics sampled from 11321 (11524) to 11321 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.719), E-opt: 0.2 (0.354), width: 16
 Scan time: 1.650

The best scores are:                               opt  bits E(32554)
CCDS8871.1 HOXC6 gene_id:3223|Hs108|chr12  ( 235) 1580 323.9 5.5e-89
CCDS41792.1 HOXC6 gene_id:3223|Hs108|chr12 ( 153) 1023 212.9 9.8e-56
CCDS11531.1 HOXB6 gene_id:3216|Hs108|chr17 ( 224)  632 135.1 3.6e-32
CCDS5407.1 HOXA6 gene_id:3203|Hs108|chr7   ( 233)  513 111.4 5e-25
CCDS5406.1 HOXA5 gene_id:3202|Hs108|chr7   ( 270)  467 102.3 3.2e-22
CCDS5408.1 HOXA7 gene_id:3204|Hs108|chr7   ( 230)  443  97.5 7.8e-21
CCDS11532.1 HOXB7 gene_id:3217|Hs108|chr17 ( 217)  433  95.5 3e-20
CCDS11530.1 HOXB5 gene_id:3215|Hs108|chr17 ( 269)  431  95.2 4.6e-20
CCDS8870.1 HOXC8 gene_id:3224|Hs108|chr12  ( 242)  417  92.3 3e-19
CCDS8872.1 HOXC5 gene_id:3222|Hs108|chr12  ( 222)  413  91.5 4.8e-19
CCDS8873.1 HOXC4 gene_id:3221|Hs108|chr12  ( 264)  407  90.4 1.3e-18
CCDS11529.1 HOXB4 gene_id:3214|Hs108|chr17 ( 251)  398  88.6 4.2e-18
CCDS2269.1 HOXD4 gene_id:3233|Hs108|chr2   ( 255)  394  87.8 7.3e-18
CCDS11533.1 HOXB8 gene_id:3218|Hs108|chr17 ( 243)  392  87.4 9.3e-18
CCDS5405.1 HOXA4 gene_id:3201|Hs108|chr7   ( 320)  388  86.7 2e-17
CCDS2268.1 HOXD8 gene_id:3234|Hs108|chr2   ( 290)  369  82.8 2.6e-16
CCDS56149.1 HOXD8 gene_id:3234|Hs108|chr2  ( 106)  361  80.9 3.6e-16
CCDS56148.1 HOXD8 gene_id:3234|Hs108|chr2  ( 289)  363  81.7 5.8e-16
CCDS82153.1 HOXB3 gene_id:3213|Hs108|chr17 ( 299)  300  69.1 3.6e-12
CCDS5404.1 HOXA3 gene_id:3200|Hs108|chr7   ( 443)  302  69.7 3.7e-12
CCDS82154.1 HOXB3 gene_id:3213|Hs108|chr17 ( 358)  300  69.2 4.1e-12
CCDS9327.1 PDX1 gene_id:3651|Hs108|chr13   ( 283)  298  68.7 4.5e-12
CCDS11528.1 HOXB3 gene_id:3213|Hs108|chr17 ( 431)  300  69.2 4.7e-12
CCDS5403.1 HOXA2 gene_id:3199|Hs108|chr7   ( 376)  297  68.6 6.4e-12
CCDS4304.1 CDX1 gene_id:1044|Hs108|chr5    ( 265)  282  65.5 3.9e-11
CCDS2270.1 HOXD3 gene_id:3232|Hs108|chr2   ( 432)  284  66.1 4.3e-11
CCDS34788.1 MNX1 gene_id:3110|Hs108|chr7   ( 401)  281  65.4 6.1e-11
CCDS5401.1 HOXA1 gene_id:3198|Hs108|chr7   ( 335)  280  65.2 6.2e-11
CCDS2267.2 HOXD9 gene_id:3235|Hs108|chr2   ( 352)  279  65.0 7.3e-11

>>CCDS8871.1 HOXC6 gene_id:3223|Hs108|chr12 (235 aa)
 initn: 1580 init1: 1580 opt: 1580 Z-score: 1711.0 bits: 323.9 E(32554): 5.5e-89
Smith-Waterman score: 1580; 100.0% identity (100.0% similar) in 235 aa overlap (1-235:1-235)

               10        20        30        40        50        60
pF1KB8 MNSYFTNPSLSCHLAGGQDVLPNVALNSTAYDPVRHFSTYGAAVAQNRIYSTPFYSPQEN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 MNSYFTNPSLSCHLAGGQDVLPNVALNSTAYDPVRHFSTYGAAVAQNRIYSTPFYSPQEN
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 VVFSSSRGPYDYGSNSFYQEKDMLSNCRQNTLGHNTQTSIAQDFSSEQGRTAPQDQKASI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 VVFSSSRGPYDYGSNSFYQEKDMLSNCRQNTLGHNTQTSIAQDFSSEQGRTAPQDQKASI
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 QIYPWMQRMNSHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLTRRRRIEIANALCL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 QIYPWMQRMNSHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLTRRRRIEIANALCL
              130       140       150       160       170       180

              190       200       210       220       230
pF1KB8 TERQIKIWFQNRRMKWKKESNLTSTLSGGGGGATADSLGGKEEKREETEEEKQKE
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 TERQIKIWFQNRRMKWKKESNLTSTLSGGGGGATADSLGGKEEKREETEEEKQKE
              190       200       210       220       230

>>CCDS41792.1 HOXC6 gene_id:3223|Hs108|chr12 (153 aa)
 initn: 1023 init1: 1023 opt: 1023 Z-score: 1114.1 bits: 212.9 E(32554): 9.8e-56
Smith-Waterman score: 1023; 100.0% identity (100.0% similar) in 153 aa overlap (83-235:1-153)

             60        70        80        90       100       110
pF1KB8 PFYSPQENVVFSSSRGPYDYGSNSFYQEKDMLSNCRQNTLGHNTQTSIAQDFSSEQGRTA
                                     ::::::::::::::::::::::::::::::
CCDS41                               MLSNCRQNTLGHNTQTSIAQDFSSEQGRTA
                                             10        20        30

            120       130       140       150       160       170
pF1KB8 PQDQKASIQIYPWMQRMNSHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLTRRRRI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 PQDQKASIQIYPWMQRMNSHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLTRRRRI
               40        50        60        70        80        90

            180       190       200       210       220       230
pF1KB8 EIANALCLTERQIKIWFQNRRMKWKKESNLTSTLSGGGGGATADSLGGKEEKREETEEEK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 EIANALCLTERQIKIWFQNRRMKWKKESNLTSTLSGGGGGATADSLGGKEEKREETEEEK
              100       110       120       130       140       150

pF1KB8 QKE
       :::
CCDS41 QKE

>>CCDS11531.1 HOXB6 gene_id:3216|Hs108|chr17 (224 aa)
 initn: 596 init1: 486 opt: 632 Z-score: 691.0 bits: 135.1 E(32554): 3.6e-32
Smith-Waterman score: 635; 48.3% identity (72.7% similar) in 238 aa overlap (1-229:1-224)
10 20 30 40 50 pF1KB8 MNSYFTNPSLSCHLAGGQD-VLPNVALNSTAY-DPVRHF-STYGAAVAQNRIYSTPFYSP :.:::.: .. ::.::. : .. : :..: ::.::. . :: . .:.. ..: : : CCDS11 MSSYFVNSTFPVTLASGQESFLGQLPLYSSGYADPLRHYPAPYGPGPGQDKGFATSSYYP 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB8 QENVVFSSSRGPYDYG-SNSFYQEKD---MLSNCRQNTLGHNT--QTSIAQDFSSEQGRT . .. . .: ::: . .::.::. ::. .. : ... ::: .: :.: CCDS11 PAGGGYGRA-APCDYGPAPAFYREKESACALSGADEQPPFHPEPRKSDCAQD-KSVFGET 70 80 90 100 110 120 130 140 150 160 170 pF1KB8 APQDQKASIQIYPWMQRMNSHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLTRRRR ..:: : .::::::::: .. ..: . ::::: :.::::::::::::.::::::::: CCDS11 --EEQKCSTPVYPWMQRMNSCNSSSFGPSGRRGRQTYTRYQTLELEKEFHYNRYLTRRRR 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB8 IEIANALCLTERQIKIWFQNRRMKWKKESNLTSTLSGGGGGATADSLGGKEEKREETEEE ::::.::::::::::::::::::::::::.: : :..:...::.....: CCDS11 IEIAHALCLTERQIKIWFQNRRMKWKKESKLLS----------ASQLSAEEEEEKQAE 180 190 200 210 220 pF1KB8 KQKE

>>CCDS5407.1 HOXA6 gene_id:3203|Hs108|chr7 (233 aa)
 initn: 714 init1: 488 opt: 513 Z-score: 562.7 bits: 111.4 E(32554): 5e-25
Smith-Waterman score: 696; 49.8% identity (71.2% similar) in 229 aa overlap (1-215:1-229)
10 20 30 40 50 pF1KB8 MNSYFTNPSLSCHLAGGQD-VLPNVALNSTAYDPVRHF-STYGAAVAQNRIYSTPFYSPQ :.:::.::.. : .::: : .. : ...:: .: : ..:::. .. :..: . : CCDS54 MSSYFVNPTFPGSLPSGQDSFLGQLPLYQAGYDALRPFPASYGASSLPDKTYTSPCFYQQ 10 20 30 40 50 60 60 70 80 90 100 pF1KB8 ENVVFSSSRGPYDYGSNSFYQEKDMLSNC-----RQNTLGHNTQTSIAQ----DFSSEQG : :.. .:. :.::.. ::..::. . .: : . : : : :: :: CCDS54 SNSVLACNRASYEYGASCFYSDKDLSGASPSGSGKQRGPGDYLHFSPEQQYKPDSSSGQG 70 80 90 100 110 120 110 120 130 140 150 160 pF1KB8 RTAPQ---DQKASIQIYPWMQRMNSHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYL .. . :.: . .::::::::: .:. ::. ::::: :.::::::::::::::::: CCDS54 KALHDEGADRKYTSPVYPWMQRMNSCAGAVYGSHGRRGRQTYTRYQTLELEKEFHFNRYL 130 140 150 160 170 180 170 180 190 200 210 220 pF1KB8 TRRRRIEIANALCLTERQIKIWFQNRRMKWKKESNLTSTLSGGGGGATADSLGGKEEKRE :::::::::::::::::::::::::::::::::..: .. . .: . : CCDS54 TRRRRIEIANALCLTERQIKIWFQNRRMKWKKENKLINSTQPSGEDSEAKAGE 190 200 210 220 230 230 pF1KB8 ETEEEKQKE

>>CCDS5406.1 HOXA5 gene_id:3202|Hs108|chr7 (270 aa)
 initn: 455 init1: 430 opt: 467 Z-score: 512.4 bits: 102.3 E(32554): 3.2e-22
Smith-Waterman score: 486; 47.4% identity (68.9% similar) in 196 aa overlap (38-213:75-267)
10 20 30 40 50 60 pF1KB8 PSLSCHLAGGQDVLPNVALNSTAYDPVRHFSTYGAAVAQNRIYSTP---FYSPQENVVFS .. .:: :. : :: : .::: . . CCDS54 YGYGYNGMDLSVGRSGSGHFGSGERARSYAASASAAPAEPR-YSQPATSTHSPQPDPLPC 50 60 70 80 90 100 70 80 90 100 pF1KB8 SSRGPYDYGSNSFYQEKDMLSNCRQNTL-----------GHNTQTSIAQDF--SSEQG-- :. .: . ::.: . :. ::: . : .: .. .: ::::. CCDS54 SAVAP-SPGSDSHHGGKNSLSNSSGASADAGSTHISSREGVGTASGAEEDAPASSEQASA 110 120 130 140 150 160 110 120 130 140 150 160 pF1KB8 RTAPQDQK-ASIQIYPWMQRMN-SHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLT .. :. :. ::::::.... ::...: : . .:.: :.:::::::::::::::::: CCDS54 QSEPSPAPPAQPQIYPWMRKLHISHDNIG-GPEGKRARTAYTRYQTLELEKEFHFNRYLT 170 180 190 200 210 220 170 180 190 200 210 220 pF1KB8 RRRRIEIANALCLTERQIKIWFQNRRMKWKKESNLTSTLSGGGGGATADSLGGKEEKREE ::::::::.::::.:::::::::::::::::...: : ...::: CCDS54 RRRRIEIAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSMAAAGGAFRP 230 240 250 260 270 230 pF1KB8 TEEEKQKE

>>CCDS5408.1 HOXA7 gene_id:3204|Hs108|chr7 (230 aa)
 initn: 480 init1: 398 opt: 443 Z-score: 487.5 bits: 97.5 E(32554): 7.8e-21
Smith-Waterman score: 478; 39.2% identity (65.4% similar) in 240 aa overlap (2-235:3-229)
10 20 30 40 50 pF1KB8 MNSYFTNPSLSCHLAGGQDVLPNVALNSTAYDPVRHFSTYGAAVAQNRIYSTP-FYSPQ .::..: .: . ::.. .. :. .: .. : . : :::. : ..: .:. . CCDS54 MSSSYYVNALFSKYTAGAS-LFQNAEPTSCSFAPNSQRSGYGAG-AGAFASTVPGLYNVN 10 20 30 40 50 60 70 80 90 100 110 pF1KB8 ENVVFSSSRGPYDYGSNSFYQEKDMLSNCRQNTLGHNTQTSIAQDFSSEQGRTAPQDQKA . : . : :.... . .. :: : .. . . ....: . .: CCDS54 SPLYQSPFASGYGLGADAYGNLP--CASYDQNIPGLCSDLAKGACDKTDEG-ALHGAAEA 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB8 SIQIYPWMQRMNSHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLTRRRRIEIANAL ...:::::. . : ::.:::: :.::::::::::::::::::::::::::.:: CCDS54 NFRIYPWMR--------SSGPDRKRGRQTYTRYQTLELEKEFHFNRYLTRRRRIEIAHAL 120 130 140 150 160 180 190 200 210 220 230 pF1KB8 CLTERQIKIWFQNRRMKWKKESNLTSTLS-----GGGGGATADSLGGKEEKREETEEEKQ ::::::::::::::::::::: . . . :. .:.: . . : ..... :::.. CCDS54 CLTERQIKIWFQNRRMKWKKEHKDEGPTAAAAPEGAVPSAAATAAADKADEEDDDEEEED 170 180 190 200 210 220 pF1KB8 KE .: CCDS54 EEE 230

>>CCDS11532.1 HOXB7 gene_id:3217|Hs108|chr17 (217 aa)
 initn: 415 init1: 383 opt: 433 Z-score: 477.1 bits: 95.5 E(32554): 3e-20
Smith-Waterman score: 450; 42.0% identity (63.2% similar) in 231 aa overlap (2-227:8-217)
10 20 30 40 50 pF1KB8 MNSYFTN-PSLSCHLAGGQDVLPNVALNSTAYDPVRHFSTYGAAVAQNRIYSTP :. :.. :. : .: : ..:. . . : .: : :::. . . : CCDS11 MSSLYYANTLFSKYPASSSVFATG--AFPEQTSCAFASNPQR--PGYGAGSGASFAASMQ 10 20 30 40 50 60 70 80 90 100 pF1KB8 -FYSPQENVVFSSSRGPYDYGSNSFYQEKDMLSNC---RQNTLGHNTQTSIAQDFSSEQG .: ... .:. : : : . . .: .: .:: : : ..:: CCDS11 GLYPGGGGMAGQSAAGVYAAGYGLEPSSFNM--HCAPFEQNLSGVCPGDSAKAAGAKEQ- 60 70 80 90 100 110 110 120 130 140 150 160 pF1KB8 RTAPQDQKASIQIYPWMQRMNSHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLTRR : . .....:::::. . :.::.:::: :.::::::::::::.::::::: CCDS11 RDSDLAAESNFRIYPWMR--------SSGTDRKRGRQTYTRYQTLELEKEFHYNRYLTRR 120 130 140 150 160 170 180 190 200 210 220 pF1KB8 RRIEIANALCLTERQIKIWFQNRRMKWKKESNLTSTLSGGGGGATADSLGGKEEKREETE ::::::..:::::::::::::::::::::: : :. : :.:... . ::..:: CCDS11 RRIEIAHTLCLTERQIKIWFQNRRMKWKKE-NKTA-----GPGTTGQDRAEAEEEEEE 170 180 190 200 210 230 pF1KB8 EEKQKE

>>CCDS11530.1 HOXB5 gene_id:3215|Hs108|chr17 (269 aa)
 initn: 399 init1: 338 opt: 431 Z-score: 473.6 bits: 95.2 E(32554): 4.6e-20
Smith-Waterman score: 431; 59.6% identity (78.9% similar) in 114 aa overlap (101-213:157-266)
80 90 100 110 120 130 pF1KB8 DYGSNSFYQEKDMLSNCRQNTLGHNTQTSIAQDFSSEQGRTAPQDQKASIQIYPWMQRMN :: . .::. : . ::.:::.... CCDS11 SANFTEIDEASASSEPEEAASQLSSPSLARAQPEPMATSTAAPEGQ--TPQIFPWMRKLH 130 140 150 160 170 180 140 150 160 170 180 pF1KB8 -SHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLTRRRRIEIANALCLTERQIKIWF ::. .: : .:.: :.::::::::::::::::::::::::::.::::.:::::::: CCDS11 ISHDMTG--PDGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLSERQIKIWF 190 200 210 220 230 240 190 200 210 220 230 pF1KB8 QNRRMKWKKESNLTSTLSGGGGGATADSLGGKEEKREETEEEKQKE :::::::::...: : . .:.: CCDS11 QNRRMKWKKDNKLKSMSLATAGSAFQP 250 260

>>CCDS8870.1 HOXC8 gene_id:3224|Hs108|chr12 (242 aa)
 initn: 487 init1: 359 opt: 417 Z-score: 459.2 bits: 92.3 E(32554): 3e-19
Smith-Waterman score: 431; 37.2% identity (57.9% similar) in 261 aa overlap (1-235:1-242)
10 20 30 40 50 60 pF1KB8 MNSYFTNPSLSCHLAGGQDVLPNVALNSTAYDPVRHFSTYGAAVAQNRIYSTPFYSPQEN :.:::.:: .: . :: ... : :: : .. : . : .:. .: . CCDS88 MSSYFVNPLFSKYKAG-ESLEP-------AYYDCRFPQSVGRSHAL--VYGPGGSAPGFQ 10 20 30 40 50 70 80 90 100 pF1KB8 VVFSSSRGPYDYG----SNSFYQEKDMLSNC--------------RQNTLGHNTQTSIAQ . . . .: ::: ::.. .: ::. : . ..:..: CCDS88 HASHHVQDFFHHGTSGISNSGYQQNPCSLSCHGDASKFYGYEALPRQSLYGAQQEASVVQ 60 70 80 90 100 110 110 120 130 140 150 pF1KB8 --DFSSEQGRTAPQDQKASIQ------IYPWMQRMNSHSGVGYGADRRRGRQIYSRYQTL : .: . .. . : : ..:::. :. :: ::: ::::::: CCDS88 YPDCKSSANTNSSEGQGHLNQNSSPSLMFPWMR---PHA-----PGRRSGRQTYSRYQTL 120 130 140 150 160 160 170 180 190 200 210 pF1KB8 ELEKEFHFNRYLTRRRRIEIANALCLTERQIKIWFQNRRMKWKKESNLTSTLSGGGGGAT :::::: :: ::::.::::...:: :::::.::::::::::::::.: . : :. CCDS88 ELEKEFLFNPYLTRKRRIEVSHALGLTERQVKIWFQNRRMKWKKENN-KDKLPGARDEEK 170 180 190 200 210 220 220 230 pF1KB8 ADSLGGKEEKREETEEEKQKE .. :..::..:: :.:..:. CCDS88 VEEEGNEEEEKEEEEKEENKD 230 240

>>CCDS8872.1 HOXC5 gene_id:3222|Hs108|chr12 (222 aa)
 initn: 424 init1: 338 opt: 413 Z-score: 455.4 bits: 91.5 E(32554): 4.8e-19
Smith-Waterman score: 413; 61.9% identity (79.0% similar) in 105 aa overlap (102-204:119-218)
80 90 100 110 120 130 pF1KB8 YGSNSFYQEKDMLSNCRQNTLGHNTQTSIAQDFSSEQGRTAPQDQK-ASIQIYPWMQRMN .. ... :. : .: : :::::: ... CCDS88 DEAAPLNPGMYSQKAARPALEERAKSSGEIKEEQAQTGQPAGLSQPPAPPQIYPWMTKLH 90 100 110 120 130 140 140 150 160 170 180 pF1KB8 -SHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLTRRRRIEIANALCLTERQIKIWF :: .: .:.: :.::::::::::::::::::::::::::: :::.:::::::: CCDS88 MSHE-----TDGKRSRTSYTRYQTLELEKEFHFNRYLTRRRRIEIANNLCLNERQIKIWF 150 160 170 180 190 200 190 200 210 220 230 pF1KB8 QNRRMKWKKESNLTSTLSGGGGGATADSLGGKEEKREETEEEKQKE :::::::::.:.. : CCDS88 QNRRMKWKKDSKMKSKEAL 210 220

235 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 20:04:27 2016 done: Sat Nov 5 20:04:27 2016
 Total Scan time: 1.650 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]