FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9691, 230 aa
  1>>>pF1KB9691 230 - 230 aa - 230 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.3046+/-0.000779; mu= 11.4331+/- 0.048
 mean_var=128.5246+/-26.797, 0's: 0 Z-trim(112.7): 162 B-trim: 772 in 2/50
 Lambda= 0.113131
 statistics sampled from 13221 (13409) to 13221 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.755), E-opt: 0.2 (0.412), width: 16
 Scan time: 2.410

The best scores are:                                opt  bits E(32554)
CCDS5408.1 HOXA7 gene_id:3204|Hs108|chr7    ( 230) 1557 264.5 4.2e-71
CCDS11532.1 HOXB7 gene_id:3217|Hs108|chr17  ( 217)  806 141.9 3.2e-34
CCDS41792.1 HOXC6 gene_id:3223|Hs108|chr12  ( 153)  443  82.5 1.7e-16
CCDS8871.1 HOXC6 gene_id:3223|Hs108|chr12   ( 235)  443  82.7 2.3e-16
CCDS11531.1 HOXB6 gene_id:3216|Hs108|chr17  ( 224)  436  81.5 4.9e-16
CCDS8870.1 HOXC8 gene_id:3224|Hs108|chr12   ( 242)  435  81.4 5.8e-16
CCDS5406.1 HOXA5 gene_id:3202|Hs108|chr7    ( 270)  434  81.2 7.1e-16
CCDS11530.1 HOXB5 gene_id:3215|Hs108|chr17  ( 269)  431  80.8 9.9e-16
CCDS11533.1 HOXB8 gene_id:3218|Hs108|chr17  ( 243)  428  80.2 1.3e-15
CCDS56148.1 HOXD8 gene_id:3234|Hs108|chr2   ( 289)  429  80.5 1.3e-15
CCDS5407.1 HOXA6 gene_id:3203|Hs108|chr7    ( 233)  425  79.7 1.8e-15
CCDS56149.1 HOXD8 gene_id:3234|Hs108|chr2   ( 106)  410  76.9 5.5e-15
CCDS2268.1 HOXD8 gene_id:3234|Hs108|chr2    ( 290)  411  77.5 1e-14
CCDS8873.1 HOXC4 gene_id:3221|Hs108|chr12   ( 264)  400  75.7 3.3e-14
CCDS8872.1 HOXC5 gene_id:3222|Hs108|chr12   ( 222)  391  74.1 8e-14
CCDS11529.1 HOXB4 gene_id:3214|Hs108|chr17  ( 251)  377  71.9 4.2e-13
CCDS5405.1 HOXA4 gene_id:3201|Hs108|chr7    ( 320)  378  72.2 4.5e-13
CCDS2269.1 HOXD4 gene_id:3233|Hs108|chr2    ( 255)  376  71.8 4.8e-13
CCDS2267.2 HOXD9 gene_id:3235|Hs108|chr2    ( 352)  337  65.5 5e-11

>>CCDS5408.1 HOXA7 gene_id:3204|Hs108|chr7 (230 aa)
initn: 1557 init1: 1557 opt: 1557 Z-score: 1390.0 bits: 264.5 E(32554): 4.2e-71 Smith-Waterman score: 1557; 99.6% identity (100.0% similar) in 230 aa overlap (1-230:1-230) 10 20 30 40 50 60 pF1KB9 MSSSYYVNALFSKYTAGTSLFQNAEPTSCSFAPNSQRSGYGAGAGAFASTVPGLYNVNSP :::::::::::::::::.:::::::::::::::::::::::::::::::::::::::::: CCDS54 MSSSYYVNALFSKYTAGASLFQNAEPTSCSFAPNSQRSGYGAGAGAFASTVPGLYNVNSP 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 LYQSPFASGYGLGADAYGNLPCASYDQNIPGLCSDLAKGACDKTDEGALHGAAEANFRIY :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 LYQSPFASGYGLGADAYGNLPCASYDQNIPGLCSDLAKGACDKTDEGALHGAAEANFRIY 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 PWMRSSGPDRKRGRQTYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLTERQIKIWFQN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 PWMRSSGPDRKRGRQTYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLTERQIKIWFQN 130 140 150 160 170 180 190 200 210 220 230 pF1KB9 RRMKWKKEHKDEGPTAAAAPEGAVPSAAATAAADKADEEDDDEEEEDEEE :::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 RRMKWKKEHKDEGPTAAAAPEGAVPSAAATAAADKADEEDDDEEEEDEEE 190 200 210 220 230 >>CCDS11532.1 HOXB7 gene_id:3217|Hs108|chr17 (217 aa) initn: 787 init1: 586 opt: 806 Z-score: 727.9 bits: 141.9 E(32554): 3.2e-34 Smith-Waterman score: 813; 58.4% identity (76.4% similar) in 233 aa overlap (1-224:1-217) 10 20 30 40 50 pF1KB9 MSSSYYVNALFSKYTAGTSLFQNA---EPTSCSFAPNSQRSGYGAGAGA-FASTVPGLYN ::: ::.:.::::: :..:.: .. : :::.:: : :: :::::.:: ::... ::: CCDS11 MSSLYYANTLFSKYPASSSVFATGAFPEQTSCAFASNPQRPGYGAGSGASFAASMQGLYP 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB9 VNSPLY-QSP---FASGYGLGADAYGNLPCASYDQNIPGLC-SDLAKGACDKTDEGALHG .. . :: .:.:::: ... :. :: ..::. :.: .: ::.: : .. . 
CCDS11 GGGGMAGQSAAGVYAAGYGLEPSSF-NMHCAPFEQNLSGVCPGDSAKAAGAKEQRDS-DL 70 80 90 100 110 120 130 140 150 160 170 pF1KB9 AAEANFRIYPWMRSSGPDRKRGRQTYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLTE :::.:::::::::::: ::::::::::::::::::::::.::::::::::::::.::::: CCDS11 AAESNFRIYPWMRSSGTDRKRGRQTYTRYQTLELEKEFHYNRYLTRRRRIEIAHTLCLTE 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB9 RQIKIWFQNRRMKWKKEHKDEGPTAAAAPEGAVPSAAATAAADKADEEDDDEEEEDEEE :::::::::::::::::.: :: .:.. :.:. :...:: CCDS11 RQIKIWFQNRRMKWKKENKTAGP--------------GTTGQDRAEAEEEEEE 180 190 200 210 >>CCDS41792.1 HOXC6 gene_id:3223|Hs108|chr12 (153 aa) initn: 480 init1: 398 opt: 443 Z-score: 409.6 bits: 82.5 E(32554): 1.7e-16 Smith-Waterman score: 443; 57.3% identity (78.2% similar) in 124 aa overlap (114-229:35-153) 90 100 110 120 130 pF1KB9 SYDQNIPGLCSDLAKGACDKTDEGALHGAAEANFRIYPWMR--------SSGPDRKRGRQ .:...:::::. . : ::.:::: CCDS41 CRQNTLGHNTQTSIAQDFSSEQGRTAPQDQKASIQIYPWMQRMNSHSGVGYGADRRRGRQ 10 20 30 40 50 60 140 150 160 170 180 190 pF1KB9 TYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKEHKDEGPT :.::::::::::::::::::::::::::.::::::::::::::::::::::: . CCDS41 IYSRYQTLELEKEFHFNRYLTRRRRIEIANALCLTERQIKIWFQNRRMKWKKESN----- 70 80 90 100 110 200 210 220 230 pF1KB9 AAAAPEGAVPSAAATAAADKADEEDDDEEEEDEEE ... :. .:.: . . : ..... :::...: CCDS41 LTSTLSGGGGGATADSLGGKEEKREETEEEKQKE 120 130 140 150 >>CCDS8871.1 HOXC6 gene_id:3223|Hs108|chr12 (235 aa) initn: 504 init1: 398 opt: 443 Z-score: 407.2 bits: 82.7 E(32554): 2.3e-16 Smith-Waterman score: 477; 39.9% identity (65.0% similar) in 243 aa overlap (3-229:2-235) 10 20 30 40 50 pF1KB9 MSSSYYVNALFSKYTAG-TSLFQNAEPTSCSFAPNSQRSGYGAG-AGAFASTVPGLYNVN .::..: .: . :: ... :. .: .. : . : :::. : ..: .: CCDS88 MNSYFTNPSLSCHLAGGQDVLPNVALNSTAYDPVRHFSTYGAAVAQNRIYSTP-FY--- 10 20 30 40 50 60 70 80 90 100 110 pF1KB9 SPLYQSPFASGYG---LGADAYGNLP--CASYDQNIPGLCSDLAKGACDKTDEGALHGAA :: . :.:. : :.... . .. :: : .. . . 
....: CCDS88 SPQENVVFSSSRGPYDYGSNSFYQEKDMLSNCRQNTLGHNTQTSIAQDFSSEQGRTAPQD 60 70 80 90 100 110 120 130 140 150 160 pF1KB9 E-ANFRIYPWMR--------SSGPDRKRGRQTYTRYQTLELEKEFHFNRYLTRRRRIEIA . :...:::::. . : ::.:::: :.:::::::::::::::::::::::::: CCDS88 QKASIQIYPWMQRMNSHSGVGYGADRRRGRQIYSRYQTLELEKEFHFNRYLTRRRRIEIA 120 130 140 150 160 170 170 180 190 200 210 220 pF1KB9 HALCLTERQIKIWFQNRRMKWKKEHKDEGPTAAAAPEGAVPSAAATAAADKADEEDDDEE .::::::::::::::::::::::: . ... :. .:.: . . : ..... :: CCDS88 NALCLTERQIKIWFQNRRMKWKKE-----SNLTSTLSGGGGGATADSLGGKEEKREETEE 180 190 200 210 220 230 230 pF1KB9 EEDEEE :...: CCDS88 EKQKE >>CCDS11531.1 HOXB6 gene_id:3216|Hs108|chr17 (224 aa) initn: 491 init1: 410 opt: 436 Z-score: 401.3 bits: 81.5 E(32554): 4.9e-16 Smith-Waterman score: 481; 44.9% identity (57.3% similar) in 227 aa overlap (3-201:2-217) 10 20 30 40 50 pF1KB9 MSSSYYVNALFSKYTA-GTSLFQNAEPTSCSFAPNSQR---SGYGAGAG---AFASTVPG :::.::. : : : : . : : . : . :: : : .::.. CCDS11 MSSYFVNSTFPVTLASGQESFLGQLPLYSSGYADPLRHYPAPYGPGPGQDKGFATSS-- 10 20 30 40 50 60 70 80 90 100 pF1KB9 LYNVNSPLYQSPFASGYGLGADA-YGNLP---------CA-SYDQNIPGLCSDLAKGAC- : : ..::: .: :: : :: : .. : . . :. : CCDS11 --------YYPPAGGGYGRAAPCDYGPAPAFYREKESACALSGADEQPPFHPEPRKSDCA 60 70 80 90 100 110 120 130 140 150 pF1KB9 -DKTDEGALHGAAEANFRIYPWMR--------SSGPDRKRGRQTYTRYQTLELEKEFHFN ::. : . . . .::::. : ::. .:::::::::::::::::::.: CCDS11 QDKSVFGETE-EQKCSTPVYPWMQRMNSCNSSSFGPSGRRGRQTYTRYQTLELEKEFHYN 110 120 130 140 150 160 160 170 180 190 200 210 pF1KB9 RYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKEHKDEGPTAAAAPEGAVPSAAATAA :::::::::::::::::::::::::::::::::::: : . . 
.: : CCDS11 RYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKESKLLSASQLSAEEEEEKQAE 170 180 190 200 210 220 220 230 pF1KB9 ADKADEEDDDEEEEDEEE >>CCDS8870.1 HOXC8 gene_id:3224|Hs108|chr12 (242 aa) initn: 537 init1: 389 opt: 435 Z-score: 400.0 bits: 81.4 E(32554): 5.8e-16 Smith-Waterman score: 508; 42.5% identity (61.0% similar) in 259 aa overlap (3-230:2-236) 10 20 30 40 50 pF1KB9 MSSSYYVNALFSKYTAGTSLFQNAEPT--SCSFAPNSQRSG---YGAGAGAFASTVPGLY :::.:: ::::: :: :: ::. .: : . :: :: :..: ::. CCDS88 MSSYFVNPLFSKYKAGESL----EPAYYDCRFPQSVGRSHALVYGPGGSA-----PGFQ 10 20 30 40 50 60 70 80 90 pF1KB9 NVNSPLYQSPFASGY-GLGADAYGNLPCA-----------SYD----QNIPGLCS----- .. : :. : : :.. ..: . ::. .:. :.. : . CCDS88 HA-SHHVQDFFHHGTSGISNSGYQQNPCSLSCHGDASKFYGYEALPRQSLYGAQQEASVV 60 70 80 90 100 100 110 120 130 140 150 pF1KB9 ---DLAKGACDKTDEGALHGAAEANFRI-YPWMRSSGPDRKRGRQTYTRYQTLELEKEFH : ..: ...:: : ... . .:::: .: :. :::::.::::::::::: CCDS88 QYPDCKSSANTNSSEGQGHLNQNSSPSLMFPWMRPHAPGRRSGRQTYSRYQTLELEKEFL 110 120 130 140 150 160 160 170 180 190 200 pF1KB9 FNRYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKEH-KDEGPTAAAAPEGAVPSAAA :: ::::.::::..::: :::::.::::::::::::::. ::. : .: : CCDS88 FNPYLTRKRRIEVSHALGLTERQVKIWFQNRRMKWKKENNKDKLP--GARDE-------- 170 180 190 200 210 210 220 230 pF1KB9 TAAADKADEEDDDEEEEDEEE .:..:: ..:::..::: CCDS88 ----EKVEEEGNEEEEKEEEEKEENKD 220 230 240 >>CCDS5406.1 HOXA5 gene_id:3202|Hs108|chr7 (270 aa) initn: 483 init1: 403 opt: 434 Z-score: 398.5 bits: 81.2 E(32554): 7.1e-16 Smith-Waterman score: 447; 42.5% identity (62.0% similar) in 221 aa overlap (5-199:47-264) 10 20 pF1KB9 MSSSYYVNAL-FSKYTAGTSLFQNAE-----PTS : :.. .: .:.. : ..: .: CCDS54 PDYQLHNYGDHSSVSEQFRDSASMHSGRYGYGYNGMDLSVGRSGSGHFGSGERARSYAAS 20 30 40 50 60 70 30 40 50 60 70 80 pF1KB9 CSFAPNSQRSGYGAGAGAFASTVPGLYNVNSPLYQSPFASGYGLGADAYGNLPCASYDQN : :: : :. : . : : : . :: .... : .. .: :: : . 
CCDS54 ASAAPAEPR--YSQPATSTHSPQPDPLPC-SAVAPSPGSDSHHGGKNSLSNSSGASADAG 80 90 100 110 120 130 90 100 110 120 pF1KB9 IPGLCS----DLAKGACDKTDEGALHGAAE--------ANFRIYPWMRS--------SGP . : :.:: . . .. ...:. :. .::::::. .:: CCDS54 STHISSREGVGTASGAEEDAPASSEQASAQSEPSPAPPAQPQIYPWMRKLHISHDNIGGP 140 150 160 170 180 190 130 140 150 160 170 180 pF1KB9 DRKRGRQTYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLTERQIKIWFQNRRMKWKKE . ::.: .:::::::::::::::::::::::::::::::::.:::::::::::::::::. CCDS54 EGKRARTAYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLSERQIKIWFQNRRMKWKKD 200 210 220 230 240 250 190 200 210 220 230 pF1KB9 HKDEGPTAAAAPEGAVPSAAATAAADKADEEDDDEEEEDEEE .: .. . ::: CCDS54 NKLKSMSMAAAGGAFRP 260 270 >>CCDS11530.1 HOXB5 gene_id:3215|Hs108|chr17 (269 aa) initn: 493 init1: 401 opt: 431 Z-score: 395.9 bits: 80.8 E(32554): 9.9e-16 Smith-Waterman score: 433; 43.7% identity (65.5% similar) in 197 aa overlap (21-205:85-269) 10 20 30 40 pF1KB9 MSSSYYVNALFSKYTAGTSLFQNAEPTSCSFA-PNSQRSGYGAGAGAFAS :..: .:::.. :.: : . :: : CCDS11 SVNRSSASSSHFGAVGESSRAFPAPAQEPRFRQAA-SSCSLSSPESLPCTNGDSHGAKPS 60 70 80 90 100 110 50 60 70 80 90 100 pF1KB9 TVPGLYNVNSPLYQSPFASGYGLGADAYGNLPCASYDQNIPGLCSDLAKGACDKTDEGAL . .:: :. ::. . . .. :: ... :.:.. . ... . CCDS11 A-------SSPSDQATSASS----SANFTEIDEASASSEPEEAASQLSSPSLARAQPEPM 120 130 140 150 160 110 120 130 140 150 pF1KB9 HGAAEA----NFRIYPWMRS-------SGPDRKRGRQTYTRYQTLELEKEFHFNRYLTRR .. : . .:.::::. .::: ::.: .:::::::::::::::::::::: CCDS11 ATSTAAPEGQTPQIFPWMRKLHISHDMTGPDGKRARTAYTRYQTLELEKEFHFNRYLTRR 170 180 190 200 210 220 160 170 180 190 200 210 pF1KB9 RRIEIAHALCLTERQIKIWFQNRRMKWKKEHKDEGPTAAAAPEGAVPSAAATAAADKADE :::::::::::.:::::::::::::::::..: .. . :.: . 
: CCDS11 RRIEIAHALCLSERQIKIWFQNRRMKWKKDNKLKSMSLATAGSAFQP 230 240 250 260 220 230 pF1KB9 EDDDEEEEDEEE >>CCDS11533.1 HOXB8 gene_id:3218|Hs108|chr17 (243 aa) initn: 478 init1: 353 opt: 428 Z-score: 393.8 bits: 80.2 E(32554): 1.3e-15 Smith-Waterman score: 496; 43.1% identity (63.8% similar) in 246 aa overlap (3-224:2-243) 10 20 30 40 50 pF1KB9 MSSSYYVNALFSKYTAGTSLFQNAEPTSCSFAPN-SQRSG--YG-AGAGAFA--STVPGL :::.::.::::: .: :: : .:.:: . . : :: ...:.: : . . CCDS11 MSSYFVNSLFSKYKTGESLRPNYY--DCGFAQDLGGRPTVVYGPSSGGSFQHPSQIQEF 10 20 30 40 50 60 70 80 90 100 pF1KB9 YNVNS-----PLYQSPFASG-YGLGADAYGNLPCASYDQNIPGLCS-DLAKGA-CDKTDE :. : : :.: : . .: .. :: : :.. : . ::.. : : . CCDS11 YHGPSSLSTAPYQQNPCAVACHGDPGNFYGYDPLQR--QSLFGAQDPDLVQYADCKLAAA 60 70 80 90 100 110 110 120 130 140 150 pF1KB9 GALHGAAEAN------FRIYPWMR-SSGPDRKRGRQTYTRYQTLELEKEFHFNRYLTRRR ..: ::.. ...:::: ... :.::::::.::::::::::: :: ::::.: CCDS11 SGLGEEAEGSEQSPSPTQLFPWMRPQAAAGRRRGRQTYSRYQTLELEKEFLFNPYLTRKR 120 130 140 150 160 170 160 170 180 190 200 210 pF1KB9 RIEIAHALCLTERQIKIWFQNRRMKWKKEH-KDEGPTAAAAPEGAVPSAAATA--AADKA :::..::: :::::.::::::::::::::. ::. :.. : . : :::.. CCDS11 RIEVSHALGLTERQVKIWFQNRRMKWKKENNKDKFPSSKCEQEELEKQKLERAPEAADEG 180 190 200 210 220 230 220 230 pF1KB9 DEEDDDEEEEDEEE : . :.. CCDS11 DAQKGDKK 240 >>CCDS56148.1 HOXD8 gene_id:3234|Hs108|chr2 (289 aa) initn: 469 init1: 396 opt: 429 Z-score: 393.7 bits: 80.5 E(32554): 1.3e-15 Smith-Waterman score: 429; 41.8% identity (63.5% similar) in 189 aa overlap (39-217:97-285) 10 20 30 40 50 60 pF1KB9 ALFSKYTAGTSLFQNAEPTSCSFAPNSQRSGYGAGAGAFASTVPGLYNVNSPLYQSPFA- : :. :.:. .. : . : : . CCDS56 QAHAHPHPSPPPSGTGCGGREGRGQEYFHPGGGSPAAAYQAAPPPPPHPPPPPPPPPCGG 70 80 90 100 110 120 70 80 90 100 110 120 pF1KB9 -SGYGLGADAYG--NL---PCASYDQNIPGLCSDLAKGACDKTDEGALH-GAAEANFRIY . .: : :: :: : . .:. . :.. . : : . . . ... 
CCDS56 IACHGEPAKFYGYDNLQRQPIFTTQQEAELVQYPDCKSSSGNIGEDPDHLNQSSSPSQMF
       130 140 150 160 170 180

       130 140 150 160 170 180
pF1KB9 PWMRSSGPDRKRGRQTYTRYQTLELEKEFHFNRYLTRRRRIEIAHALCLTERQIKIWFQN
       :::: ..: :.::::::.:.::::::::: :: ::::.::::..::: :::::.::::::
CCDS56 PWMRPQAPGRRRGRQTYSRFQTLELEKEFLFNPYLTRKRRIEVSHALALTERQVKIWFQN
       190 200 210 220 230 240

       190 200 210 220 230
pF1KB9 RRMKWKKEH-KDEGPTA-AAAPEGAVPSAAATAAADKADEEDDDEEEEDEEE
       ::::::::. ::. :.. . .: . . : :.:.
CCDS56 RRMKWKKENNKDKFPVSRQEVKDGETKKEAQELEEDRAEGLTN
       250 260 270 280

230 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 18:21:37 2016 done: Fri Nov 4 18:21:38 2016
 Total Scan time: 2.410 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]