FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8962, 342 aa
  1>>>pF1KB8962 342 - 342 aa - 342 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 8.4652+/-0.000887; mu= 3.1077+/- 0.053
 mean_var=196.0181+/-40.249, 0's: 0 Z-trim(112.8): 140 B-trim: 8 in 1/51
 Lambda= 0.091606
 statistics sampled from 13328 (13475) to 13328 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.763), E-opt: 0.2 (0.414), width: 16
 Scan time: 2.550

The best scores are:                                opt  bits E(32554)
CCDS8868.1 HOXC10 gene_id:3226|Hs108|chr12  ( 342) 2323 319.0 3.5e-87
CCDS2266.1 HOXD10 gene_id:3236|Hs108|chr2   ( 340)  958 138.6 7.1e-33
CCDS5410.2 HOXA10 gene_id:3206|Hs108|chr7   ( 410)  605  92.0 9.1e-19
CCDS8869.1 HOXC9 gene_id:3225|Hs108|chr12   ( 260)  449  71.3 1e-12
CCDS5409.1 HOXA9 gene_id:3205|Hs108|chr7    ( 272)  438  69.8 2.9e-12
CCDS11534.1 HOXB9 gene_id:3219|Hs108|chr17  ( 250)  427  68.3 7.5e-12
CCDS2267.2 HOXD9 gene_id:3235|Hs108|chr2    ( 352)  404  65.4 8e-11

>>CCDS8868.1 HOXC10 gene_id:3226|Hs108|chr12 (342 aa)
 initn: 2323 init1: 2323 opt: 2323 Z-score: 1678.6 bits: 319.0 E(32554): 3.5e-87
Smith-Waterman score: 2323; 100.0% identity (100.0% similar) in 342 aa overlap (1-342:1-342)

               10        20        30        40        50        60
pF1KB8 MTCPRNVTPNSYAEPLAAPGGGERYSRSAGMYMQSGSDFNCGVMRGCGLAPSLSKRDEGS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 MTCPRNVTPNSYAEPLAAPGGGERYSRSAGMYMQSGSDFNCGVMRGCGLAPSLSKRDEGS
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 SPSLALNTYPSYLSQLDSWGDPKAAYRLEQPVGRPLSSCSYPPSVKEENVCCMYSAEKRA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 SPSLALNTYPSYLSQLDSWGDPKAAYRLEQPVGRPLSSCSYPPSVKEENVCCMYSAEKRA
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 KSGPEAALYSHPLPESCLGEHEVPVPSYYRASPSYSALDKTPHCSGANDFEAPFEQRASL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 KSGPEAALYSHPLPESCLGEHEVPVPSYYRASPSYSALDKTPHCSGANDFEAPFEQRASL
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 NPRAEHLESPQLGGKVSFPETPKSDSQTPSPNEIKTEQSLAGPKGSPSESEKERAKAADS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 NPRAEHLESPQLGGKVSFPETPKSDSQTPSPNEIKTEQSLAGPKGSPSESEKERAKAADS
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB8 SPDTSDNEAKEEIKAENTTGNWLTAKSGRKKRCPYTKHQTLELEKEFLFNMYLTRERRLE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS88 SPDTSDNEAKEEIKAENTTGNWLTAKSGRKKRCPYTKHQTLELEKEFLFNMYLTRERRLE
              250       260       270       280       290       300

              310       320       330       340
pF1KB8 ISKTINLTDRQVKIWFQNRRMKLKKMNRENRIRELTSNFNFT
       ::::::::::::::::::::::::::::::::::::::::::
CCDS88 ISKTINLTDRQVKIWFQNRRMKLKKMNRENRIRELTSNFNFT
              310       320       330       340

>>CCDS2266.1 HOXD10 gene_id:3236|Hs108|chr2 (340 aa)
 initn: 947 init1: 589 opt: 958 Z-score: 703.7 bits: 138.6 E(32554): 7.1e-33
Smith-Waterman score: 958; 50.5% identity (75.1% similar) in 321 aa overlap (25-342:28-340)

10 20 30 40 50 pF1KB8 MTCPRNVTPNSYAEPLAAPGGGERYSRSAGMYMQSGS-DFNCGVMRGCGLAPSLSKR :: ::.::: : :.. :. ::: :::.:: CCDS22 MSFPNSSPAANTFLVDSLISACRSDSFYSSSASMYMPPPSADMGTYGMQTCGLLPSLAKR 10 20 30 40 50 60 60 70 80 90 100 110 pF1KB8 DEGSSPSLALNTYPSYLSQLDSWGDPKAAYRLEQPVGRPLSSCSYPPSVKEENVCCMYSA : . ....:..: :. :.::: ::. . :.:::: . . .::. ..:::. ::::: CCDS22 -EVNHQNMGMNVHP-YIPQVDSWTDPNRSCRIEQPVTQQVPTCSFTTNIKEESNCCMYSD 70 80 90 100 110 120 130 140 150 160 170 pF1KB8 EKRAKSGPEAALYSHPLPESCLGEH-EVPVPSYYRASPSYSALDKTPHCSGANDFEAPFE .. . :. :.. .:::: :. :::::.:.: : .: : :: . . :. :. CCDS22 KRNKLISAEVPSYQRLVPESCPVENPEVPVPGYFRLSQTY-ATGKTQEYN--NSPEGSST 120 130 140 150 160 170 180 190 200 210 220 230 pF1KB8 QRASLNPRAEHLESPQLGG-KVSFPETPKSDSQTPSPNEIKTEQSLAGPKGSPSESEKER .::::. .:::.. .... . . . :.... .: . : : : ..
CCDS22 VMLQLNPRGA--AKPQLSAAQLQMEKKMNEPVSGQEPTKVSQVESPEAKGGLPEE-RSCL 180 190 200 210 220 230 240 250 260 270 280 290 pF1KB8 AKAADSSPDTSDNEAKEEIKAENTTGNWLTAKSGRKKRCPYTKHQTLELEKEFLFNMYLT :... :::.....:.:::::... :.:::::::::::::::::::::::::::::::::: CCDS22 AEVSVSSPEVQEKESKEEIKSDTPTSNWLTAKSGRKKRCPYTKHQTLELEKEFLFNMYLT 240 250 260 270 280 290 300 310 320 330 340 pF1KB8 RERRLEISKTINLTDRQVKIWFQNRRMKLKKMNRENRIRELTSNFNFT :::::::::..:::::::::::::::::::::.:::::::::.:..:. CCDS22 RERRLEISKSVNLTDRQVKIWFQNRRMKLKKMSRENRIRELTANLTFS 300 310 320 330 340 >>CCDS5410.2 HOXA10 gene_id:3206|Hs108|chr7 (410 aa) initn: 673 init1: 522 opt: 605 Z-score: 450.4 bits: 92.0 E(32554): 9.1e-19 Smith-Waterman score: 752; 42.1% identity (62.8% similar) in 368 aa overlap (17-342:49-410) 10 20 30 40 pF1KB8 MTCPRNVTPNSYAEPLAAPGGGERYSRSAGMYMQSGSDFNCGVMRG :. ::: : .:.:. ..:. : ... CCDS54 SCSESPAANSFLVDSLISSGRGEAGGGGGGAGGGGGGGYYAHGGVYLPPAADLPYG-LQS 20 30 40 50 60 70 50 60 70 80 90 pF1KB8 CGLAPSLS-KRDEGSSPS-------LALNTYPSYLSQLDSWGDPKAAYRLEQPVGRP--- ::: :.:. ::.:..::. :. ... : .: : : . :.: : : : CCDS54 CGLFPTLGGKRNEAASPGSGGGGGGLGPGAHGYGPSPIDLWLDAPRSCRMEPPDGPPPPP 80 90 100 110 120 130 100 110 120 130 pF1KB8 ----------------LSSCSYPPSVKEENVCCMY-SAEKRAKSGPEAALYSH----PLP .:::. ..:::. :.: ::.: : . :: . : : CCDS54 QQQPPPPPQPPQPAPQATSCSFAQNIKEESSYCLYDSADKCPKVSATAAELAPFPRGPPP 140 150 160 170 180 190 140 150 160 170 180 190 pF1KB8 ESC-LGEHE-VPVPSYYRASPSY-SALDKTPHCSGANDFEA-PFEQRASLNPRAEHLESP ..: :: ::::.:.: : .: .: .::... : :: :. :. : CCDS54 DGCALGTSSGVPVPGYFRLSQAYGTAKGYGSGGGGAQQLGAGPFP--AQPPGRGFDLPPA 200 210 220 230 240 250 200 210 220 230 240 pF1KB8 QLGGKVSFPETPKSDSQTPSPNEIKTEQSLAGPKG------SPSESEKERAKAADSSPDT .:... . .. .. : :. . : .: .: : : .:. ..:: . CCDS54 LASGSADAARKERALDSPPPPT--LACGSGGGSQGDEEAHASSSAAEELSPAPSESSKAS 260 270 280 290 300 310 250 260 270 280 290 300 pF1KB8 SDNEAKEEIKAENTTGNWLTAKSGRKKRCPYTKHQTLELEKEFLFNMYLTRERRLEISKT .... . :.::.. ::::::::::::::::::::::::::::::::::::::::::.. 
CCDS54 PEKDSLGNSKGENAA-NWLTAKSGRKKRCPYTKHQTLELEKEFLFNMYLTRERRLEISRS 320 330 340 350 360 370 310 320 330 340 pF1KB8 INLTDRQVKIWFQNRRMKLKKMNRENRIRELTSNFNFT ..::::::::::::::::::::::::::::::.::::. CCDS54 VHLTDRQVKIWFQNRRMKLKKMNRENRIRELTANFNFS 380 390 400 410 >>CCDS8869.1 HOXC9 gene_id:3225|Hs108|chr12 (260 aa) initn: 439 init1: 418 opt: 449 Z-score: 341.7 bits: 71.3 E(32554): 1e-12 Smith-Waterman score: 470; 35.9% identity (60.2% similar) in 259 aa overlap (95-334:6-258) 70 80 90 100 110 120 pF1KB8 ALNTYPSYLSQLDSWGDPKAAYRLEQPVGRPLSSCSYPPSVKEENVCCMYSAEKRAKSGP :.:. ....: . : . . : CCDS88 MSATGPISNYYVDSLISHDNEDLLASRFPATGAHP 10 20 30 130 140 150 160 170 180 pF1KB8 EAALYSHPLPESCLGEHEVPVPSYYRASPSYSALDKTPHCSGANDFEAPFEQRASLNPRA :: : .:. : . : :. .:. . . .: : .. :. . :. . CCDS88 AAARPSGLVPD-C---SDFPSCSFA-PKPAVFSTSWAPVPSQSSVVYHPYGPQPHLGADT 40 50 60 70 80 90 190 200 210 220 230 pF1KB8 EHLES--PQLGGKVSFPETPKSDSQTP-SPNEIKTEQSLAGPK----------GSPSESE ..... :.: :::: : . . .:. ... :: :::.: CCDS88 RYMRTWLEPLSGAVSFPSFPAGGRHYALKPDAYPGRRADCGPGEGRSYPDYMYGSPGEL- 100 110 120 130 140 240 250 260 270 280 pF1KB8 KERAKAADSSPD------TSDNEAKEEIKAENTTGNWLTAKSGRKKRCPYTKHQTLELEK ..:: . ::. .. .: : .. : ..::. :.: :::::::::.::::::: CCDS88 RDRAPQTLPSPEADALAGSKHKEEKADLDPSNPVANWIHARSTRKKRCPYTKYQTLELEK 150 160 170 180 190 200 290 300 310 320 330 340 pF1KB8 EFLFNMYLTRERRLEISKTINLTDRQVKIWFQNRRMKLKKMNRENRIRELTSNFNFT ::::::::::.:: :.....:::.:::::::::::::.::::.:. .: CCDS88 EFLFNMYLTRDRRYEVARVLNLTERQVKIWFQNRRMKMKKMNKEKTDKEQS 210 220 230 240 250 260 >>CCDS5409.1 HOXA9 gene_id:3205|Hs108|chr7 (272 aa) initn: 446 init1: 392 opt: 438 Z-score: 333.6 bits: 69.8 E(32554): 2.9e-12 Smith-Waterman score: 438; 40.9% identity (63.1% similar) in 203 aa overlap (134-329:69-267) 110 120 130 140 150 160 pF1KB8 SVKEENVCCMYSAEKRAKSGPEAALYSHPLPESCLGEHEVPVPSYYRASPSYSALDKTPH : : . ::. :.. . 
..: CCDS54 PRQAATLAEHPDFSPCSFQSKATVFGASWNPVHAAGANAVPAAVYHHHHHHPYVHPQAPV 40 50 60 70 80 90 170 180 190 200 210 220 pF1KB8 CSGANDFEAPFEQRASLNPRAEHLESPQLGGKVSF---PETPKSDSQTPSPNEIKTEQSL ..: : . .:. :.: : : .. . :: : : . :. :: CCDS54 AAAAPDGRY---MRSWLEPTPGALSFAGLPSSRPYGIKPE-PLSARRGDCPTLDTHTLSL 100 110 120 130 140 150 230 240 250 260 270 pF1KB8 AGPK-GSPSESEKERAKAADSSPDTSDNEA---KEEIKAENTTGNWLTAKSGRKKRCPYT . ::: ..... . . : ....::. : : .: ..::: :.: :::::::: CCDS54 TDYACGSPPVDREKQPSEGAFSENNAENESGGDKPPIDPNNPAANWLHARSTRKKRCPYT 160 170 180 190 200 210 280 290 300 310 320 330 pF1KB8 KHQTLELEKEFLFNMYLTRERRLEISKTINLTDRQVKIWFQNRRMKLKKMNRENRIRELT :::::::::::::::::::.:: :... .:::.:::::::::::::.::.:.. CCDS54 KHQTLELEKEFLFNMYLTRDRRYEVARLLNLTERQVKIWFQNRRMKMKKINKDRAKDE 220 230 240 250 260 270 340 pF1KB8 SNFNFT >>CCDS11534.1 HOXB9 gene_id:3219|Hs108|chr17 (250 aa) initn: 429 init1: 404 opt: 427 Z-score: 326.3 bits: 68.3 E(32554): 7.5e-12 Smith-Waterman score: 451; 42.2% identity (64.9% similar) in 211 aa overlap (140-329:43-246) 110 120 130 140 150 160 pF1KB8 VCCMYSAEKRAKSGPEAALYSHPLPESCLGEH-EVPVPSYYRASPSYSA--LDKTPHCSG :: : : :. .: ..: .:: :: CCDS11 DSIISHESEDAPPAKFPSGQYASSRQPGHAEHLEFPSCSFQPKAPVFGASWAPLSPHASG 20 30 40 50 60 70 170 180 190 200 210 220 pF1KB8 A-NDFEAPFEQRASLNPRAEHLESPQLGGKVSFPETPKSDSQTPSPNE--IKTEQSLAGP . . :. : .. : :: : .. . .:.... .:. .. .:.: :..: CCDS11 SLPSVYHPYIQPQGVPP----AESRYL--RTWLEPAPRGEA-APGQGQAAVKAEPLLGAP 80 90 100 110 120 230 240 250 260 pF1KB8 -----KGSPSES-EKERAKAA---DSSPDTSDN------EAKEEIKAENTTGNWLTAKSG .:.: : : .. : .. : .:: : ::. : ..::: :.:. CCDS11 GELLKQGTPEYSLETSAGREAVLSNQRPGYGDNKICEGSEDKERPDQTNPSANWLHARSS 130 140 150 160 170 180 270 280 290 300 310 320 pF1KB8 RKKRCPYTKHQTLELEKEFLFNMYLTRERRLEISKTINLTDRQVKIWFQNRRMKLKKMNR :::::::::.:::::::::::::::::.:: :... .::..:::::::::::::.::::. 
CCDS11 RKKRCPYTKYQTLELEKEFLFNMYLTRDRRHEVARLLNLSERQVKIWFQNRRMKMKKMNK 190 200 210 220 230 240 330 340 pF1KB8 ENRIRELTSNFNFT : CCDS11 EQGKE 250 >>CCDS2267.2 HOXD9 gene_id:3235|Hs108|chr2 (352 aa) initn: 437 init1: 384 opt: 404 Z-score: 307.8 bits: 65.4 E(32554): 8e-11 Smith-Waterman score: 422; 38.5% identity (61.5% similar) in 213 aa overlap (122-329:150-346) 100 110 120 130 140 pF1KB8 VGRPLSSCSYPPSVKEENVCCMYSAEKRAKSGPEAALYSHPLPESCLGEHEVPVP----- ::: . . ::. . .:.: CCDS22 MEPLPGFPGGAGGGGGGGGGGPGRGPSPGPSGPANGRHYGIKPET----RAAPAPATAAS 120 130 140 150 160 170 150 160 170 180 190 200 pF1KB8 SYYRASPSYSALDKTPHCSGANDFEAPFEQRASLNPRAEHLESPQLGGKVSFPETPKSDS . .: : :. .: .:: : . .. . : : .. . :: : . CCDS22 TTSSSSTSLSSSSKRTECSVARESQGSSGPEFSCNSFLQEKAAAATGG-----TGPGAGI 180 190 200 210 220 230 210 220 230 240 250 260 pF1KB8 QTPSPNEIKTEQSLAGPKGSPSESEKERAKAADSSPDTSDNEAKEEIKAENTTGNWLTAK . . . ..: : . . :. : ::. : :.:. .... .: ..::. :. CCDS22 GAATGTGGSSEPSACSDHPIPGCSLKEEEKQ-HSQPQ------QQQLDPNNPAANWIHAR 240 250 260 270 280 270 280 290 300 310 320 pF1KB8 SGRKKRCPYTKHQTLELEKEFLFNMYLTRERRLEISKTINLTDRQVKIWFQNRRMKLKKM : :::::::::.:::::::::::::::::.:: :... .:::.:::::::::::::.::: CCDS22 STRKKRCPYTKYQTLELEKEFLFNMYLTRDRRYEVARILNLTERQVKIWFQNRRMKMKKM 290 300 310 320 330 340 330 340 pF1KB8 NRENRIRELTSNFNFT ..: CCDS22 SKEKCPKGD 350 342 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 16:42:37 2016 done: Fri Nov 4 16:42:38 2016 Total Scan time: 2.550 Total Display time: 0.010 Function used was FASTA [36.3.4 Apr, 2011]