FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7951, 255 aa
  1>>>pF1KB7951 255 - 255 aa - 255 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.2039+/-0.000724; mu= 7.6684+/- 0.044
 mean_var=142.1283+/-29.954, 0's: 0 Z-trim(114.5): 183 B-trim: 776 in 2/50
 Lambda= 0.107581
 statistics sampled from 14817 (15027) to 14817 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.799), E-opt: 0.2 (0.462), width: 16
 Scan time: 2.750

The best scores are:                               opt  bits E(32554)
CCDS2247.2 DLX1 gene_id:1745|Hs108|chr2    ( 255) 1712 276.3 1.5e-74
CCDS47647.2 DLX6 gene_id:1750|Hs108|chr7   ( 293)  769 129.9 1.9e-30
CCDS33328.1 DLX1 gene_id:1745|Hs108|chr2   ( 129)  733 124.1 4.7e-29
CCDS11555.1 DLX4 gene_id:1748|Hs108|chr17  ( 240)  534  93.4 1.5e-19
CCDS45728.1 DLX4 gene_id:1748|Hs108|chr17  ( 168)  481  85.1 3.4e-17
CCDS5647.1 DLX5 gene_id:1749|Hs108|chr7    ( 289)  476  84.5   9e-17
CCDS2248.1 DLX2 gene_id:1746|Hs108|chr2    ( 328)  471  83.7 1.7e-16
CCDS11556.1 DLX3 gene_id:1747|Hs108|chr17  ( 287)  466  82.9 2.6e-16

>>CCDS2247.2 DLX1 gene_id:1745|Hs108|chr2 (255 aa)
 initn: 1712 init1: 1712 opt: 1712 Z-score: 1452.1 bits: 276.3 E(32554): 1.5e-74
Smith-Waterman score: 1712; 100.0% identity (100.0% similar) in 255 aa overlap (1-255:1-255)

               10        20        30        40        50        60
pF1KB7 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPSPMSHGHYSMHCLHSAGHSQPDGAYSSA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPSPMSHGHYSMHCLHSAGHSQPDGAYSSA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 SSFSRPLGYPYVNSVSSHASSPYISSVQSYPGSASLAQSRLEDPGADSEKSTVVEGGEVR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS22 SSFSRPLGYPYVNSVSSHASSPYISSVQSYPGSASLAQSRLEDPGADSEKSTVVEGGEVR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 
FNGKGKKIRKPRTIYSSLQLQALNRRFQQTQYLALPERAELAASLGLTQTQVKIWFQNKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS22 FNGKGKKIRKPRTIYSSLQLQALNRRFQQTQYLALPERAELAASLGLTQTQVKIWFQNKR 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 SKFKKLMKQGGAALEGSALANGRALSAGSPPVPPGWNPNSSSGKGSGGNAGSYIPSYTSW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS22 SKFKKLMKQGGAALEGSALANGRALSAGSPPVPPGWNPNSSSGKGSGGNAGSYIPSYTSW 190 200 210 220 230 240 250 pF1KB7 YPSAHQEAMQQPQLM ::::::::::::::: CCDS22 YPSAHQEAMQQPQLM 250 >>CCDS47647.2 DLX6 gene_id:1750|Hs108|chr7 (293 aa) initn: 816 init1: 550 opt: 769 Z-score: 660.3 bits: 129.9 E(32554): 1.9e-30 Smith-Waterman score: 824; 56.4% identity (75.7% similar) in 243 aa overlap (25-255:55-293) 10 20 30 40 50 pF1KB7 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPSPMSHGHYSMHCLHSAGHSQPD : .:: ::. :. .:: .::::::. . CCDS47 GQQQQQQQQQQQQQQQQQQQPPPPPPPPPQPHSQQSSPA-MAGAHYPLHCLHSAAAAAAA 30 40 50 60 70 80 60 70 80 90 100 pF1KB7 GAYSSASSFSRPLGYPYV----NSVS--SHASSPYISS------VQSYPGSASLAQSRLE :.. . : ::. :: . : :. ::.: .::: .:.. ::.: . CCDS47 GSHHHHHHQHHHHGSPYASGGGNSYNHRSLAAYPYMSHSQHSPYLQSYHNSSAAAQTRGD 90 100 110 120 130 140 110 120 130 140 150 160 pF1KB7 DPGADSEKSTVVEGGEVRFNGKGKKIRKPRTIYSSLQLQALNRRFQQTQYLALPERAELA : .:..:.::.:.::.:::::::::::::::::::::::::.::::::::::::::::: CCDS47 D--TDQQKTTVIENGEIRFNGKGKKIRKPRTIYSSLQLQALNHRFQQTQYLALPERAELA 150 160 170 180 190 200 170 180 190 200 210 220 pF1KB7 ASLGLTQTQVKIWFQNKRSKFKKLMKQGGAALEGSALANGRALSAGSPPVPPGWNPNSSS ::::::::::::::::::::::::.:::. :.. : .. ::: :: .:: :. :.: CCDS47 ASLGLTQTQVKIWFQNKRSKFKKLLKQGSNPHESDPLQGSAALSPRSPALPPVWDV-SAS 210 220 230 240 250 260 230 240 250 pF1KB7 GKGSGGNAGSYIPSYTSWYPSAHQEAMQQPQLM .:: . .::.:.:. 
:: : ::..::.::.: CCDS47 AKGVSMPPNSYMPGYSHWYSSPHQDTMQRPQMM 270 280 290 >>CCDS33328.1 DLX1 gene_id:1745|Hs108|chr2 (129 aa) initn: 770 init1: 725 opt: 733 Z-score: 635.1 bits: 124.1 E(32554): 4.7e-29 Smith-Waterman score: 733; 86.7% identity (92.2% similar) in 128 aa overlap (1-125:1-128) 10 20 30 40 50 60 pF1KB7 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPSPMSHGHYSMHCLHSAGHSQPDGAYSSA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPSPMSHGHYSMHCLHSAGHSQPDGAYSSA 10 20 30 40 50 60 70 80 90 100 110 pF1KB7 SSFSRPLGYPYVNSVSSHASSPYISSVQSYPGSASLAQSRLEDPGAD---SEKSTVVEGG ::::::::::::::::::::::::::::::::::::::::::::: : .. : :. CCDS33 SSFSRPLGYPYVNSVSSHASSPYISSVQSYPGSASLAQSRLEDPGQDLVPKQAIQVQEAD 70 80 90 100 110 120 120 130 140 150 160 170 pF1KB7 EVRFNGKGKKIRKPRTIYSSLQLQALNRRFQQTQYLALPERAELAASLGLTQTQVKIWFQ :. ..:.: CCDS33 EAGWGGSGG >>CCDS11555.1 DLX4 gene_id:1748|Hs108|chr17 (240 aa) initn: 479 init1: 404 opt: 534 Z-score: 464.4 bits: 93.4 E(32554): 1.5e-19 Smith-Waterman score: 547; 43.9% identity (63.0% similar) in 262 aa overlap (3-255:1-240) 10 20 30 40 50 60 pF1KB7 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPSPMSHGHYSMHCLHSAGHSQPDGAYSSA ::..: : . ..:::: :. ..: : . : . : : : : : CCDS11 MTSLPCPLPGRDASKAVF-----PD--LAPVPSVAAAYPL------GLS-PTTAASPN 10 20 30 40 70 80 90 100 110 pF1KB7 SSFSRPLG----YPYVNSVSSHASSPYISSVQSYPGSASLAQSRLEDPG---ADSEKSTV :.::: : :::.. .. : :.: : : : : : ::::: . CCDS11 LSYSRPYGHLLSYPYTEPANPGDS--YLSCQQPAALSQPLCGPA-EHPQELEADSEKPRL 50 60 70 80 90 100 120 130 140 150 160 170 pF1KB7 V-EGGEVRFNGKGKKIRKPRTIYSSLQLQALNRRFQQTQYLALPERAELAASLGLTQTQV : .: : .. .::.::::::::::::: ::.:::.::::::::::.:::.:::::::: CCDS11 SPEPSERRPQAPAKKLRKPRTIYSSLQLQHLNQRFQHTQYLALPERAQLAAQLGLTQTQV 110 120 130 140 150 160 180 190 200 210 220 230 pF1KB7 KIWFQNKRSKFKKLMKQGGAALEGSALANGRALSAGSPPVPPGWN-PNSSSGKGSGGNAG ::::::::::.:::.::.... ::. . ..: :::.: :. :.... 
:: CCDS11 KIWFQNKRSKYKKLLKQNSGGQEGDFPGRTFSVSPCSPPLPSLWDLPKAGTLPTSG---- 170 180 190 200 210 240 250 pF1KB7 SYIPSYTSWYPSAHQEAMQQPQLM : :. .:: .... .::.: CCDS11 -YGNSFGAWYQHHSSDVLASPQMM 220 230 240 >>CCDS45728.1 DLX4 gene_id:1748|Hs108|chr17 (168 aa) initn: 454 init1: 404 opt: 481 Z-score: 422.1 bits: 85.1 E(32554): 3.4e-17 Smith-Waterman score: 481; 53.6% identity (76.2% similar) in 151 aa overlap (107-255:23-168) 80 90 100 110 120 130 pF1KB7 SHASSPYISSVQSYPGSASLAQSRLEDPGADSEKSTVV-EGGEVRFNGKGKKIRKPRTIY :::: . : .: : .. .::.::::::: CCDS45 MKLSVLPPRSLLAPYTVLCCPPDSEKPRLSPEPSERRPQAPAKKLRKPRTIY 10 20 30 40 50 140 150 160 170 180 190 pF1KB7 SSLQLQALNRRFQQTQYLALPERAELAASLGLTQTQVKIWFQNKRSKFKKLMKQGGAALE :::::: ::.:::.::::::::::.:::.::::::::::::::::::.:::.::.... : CCDS45 SSLQLQHLNQRFQHTQYLALPERAQLAAQLGLTQTQVKIWFQNKRSKYKKLLKQNSGGQE 60 70 80 90 100 110 200 210 220 230 240 250 pF1KB7 GSALANGRALSAGSPPVPPGWN-PNSSSGKGSGGNAGSYIPSYTSWYPSAHQEAMQQPQL :. . ..: :::.: :. :.... :: : :. .:: .... .::. CCDS45 GDFPGRTFSVSPCSPPLPSLWDLPKAGTLPTSG-----YGNSFGAWYQHHSSDVLASPQM 120 130 140 150 160 pF1KB7 M : CCDS45 M >>CCDS5647.1 DLX5 gene_id:1749|Hs108|chr7 (289 aa) initn: 430 init1: 389 opt: 476 Z-score: 414.6 bits: 84.5 E(32554): 9e-17 Smith-Waterman score: 501; 44.0% identity (63.1% similar) in 252 aa overlap (5-244:36-263) 10 20 30 pF1KB7 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPSP :.::: : ... : :. ::. CCDS56 DRRVPSIRSGDFQAPFQTSAAMHHPSQESPTLPES--SATDSDYYSPTGGAPHGYCSPTS 10 20 30 40 50 60 40 50 60 70 80 90 pF1KB7 MSHGHYSMHCLHSAGHSQPDGAYSSASSFSRPLGYPYVNSVSSHASSPYISSVQSYPGSA :.:. :. . : :. .::.: :: ....:. : :: ..: : CCDS56 ASYGK----ALNPYQY-QYHGVNGSAGS------YP----AKAYADYSYASSYHQYGG-- 70 80 90 100 100 110 120 130 140 150 pF1KB7 SLAQSRLEDPGADSEKSTVVEGGEVRF-NGKGKKIRKPRTIYSSLQLQALNRRFQQTQYL : .:. :.: .. : :::. 
::: ::.:::::::::.:: ::.::::.:::: CCDS56 --AYNRV--PSATNQPEKEVTEPEVRMVNGKPKKVRKPRTIYSSFQLAALQRRFQKTQYL 110 120 130 140 150 160 160 170 180 190 200 210 pF1KB7 ALPERAELAASLGLTQTQVKIWFQNKRSKFKKLMKQGGAALEGSALANGRALSAGSPPVP :::::::::::::::::::::::::::::.::.::.: : : ... .. .:: : CCDS56 ALPERAELAASLGLTQTQVKIWFQNKRSKIKKIMKNGEMPPEHSP-SSSDPMACNSPQSP 170 180 190 200 210 220 220 230 240 250 pF1KB7 PGWNPNSSSGKGSG-----------GNAGSYIPSYTSWYPSAHQEAMQQPQLM :.:..:: . : . :.::. . .::: :: CCDS56 AVWEPQGSSRSLSHHPHAHPPTSNQSPASSYLENSASWYTSAASSINSHLPPPGSLQHPL 230 240 250 260 270 280 CCDS56 ALASGTLY >>CCDS2248.1 DLX2 gene_id:1746|Hs108|chr2 (328 aa) initn: 426 init1: 394 opt: 471 Z-score: 409.7 bits: 83.7 E(32554): 1.7e-16 Smith-Waterman score: 471; 44.4% identity (63.8% similar) in 232 aa overlap (29-248:51-275) 10 20 30 40 50 pF1KB7 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPS-PMS---HGHYSMHCLHSAGHSQPD : ::. :.: . : . : :: . CCDS22 STYHQHQQPPSGGGAGPGGNSSSSSSLHKPQESPTLPVSTATDSSYYTNQQHPAGGGGGG 30 40 50 60 70 80 60 70 80 90 100 110 pF1KB7 GA-YSSASSFSRPLGYPYVNSVSSHASSPY-ISSVQSYPGSASLAQSRLEDPGADSEKST :. :. .:.. . .:.: :.: : .. . .: . : . : .:. . .. CCDS22 GSPYAHMGSYQYQASG--LNNVPYSAKSSYDLGYTAAYTSYAPYGTS--SSPANNEPEKE 90 100 110 120 130 120 130 140 150 160 170 pF1KB7 VVEGGEVRF-NGKGKKIRKPRTIYSSLQLQALNRRFQQTQYLALPERAELAASLGLTQTQ .: :.:. ::: ::.:::::::::.:: ::.::::.:::::::::::::::::::::: CCDS22 DLEP-EIRIVNGKPKKVRKPRTIYSSFQLAALQRRFQKTQYLALPERAELAASLGLTQTQ 140 150 160 170 180 190 180 190 200 210 220 pF1KB7 VKIWFQNKRSKFKKLMKQGGAALEGSALANGRALSAGSPPV--PPGWN---PNSSSGKGS :::::::.::::::. :.: : :.. : :::: : .:. :. .: :. CCDS22 VKIWFQNRRSKFKKMWKSGEIPSEQHPGASASPPCA-SPPVSAPASWDFGVPQRMAGGGG 200 210 220 230 240 250 230 240 250 pF1KB7 GGNAGSYIPSYTSWYPSAHQEAMQQPQLM :..:: : : ::. 
: CCDS22 PGSGGSGAGSSGS-SPSSAASAFLGNYPWYHQTSGSASHLQATAPLLHPTQTPQPHHHHH 260 270 280 290 300 310 >>CCDS11556.1 DLX3 gene_id:1747|Hs108|chr17 (287 aa) initn: 457 init1: 388 opt: 466 Z-score: 406.3 bits: 82.9 E(32554): 2.6e-16 Smith-Waterman score: 477; 41.0% identity (60.5% similar) in 266 aa overlap (24-254:3-262) 10 20 30 40 50 pF1KB7 MTMTTMPESLNSPVSGKAVFMEFGPPNQQMSPSPMSHGHYSMHCLHSAGHSQP------- : ....: : .. :. : :......: CCDS11 MSGSFDRKLS-SILTDISSSLSC-HAGSKDSPTLPESSV 10 20 30 60 70 80 90 pF1KB7 -DGAYSSASSFSRPLGYPY---VNSVSSH--------ASSPYISSVQSYPGSASLAQ--S : .: :: . . : :: :: . : :.. : . : .:: : . CCDS11 TDLGYYSAPQHDYYSGQPYGQTVNPYTYHHQFNLNGLAGTGAYSPKSEYTYGASYRQYGA 40 50 60 70 80 90 100 110 120 130 140 150 pF1KB7 RLEDPGADSEKSTVVEG--GEVRF-NGKGKKIRKPRTIYSSLQLQALNRRFQQTQYLALP :.: .. .: : .:::. ::: ::.::::::::: :: ::.::::..:::::: CCDS11 YREQPLPAQDPVSVKEEPEAEVRMVNGKPKKVRKPRTIYSSYQLAALQRRFQKAQYLALP 100 110 120 130 140 150 160 170 180 190 200 210 pF1KB7 ERAELAASLGLTQTQVKIWFQNKRSKFKKLMKQGGAALEGSALANGRALSAGSPPVPPGW :::::::.::::::::::::::.:::::::.:.: . :: : :. ... .::: : : CCDS11 ERAELAAQLGLTQTQVKIWFQNRRSKFKKLYKNGEVPLEHSP-NNSDSMACNSPPSPALW 160 170 180 190 200 210 220 230 240 250 pF1KB7 NPNSSSGKGSGGNA------GSYIPSY-----TSWYPSAHQEAMQQPQLM . .: : . . . : ::: .::: : . .. :.: CCDS11 DTSSHSTPAPARSQLPPPLPYSASPSYLDDPTNSWY---HAQNLSGPHLQQQPPQPATLH 220 230 240 250 260 270 CCDS11 HASPGPPPNPGAVY 280 255 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 10:15:12 2016 done: Sat Nov 5 10:15:13 2016 Total Scan time: 2.750 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]