FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB7749, 443 aa 1>>>pF1KB7749 443 - 443 aa - 443 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 13.9168+/-0.00117; mu= -16.4737+/- 0.071 mean_var=687.4279+/-141.307, 0's: 0 Z-trim(118.1): 102 B-trim: 259 in 2/53 Lambda= 0.048917 statistics sampled from 18885 (18988) to 18885 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.829), E-opt: 0.2 (0.583), width: 16 Scan time: 2.910 The best scores are: opt bits E(32554) CCDS5404.1 HOXA3 gene_id:3200|Hs108|chr7 ( 443) 3151 236.7 3.5e-62 CCDS11528.1 HOXB3 gene_id:3213|Hs108|chr17 ( 431) 959 82.0 1.3e-15 CCDS2270.1 HOXD3 gene_id:3232|Hs108|chr2 ( 432) 883 76.7 5.2e-14 CCDS82154.1 HOXB3 gene_id:3213|Hs108|chr17 ( 358) 746 66.9 3.7e-11 >>CCDS5404.1 HOXA3 gene_id:3200|Hs108|chr7 (443 aa) initn: 3151 init1: 3151 opt: 3151 Z-score: 1229.9 bits: 236.7 E(32554): 3.5e-62 Smith-Waterman score: 3151; 100.0% identity (100.0% similar) in 443 aa overlap (1-443:1-443) 10 20 30 40 50 60 pF1KB7 MQKATYYDSSAIYGGYPYQAANGFAYNANQQPYPASAALGADGEYHRPACSLQSPSSAGG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 MQKATYYDSSAIYGGYPYQAANGFAYNANQQPYPASAALGADGEYHRPACSLQSPSSAGG 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB7 HPKAHELSEACLRTLSAPPSQPPSLGEPPLHPPPPQAAPPAPQPPQPAPQPPAPTPAAPP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 HPKAHELSEACLRTLSAPPSQPPSLGEPPLHPPPPQAAPPAPQPPQPAPQPPAPTPAAPP 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB7 PPSSASPPQNASNNPTPANAAKSPLLNSPTVAKQIFPWMKESRQNTKQKTSSSSSGESCA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 PPSSASPPQNASNNPTPANAAKSPLLNSPTVAKQIFPWMKESRQNTKQKTSSSSSGESCA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB7 GDKSPPGQASSKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMANLLNLTERQIKIWFQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 GDKSPPGQASSKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMANLLNLTERQIKIWFQ 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB7 NRRMKYKKDQKGKGMLTSSGGQSPSRSPVPPGAGGYLNSMHSLVNSVPYEPQSPPPFSKP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 NRRMKYKKDQKGKGMLTSSGGQSPSRSPVPPGAGGYLNSMHSLVNSVPYEPQSPPPFSKP 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB7 PQGTYGLPPASYPASLPSCAPPPPPQKRYTAAGAGAGGTPDYDPHAHGLQGNGSYGTPHI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 PQGTYGLPPASYPASLPSCAPPPPPQKRYTAAGAGAGGTPDYDPHAHGLQGNGSYGTPHI 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB7 QGSPVFVGGSYVEPMSNSGPALFGLTHLPHAASGAMDYGGAGPLGSGHHHGPGPGEPHPT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 QGSPVFVGGSYVEPMSNSGPALFGLTHLPHAASGAMDYGGAGPLGSGHHHGPGPGEPHPT 370 380 390 400 410 420 430 440 pF1KB7 YTDLTGHHPSQGRIQEAPKLTHL ::::::::::::::::::::::: CCDS54 YTDLTGHHPSQGRIQEAPKLTHL 430 440 >>CCDS11528.1 HOXB3 gene_id:3213|Hs108|chr17 (431 aa) initn: 831 init1: 605 opt: 959 Z-score: 394.1 bits: 82.0 E(32554): 1.3e-15 Smith-Waterman score: 1415; 50.4% identity (70.6% similar) in 476 aa overlap (1-443:1-431) 10 20 30 40 50 pF1KB7 MQKATYYDSSA--IYGGYP-YQAANGFAYNANQQPYPASAALGADGEYHRPACSLQSPSS ::::::::..: ..::: : ..:::.... :: : .:: .:.:.: :::::: .. CCDS11 MQKATYYDNAAAALFGGYSSYPGSNGFGFDVPPQP-PFQAATHLEGDYQRSACSLQSLGN 10 20 30 40 50 60 70 80 90 100 110 pF1KB7 AGGHPKAHELSEACLRTLSAPPSQPPSLGEPPLHPPPPQAAPPAPQPPQPAPQPPAPTPA :. : :..::. .:.: :.:. : : .:::. ::. :: . . . CCDS11 AAPHAKSKELNGSCMR---------PGLA------PEPLSAPPGSPPPSAAPTSATSNSS 60 70 80 90 100 120 130 140 150 160 170 pF1KB7 APPPPSSASPPQNASNNPTPANAAKSPLLNSPTVAKQIFPWMKESRQNTKQKTSSSSSGE ::...::. . : :: :..::::::::::::..: :..: ...: CCDS11 NGGGPSKSGPPKCG------------PGTNS-TLTKQIFPWMKESRQTSKLKNNSPGTAE 110 120 130 140 150 180 190 200 210 pF1KB7 SCAG-----------------------DKSPPGQASSKRARTAYTSAQLVELEKEFHFNR .:.: ::::::.:.:::::::::::::::::::::::: CCDS11 GCGGGGGGGGGGGSGGSGGGGGGGGGGDKSPPGSAASKRARTAYTSAQLVELEKEFHFNR 160 170 180 190 200 210 220 230 240 250 260 270 pF1KB7 YLCRPRRVEMANLLNLTERQIKIWFQNRRMKYKKDQKGKGMLTSSGGQSPSRSPVPP--G ::::::::::::::::.::::::::::::::::::::.::. .:::: ::. :: : . CCDS11 YLCRPRRVEMANLLNLSERQIKIWFQNRRMKYKKDQKAKGLASSSGGPSPAGSPPQPMQS 220 230 240 250 260 270 280 290 300 310 320 330 pF1KB7 AGGYLNSMHSLVNSVPYEPQSPPPFSKPPQGTYGLPPASYPASLPSCAPPPPPQKRYTAA ..:..:..::.. : :: ::: :.: :..:.:: ..: : .:. : ..: . CCDS11 TAGFMNALHSMTPS--YESPSPPAFGKAHQNAYALP-SNYQPPLKGCGAP----QKYPPT 280 290 300 310 320 340 350 360 370 380 pF1KB7 GAGAGGTPDYDPHAHGLQGNG-SYGTPHIQGSPVFVGGS-YVEPMSN-SGPALFGLTHLP : :.:.::. ::.:: .:::: .:::::.:::. :..:. .::.:.::.:: CCDS11 PA-----PEYEPHV--LQANGGAYGTPTMQGSPVYVGGGGYADPLPPPAGPSLYGLNHLS 330 340 350 360 370 390 400 410 420 430 440 pF1KB7 HAASGAMDYGGAGPLGSGHHHGPGPGEPHPTYTDLTGHH--PSQGRIQEAPKLTHL : :: .::.:: :.. ..:::: :::::::::..:: : ::::::::::::: CCDS11 HHPSGNLDYNGAPPMAPSQHHGPC--EPHPTYTDLSSHHAPPPQGRIQEAPKLTHL 380 390 400 410 420 430 >>CCDS2270.1 HOXD3 gene_id:3232|Hs108|chr2 (432 aa) initn: 989 init1: 570 opt: 883 Z-score: 365.1 bits: 76.7 E(32554): 5.2e-14 Smith-Waterman score: 1401; 52.0% identity (72.4% similar) in 450 aa overlap (1-443:17-432) 10 20 30 40 pF1KB7 MQKATYYDSSAIYGGYPY-QAANGFAYNANQQPYPASAALGA-D ::::.::.. ...::: : .... ..:.. .:::: :: .. : CCDS22 MLFEQGQQALELPECTMQKAAYYENPGLFGGYGYSKTTDTYGYSTPHQPYPPPAAASSLD 10 20 30 40 50 60 50 60 70 80 90 100 pF1KB7 GEYHRPACSLQS--PSSAGGHPKAHELSEACLRTLSAPPSQPPSLGEPPLHPPPPQAAPP .: :::.:: : : .: :. ::. .:.: :. : : :: . CCDS22 TDYPGSACSIQSSAPLRAPAH-KGAELNGSCMR-----PGTGNSQGGGGGSQPPGLNSE- 70 80 90 100 110 110 120 130 140 150 160 pF1KB7 APQPPQPAPQPPAPTPAAPPPPSSASPPQNASNNPTPANAAKSPLLNSPTVAKQIFPWMK ::::: : ::. :..: :... : .. ...: ::..: : :..:::::::: CCDS22 -QQPPQPPPPPPTLPPSSPTNPGGGVPAKKPKGGP---NASSS----SATISKQIFPWMK 120 130 140 150 160 170 180 190 200 210 220 pF1KB7 ESRQNTKQKTSSSSSGESCAGDKSPPGQASSKRARTAYTSAQLVELEKEFHFNRYLCRPR :::::.:::.: ...:::: :::::: :: ::.:::::::::::::::::::::::::: CCDS22 ESRQNSKQKNSCATAGESCE-DKSPPGPAS-KRVRTAYTSAQLVELEKEFHFNRYLCRPR 170 180 190 200 210 220 230 240 250 260 270 280 pF1KB7 RVEMANLLNLTERQIKIWFQNRRMKYKKDQKGKGMLTSSGGQSPSRSPVPPGAGGYLNSM :::::::::::::::::::::::::::::::.::.: : ..::: ::: ::.:.. CCDS22 RVEMANLLNLTERQIKIWFQNRRMKYKKDQKAKGILHSPASQSPERSPPLGGAAGHVAYS 230 240 250 260 270 280 290 300 310 320 330 pF1KB7 HSL--VNSVPYEPQSPPPFSKPPQGTYGLPPASYPASLPSCAPPPPPQKRYTAAGAGAGG .: : .. :. ::: :.: . ::: :.: : : :: : ::::.: CCDS22 GQLPPVPGLAYDAPSPPAFAKSQPNMYGL--AAYTAPLSSCLP---QQKRYAA------- 290 300 310 320 330 340 350 360 370 380 390 pF1KB7 TPDYDPHAHGLQGNGSYGTPHIQGSPVFVGGSYVEPMS-NSGPALFGLTHLPHAASGAMD :...:: . .:.: ... ..:::::.:::..:: :. ::: .:.: :: : .:...: CCDS22 -PEFEPHPMASNGGG-FASANLQGSPVYVGGNFVESMAPASGP-VFNLGHLSHPSSASVD 340 350 360 370 380 400 410 420 430 440 pF1KB7 YGGAGPLGSGHHHGPGPGEPHPTYTDLTGHHPSQGRIQEAPKLTHL :. :. . ..::::: .::::::::..:: ::::. :::::::: CCDS22 YSCAAQIPGNHHHGPC--DPHPTYTDLSAHHSSQGRLPEAPKLTHL 390 400 410 420 430 >>CCDS82154.1 HOXB3 gene_id:3213|Hs108|chr17 (358 aa) initn: 831 init1: 605 opt: 746 Z-score: 313.8 bits: 66.9 E(32554): 3.7e-11 Smith-Waterman score: 1229; 53.9% identity (74.3% similar) in 373 aa overlap (105-443:3-358) 80 90 100 110 120 130 pF1KB7 LSAPPSQPPSLGEPPLHPPPPQAAPPAPQPPQPAPQPPAPTPAAPPP---PSSASPPQNA : ::.: . :..::: :.::. .. CCDS82 MRPGLAPEPLSAPPGSPPPSAAPTSATSNSSN 10 20 30 140 150 160 170 180 pF1KB7 SNNPTPANAAK-SPLLNSPTVAKQIFPWMKESRQNTKQKTSSSSSGESCAG--------- ...:. .. : .: :: :..::::::::::::..: :..: ...:.:.: CCDS82 GGGPSKSGPPKCGPGTNS-TLTKQIFPWMKESRQTSKLKNNSPGTAEGCGGGGGGGGGGG 40 50 60 70 80 90 190 200 210 220 pF1KB7 --------------DKSPPGQASSKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMANL ::::::.:.::::::::::::::::::::::::::::::::::::: CCDS82 SGGSGGGGGGGGGGDKSPPGSAASKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMANL 100 110 120 130 140 150 230 240 250 260 270 280 pF1KB7 LNLTERQIKIWFQNRRMKYKKDQKGKGMLTSSGGQSPSRSPVPP--GAGGYLNSMHSLVN :::.::::::::::::::::::::.::. .:::: ::. :: : ...:..:..::.. CCDS82 LNLSERQIKIWFQNRRMKYKKDQKAKGLASSSGGPSPAGSPPQPMQSTAGFMNALHSMTP 160 170 180 190 200 210 290 300 310 320 330 340 pF1KB7 SVPYEPQSPPPFSKPPQGTYGLPPASYPASLPSCAPPPPPQKRYTAAGAGAGGTPDYDPH : :: ::: :.: :..:.:: ..: : .:. : ..: . : :.:.:: CCDS82 S--YESPSPPAFGKAHQNAYALP-SNYQPPLKGCGAP----QKYPPTPA-----PEYEPH 220 230 240 250 350 360 370 380 390 400 pF1KB7 AHGLQGNG-SYGTPHIQGSPVFVGGS-YVEPMSN-SGPALFGLTHLPHAASGAMDYGGAG . ::.:: .:::: .:::::.:::. :..:. .::.:.::.:: : :: .::.:: CCDS82 V--LQANGGAYGTPTMQGSPVYVGGGGYADPLPPPAGPSLYGLNHLSHHPSGNLDYNGAP 260 270 280 290 300 310 410 420 430 440 pF1KB7 PLGSGHHHGPGPGEPHPTYTDLTGHH--PSQGRIQEAPKLTHL :.. ..:::: :::::::::..:: : ::::::::::::: CCDS82 PMAPSQHHGPC--EPHPTYTDLSSHHAPPPQGRIQEAPKLTHL 320 330 340 350 443 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 09:22:49 2016 done: Fri Nov 4 09:22:49 2016 Total Scan time: 2.910 Total Display time: 0.000 Function used was FASTA [36.3.4 Apr, 2011]