FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9726, 431 aa
  1>>>pF1KB9726 431 - 431 aa - 431 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 9.8152+/-0.00105; mu= 2.7923+/- 0.064
 mean_var=464.9526+/-96.014, 0's: 0 Z-trim(117.7): 131 B-trim: 708 in 1/53
 Lambda= 0.059480
 statistics sampled from 18401 (18535) to 18401 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.824), E-opt: 0.2 (0.569), width: 16
 Scan time: 3.340

The best scores are:                               opt  bits E(32554)
CCDS11528.1 HOXB3 gene_id:3213|Hs108|chr17 ( 431) 3074 277.6 1.7e-74
CCDS82154.1 HOXB3 gene_id:3213|Hs108|chr17 ( 358) 2566 233.9   2e-61
CCDS82153.1 HOXB3 gene_id:3213|Hs108|chr17 ( 299) 2140 197.2 1.8e-50
CCDS2270.1 HOXD3 gene_id:3232|Hs108|chr2   ( 432) 1031 102.3 9.9e-22
CCDS5404.1 HOXA3 gene_id:3200|Hs108|chr7   ( 443)  959  96.1 7.3e-20

>>CCDS11528.1 HOXB3 gene_id:3213|Hs108|chr17 (431 aa)
 initn: 3074 init1: 3074 opt: 3074 Z-score: 1451.1 bits: 277.6 E(32554): 1.7e-74
Smith-Waterman score: 3074; 100.0% identity (100.0% similar) in 431 aa overlap (1-431:1-431)

               10        20        30        40        50        60
pF1KB9 MQKATYYDNAAAALFGGYSSYPGSNGFGFDVPPQPPFQAATHLEGDYQRSACSLQSLGNA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MQKATYYDNAAAALFGGYSSYPGSNGFGFDVPPQPPFQAATHLEGDYQRSACSLQSLGNA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 APHAKSKELNGSCMRPGLAPEPLSAPPGSPPPSAAPTSATSNSSNGGGPSKSGPPKCGPG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 APHAKSKELNGSCMRPGLAPEPLSAPPGSPPPSAAPTSATSNSSNGGGPSKSGPPKCGPG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 TNSTLTKQIFPWMKESRQTSKLKNNSPGTAEGCGGGGGGGGGGGSGGSGGGGGGGGGGDK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 TNSTLTKQIFPWMKESRQTSKLKNNSPGTAEGCGGGGGGGGGGGSGGSGGGGGGGGGGDK
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 SPPGSAASKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMANLLNLSERQIKIWFQNRR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 SPPGSAASKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMANLLNLSERQIKIWFQNRR
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB9 MKYKKDQKAKGLASSSGGPSPAGSPPQPMQSTAGFMNALHSMTPSYESPSPPAFGKAHQN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MKYKKDQKAKGLASSSGGPSPAGSPPQPMQSTAGFMNALHSMTPSYESPSPPAFGKAHQN
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB9 AYALPSNYQPPLKGCGAPQKYPPTPAPEYEPHVLQANGGAYGTPTMQGSPVYVGGGGYAD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 AYALPSNYQPPLKGCGAPQKYPPTPAPEYEPHVLQANGGAYGTPTMQGSPVYVGGGGYAD
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB9 PLPPPAGPSLYGLNHLSHHPSGNLDYNGAPPMAPSQHHGPCEPHPTYTDLSSHHAPPPQG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 PLPPPAGPSLYGLNHLSHHPSGNLDYNGAPPMAPSQHHGPCEPHPTYTDLSSHHAPPPQG
              370       380       390       400       410       420

              430
pF1KB9 RIQEAPKLTHL
       :::::::::::
CCDS11 RIQEAPKLTHL
              430

>>CCDS82154.1 HOXB3 gene_id:3213|Hs108|chr17 (358 aa)
 initn: 2566 init1: 2566 opt: 2566 Z-score: 1216.3 bits: 233.9 E(32554): 2e-61
Smith-Waterman score: 2566; 100.0% identity (100.0% similar) in 358 aa overlap (74-431:1-358)

            50        60        70        80        90       100
pF1KB9 EGDYQRSACSLQSLGNAAPHAKSKELNGSCMRPGLAPEPLSAPPGSPPPSAAPTSATSNS
                                     ::::::::::::::::::::::::::::::
CCDS82                               MRPGLAPEPLSAPPGSPPPSAAPTSATSNS
                                             10        20        30

           110       120       130       140       150       160
pF1KB9 SNGGGPSKSGPPKCGPGTNSTLTKQIFPWMKESRQTSKLKNNSPGTAEGCGGGGGGGGGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 SNGGGPSKSGPPKCGPGTNSTLTKQIFPWMKESRQTSKLKNNSPGTAEGCGGGGGGGGGG
               40        50        60        70        80        90

           170       180       190       200       210       220
pF1KB9 GSGGSGGGGGGGGGGDKSPPGSAASKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMAN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 GSGGSGGGGGGGGGGDKSPPGSAASKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMAN
              100       110       120       130       140       150

           230       240       250       260       270       280
pF1KB9 LLNLSERQIKIWFQNRRMKYKKDQKAKGLASSSGGPSPAGSPPQPMQSTAGFMNALHSMT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 LLNLSERQIKIWFQNRRMKYKKDQKAKGLASSSGGPSPAGSPPQPMQSTAGFMNALHSMT
              160       170       180       190       200       210

           290       300       310       320       330       340
pF1KB9 PSYESPSPPAFGKAHQNAYALPSNYQPPLKGCGAPQKYPPTPAPEYEPHVLQANGGAYGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 PSYESPSPPAFGKAHQNAYALPSNYQPPLKGCGAPQKYPPTPAPEYEPHVLQANGGAYGT
              220       230       240       250       260       270

           350       360       370       380       390       400
pF1KB9 PTMQGSPVYVGGGGYADPLPPPAGPSLYGLNHLSHHPSGNLDYNGAPPMAPSQHHGPCEP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 PTMQGSPVYVGGGGYADPLPPPAGPSLYGLNHLSHHPSGNLDYNGAPPMAPSQHHGPCEP
              280       290       300       310       320       330

           410       420       430
pF1KB9 HPTYTDLSSHHAPPPQGRIQEAPKLTHL
       ::::::::::::::::::::::::::::
CCDS82 HPTYTDLSSHHAPPPQGRIQEAPKLTHL
              340       350

>>CCDS82153.1 HOXB3 gene_id:3213|Hs108|chr17 (299 aa)
 initn: 2140 init1: 2140 opt: 2140 Z-score: 1019.6 bits: 197.2 E(32554): 1.8e-50
Smith-Waterman score: 2140; 100.0% identity (100.0% similar) in 299 aa overlap (133-431:1-299)

            110       120       130       140       150       160
pF1KB9 SSNGGGPSKSGPPKCGPGTNSTLTKQIFPWMKESRQTSKLKNNSPGTAEGCGGGGGGGGG
                                     ::::::::::::::::::::::::::::::
CCDS82                               MKESRQTSKLKNNSPGTAEGCGGGGGGGGG
                                             10        20        30

            170       180       190       200       210       220
pF1KB9 GGSGGSGGGGGGGGGGDKSPPGSAASKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 GGSGGSGGGGGGGGGGDKSPPGSAASKRARTAYTSAQLVELEKEFHFNRYLCRPRRVEMA
               40        50        60        70        80        90

            230       240       250       260       270       280
pF1KB9 NLLNLSERQIKIWFQNRRMKYKKDQKAKGLASSSGGPSPAGSPPQPMQSTAGFMNALHSM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 NLLNLSERQIKIWFQNRRMKYKKDQKAKGLASSSGGPSPAGSPPQPMQSTAGFMNALHSM
              100       110       120       130       140       150

            290       300       310       320       330       340
pF1KB9 TPSYESPSPPAFGKAHQNAYALPSNYQPPLKGCGAPQKYPPTPAPEYEPHVLQANGGAYG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 TPSYESPSPPAFGKAHQNAYALPSNYQPPLKGCGAPQKYPPTPAPEYEPHVLQANGGAYG
              160       170       180       190       200       210

            350       360       370       380       390       400
pF1KB9 TPTMQGSPVYVGGGGYADPLPPPAGPSLYGLNHLSHHPSGNLDYNGAPPMAPSQHHGPCE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS82 TPTMQGSPVYVGGGGYADPLPPPAGPSLYGLNHLSHHPSGNLDYNGAPPMAPSQHHGPCE
              220       230       240       250       260       270

            410       420       430
pF1KB9 PHPTYTDLSSHHAPPPQGRIQEAPKLTHL
       :::::::::::::::::::::::::::::
CCDS82 PHPTYTDLSSHHAPPPQGRIQEAPKLTHL
              280       290

>>CCDS2270.1 HOXD3 gene_id:3232|Hs108|chr2 (432 aa)
 initn: 1322 init1: 465 opt: 1031 Z-score: 503.6 bits: 102.3 E(32554): 9.9e-22
Smith-Waterman score: 1367; 50.7% identity (68.7% similar) in 454 aa overlap (1-431:17-432)

       10 20 30 40
pF1KB9                 MQKATYYDNAAAALFGGYSSYPGSNGFGFDVP--PQPPFQAATH
       ::::.::.: . :::::. .. .:...: : :: ::.
CCDS22 MLFEQGQQALELPECTMQKAAYYENPG--LFGGYGYSKTTDTYGYSTPHQPYPPPAAASS
       10 20 30 40 50

       50 60 70 80
pF1KB9 LEGDYQRSACSLQSLGNA-APHAKSKELNGSCMRPGLA----------PEPLSA---PPG
       :. :: ::::.:: . :: :. :::::::::: . : :.. ::
CCDS22 LDTDYPGSACSIQSSAPLRAPAHKGAELNGSCMRPGTGNSQGGGGGSQPPGLNSEQQPPQ
       60 70 80 90 100 110

       90 100 110 120 130 140
pF1KB9 SPPPSAAPTSATSNSSNGGGPSKSGPPKCGPGTNS---TLTKQIFPWMKESRQTSKLKNN
       ::: :: :. .: :: . :: ::...: :..::::::::::::.:: ::.
CCDS22 PPPPP--PTLPPSSPTNPGGGVPAKKPKGGPNASSSSATISKQIFPWMKESRQNSKQKNS
       120 130 140 150 160 170

       150 160 170 180 190 200
pF1KB9 SPGTAEGCGGGGGGGGGGGSGGSGGGGGGGGGGDKSPPGSAASKRARTAYTSAQLVELEK
       ..:.: :::::: : :::.::::::::::::::
CCDS22 CATAGESCE------------------------DKSPPGPA-SKRVRTAYTSAQLVELEK
       180 190 200 210

       210 220 230 240 250 260
pF1KB9 EFHFNRYLCRPRRVEMANLLNLSERQIKIWFQNRRMKYKKDQKAKGLASSSGGPSPAGSP
       ::::::::::::::::::::::.:::::::::::::::::::::::. : .. :: ::
CCDS22 EFHFNRYLCRPRRVEMANLLNLTERQIKIWFQNRRMKYKKDQKAKGILHSPASQSPERSP
       220 230 240 250 260 270

       270 280 290 300 310 320
pF1KB9 PQPMQSTAGFMNALHSMTP----SYESPSPPAFGKAHQNAYALPSNYQPPLKGCGAPQKY
       :. ..:: . .. : .:..::::::.:.. : :.: . : ::..: ::
CCDS22 --PLGGAAGHVAYSGQLPPVPGLAYDAPSPPAFAKSQPNMYGL-AAYTAPLSSCLPQQKR
       280 290 300 310 320 330

       330 340 350 360 370 380
pF1KB9 PPTPAPEYEPHVLQANGGAYGTPTMQGSPVYVGGGGYADPLPPPAGPSLYGLNHLSHHPS
       :::.::: . .:::.... ..:::::::::. ... . : .:: ...:.:::: :
CCDS22 --YAAPEFEPHPMASNGGGFASANLQGSPVYVGGN-FVESMAPASGP-VFNLGHLSHPSS
       330 340 350 360 370 380

       390 400 410 420 430
pF1KB9 GNLDYNGAPPMAPSQHHGPCEPHPTYTDLSSHHAPPPQGRIQEAPKLTHL
       ...::. : . ..:::::.:::::::::.::. :::. ::::::::
CCDS22 ASVDYSCAAQIPGNHHHGPCDPHPTYTDLSAHHS--SQGRLPEAPKLTHL
       390 400 410 420 430

>>CCDS5404.1 HOXA3 gene_id:3200|Hs108|chr7 (443 aa)
 initn: 831 init1: 605 opt: 959 Z-score: 470.1 bits: 96.1 E(32554): 7.3e-20
Smith-Waterman score: 1415; 50.4% identity (70.6% similar) in 476 aa overlap (1-431:1-443)

       10 20 30 40 50
pF1KB9 MQKATYYDNAAAALFGGYSSYPGSNGFGFDVPPQP-PFQAATHLEGDYQRSACSLQSLGN
       ::::::::..: ..::: : ..:::.... :: : .:: .:.:.: :::::: ..
CCDS54 MQKATYYDSSA--IYGGYP-YQAANGFAYNANQQPYPASAALGADGEYHRPACSLQSPSS
       10 20 30 40 50

       60 70 80 90 100
pF1KB9 AAPHAKSKELNGSCMR---------PGLA------PEPLSAPPGSPPPSAAPTSATSNSS
       :. : :..::. .:.: :.:. : : .:::. ::. :: . . .
CCDS54 AGGHPKAHELSEACLRTLSAPPSQPPSLGEPPLHPPPPQAAPPAPQPPQPAPQPPAPTPA
       60 70 80 90 100 110

       110 120 130 140 150
pF1KB9 NGGGPSKSGPPKCG------------PGTNS-TLTKQIFPWMKESRQTSKLKNNSPGTAE
       ::...::. . : :: :..::::::::::::..: :..: ...:
CCDS54 APPPPSSASPPQNASNNPTPANAAKSPLLNSPTVAKQIFPWMKESRQNTKQKTSSSSSGE
       120 130 140 150 160 170

       160 170 180 190 200 210
pF1KB9 GCGGGGGGGGGGGSGGSGGGGGGGGGGDKSPPGSAASKRARTAYTSAQLVELEKEFHFNR
       .:.: ::::::.:.::::::::::::::::::::::::
CCDS54 SCAG-----------------------DKSPPGQASSKRARTAYTSAQLVELEKEFHFNR
       180 190 200 210

       220 230 240 250 260 270
pF1KB9 YLCRPRRVEMANLLNLSERQIKIWFQNRRMKYKKDQKAKGLASSSGGPSPAGSPPQPMQS
       ::::::::::::::::.::::::::::::::::::::.::. .:::: ::. :: : .
CCDS54 YLCRPRRVEMANLLNLTERQIKIWFQNRRMKYKKDQKGKGMLTSSGGQSPSRSPVPP--G
       220 230 240 250 260 270

       280 290 300 310 320
pF1KB9 TAGFMNALHSMTPS--YESPSPPAFGKAHQNAYALP-SNYQPPLKGCGAP----QKYPPT
       ..:..:..::.. : :: ::: :.: :..:.:: ..: : .:. : ..: .
CCDS54 AGGYLNSMHSLVNSVPYEPQSPPPFSKPPQGTYGLPPASYPASLPSCAPPPPPQKRYTAA
       280 290 300 310 320 330

       330 340 350 360 370
pF1KB9 PA-----PEYEPHV--LQANGGAYGTPTMQGSPVYVGGGGYADPLPPPAGPSLYGLNHLS
       : :.:.::. ::.:: .:::: .:::::.:::. :..:. .::.:.::.::
CCDS54 GAGAGGTPDYDPHAHGLQGNG-SYGTPHIQGSPVFVGGS-YVEPMSN-SGPALFGLTHLP
       340 350 360 370 380

       380 390 400 410 420 430
pF1KB9 HHPSGNLDYNGAPPMAPSQHHGPC--EPHPTYTDLSSHHAPPPQGRIQEAPKLTHL
       : :: .::.:: :.. ..:::: :::::::::..:: : :::::::::::::
CCDS54 HAASGAMDYGGAGPLGSGHHHGPGPGEPHPTYTDLTGHH--PSQGRIQEAPKLTHL
       390 400 410 420 430 440

431 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 18:32:41 2016 done: Fri Nov 4 18:32:42 2016
 Total Scan time: 3.340 Total Display time: 0.020

Function used was FASTA [36.3.4 Apr, 2011]