FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB4575, 885 aa 1>>>pF1KB4575 885 - 885 aa - 885 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.6289+/-0.00103; mu= 10.5189+/- 0.062 mean_var=137.0924+/-27.012, 0's: 0 Z-trim(107.8): 25 B-trim: 0 in 0/50 Lambda= 0.109539 statistics sampled from 9789 (9801) to 9789 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.652), E-opt: 0.2 (0.301), width: 16 Scan time: 4.410 The best scores are: opt bits E(32554) CCDS31623.1 KMT5B gene_id:51111|Hs108|chr11 ( 885) 5897 944.2 0 CCDS44660.1 KMT5B gene_id:51111|Hs108|chr11 ( 393) 2664 433.1 4.8e-121 CCDS76444.1 KMT5B gene_id:51111|Hs108|chr11 ( 370) 1827 300.8 3e-81 CCDS12922.1 KMT5C gene_id:84787|Hs108|chr19 ( 462) 983 167.5 5.1e-41 >>CCDS31623.1 KMT5B gene_id:51111|Hs108|chr11 (885 aa) initn: 5897 init1: 5897 opt: 5897 Z-score: 5042.6 bits: 944.2 E(32554): 0 Smith-Waterman score: 5897; 100.0% identity (100.0% similar) in 885 aa overlap (1-885:1-885) 10 20 30 40 50 60 pF1KB4 MKWLGESKNMVVNGRRNGGKLSNDHQQNQSKLQHTGKDTLKAGKNAVERRSNRCNGNSGF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 MKWLGESKNMVVNGRRNGGKLSNDHQQNQSKLQHTGKDTLKAGKNAVERRSNRCNGNSGF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 EGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMNTSAFPSRSSRHFSKSDSFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 EGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMNTSAFPSRSSRHFSKSDSFS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 HNNPVRFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGEWARHYFLNKNKMQEKLFKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 HNNPVRFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGEWARHYFLNKNKMQEKLFKE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB4 HVFIYLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRNDKIELLVGCIAELSEIEEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 HVFIYLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRNDKIELLVGCIAELSEIEEN 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB4 MLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 MLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB4 EEISCYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAPAPVINSKYGLRETDKRLNR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 EEISCYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAPAPVINSKYGLRETDKRLNR 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB4 LKKLGDSSKNSDSQSVSSNTDADTTQEKNNATSNRKSSVGVKKNSKSRTLTRQSMSRIPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 LKKLGDSSKNSDSQSVSSNTDADTTQEKNNATSNRKSSVGVKKNSKSRTLTRQSMSRIPA 370 380 390 400 410 420 430 440 450 460 470 480 pF1KB4 SSNSTSSKLTHINNSRVPKKLKKPAKPLLSKIKLRNHCKRLEQKNASRKLEMGNLVLKEP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 SSNSTSSKLTHINNSRVPKKLKKPAKPLLSKIKLRNHCKRLEQKNASRKLEMGNLVLKEP 430 440 450 460 470 480 490 500 510 520 530 540 pF1KB4 KVVLYKNLPIKKDKEPEGPAQAAVASGCLTRHAAREHRQNPVRGAHSQGESSPCTYITRR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 KVVLYKNLPIKKDKEPEGPAQAAVASGCLTRHAAREHRQNPVRGAHSQGESSPCTYITRR 490 500 510 520 530 540 550 560 570 580 590 600 pF1KB4 SVRTRTNLKEASDIKLEPNTLNGYKSSVTEPCPDSGEQLQPAPVLQEEELAHETAQKGEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 SVRTRTNLKEASDIKLEPNTLNGYKSSVTEPCPDSGEQLQPAPVLQEEELAHETAQKGEA 550 560 570 580 590 600 610 620 630 640 650 660 pF1KB4 KCHKSDTGMSKKKSRQGKLVKQFAKIEESTPVHDSPGKDDAVPDLMGPHSDQGEHSGTVG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 KCHKSDTGMSKKKSRQGKLVKQFAKIEESTPVHDSPGKDDAVPDLMGPHSDQGEHSGTVG 610 620 630 640 650 660 670 680 690 700 710 720 pF1KB4 VPVSYTDCAPSPVGCSVVTSDSFKTKDSFRTAKSKKKRRITRYDAQLILENNSGIPKLTL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 VPVSYTDCAPSPVGCSVVTSDSFKTKDSFRTAKSKKKRRITRYDAQLILENNSGIPKLTL 670 680 690 700 710 720 730 740 750 760 770 780 pF1KB4 RRRHDSSSKTNDQENDGMNSSKISIKLSKDHDNDNNLYVAKLNNGFNSGSGSSSTKLKIQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 RRRHDSSSKTNDQENDGMNSSKISIKLSKDHDNDNNLYVAKLNNGFNSGSGSSSTKLKIQ 730 740 750 760 770 780 790 800 810 820 830 840 pF1KB4 LKRDEENRGSYTEGLHENGVCCSDPLSLLESRMEVDDYSQYEEESTDDSSSSEGDEEEDD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS31 LKRDEENRGSYTEGLHENGVCCSDPLSLLESRMEVDDYSQYEEESTDDSSSSEGDEEEDD 790 800 810 820 830 840 850 860 870 880 pF1KB4 YDDDFEDDFIPLPPAKRLRLIVGKDSIDIDISSRRREDQSLRLNA ::::::::::::::::::::::::::::::::::::::::::::: CCDS31 YDDDFEDDFIPLPPAKRLRLIVGKDSIDIDISSRRREDQSLRLNA 850 860 870 880 >>CCDS44660.1 KMT5B gene_id:51111|Hs108|chr11 (393 aa) initn: 2664 init1: 2664 opt: 2664 Z-score: 2286.7 bits: 433.1 E(32554): 4.8e-121 Smith-Waterman score: 2664; 99.7% identity (100.0% similar) in 392 aa overlap (1-392:1-392) 10 20 30 40 50 60 pF1KB4 MKWLGESKNMVVNGRRNGGKLSNDHQQNQSKLQHTGKDTLKAGKNAVERRSNRCNGNSGF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 MKWLGESKNMVVNGRRNGGKLSNDHQQNQSKLQHTGKDTLKAGKNAVERRSNRCNGNSGF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 EGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMNTSAFPSRSSRHFSKSDSFS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 EGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMNTSAFPSRSSRHFSKSDSFS 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB4 HNNPVRFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGEWARHYFLNKNKMQEKLFKE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 HNNPVRFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGEWARHYFLNKNKMQEKLFKE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB4 HVFIYLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRNDKIELLVGCIAELSEIEEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 HVFIYLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRNDKIELLVGCIAELSEIEEN 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB4 MLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 MLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB4 EEISCYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAPAPVINSKYGLRETDKRLNR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS44 EEISCYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAPAPVINSKYGLRETDKRLNR 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB4 LKKLGDSSKNSDSQSVSSNTDADTTQEKNNATSNRKSSVGVKKNSKSRTLTRQSMSRIPA :::::::::::::::::::::::::::::::. CCDS44 LKKLGDSSKNSDSQSVSSNTDADTTQEKNNASK 370 380 390 430 440 450 460 470 480 pF1KB4 SSNSTSSKLTHINNSRVPKKLKKPAKPLLSKIKLRNHCKRLEQKNASRKLEMGNLVLKEP >>CCDS76444.1 KMT5B gene_id:51111|Hs108|chr11 (370 aa) initn: 2494 init1: 1819 opt: 1827 Z-score: 1572.2 bits: 300.8 E(32554): 3e-81 Smith-Waterman score: 2452; 93.9% identity (94.1% similar) in 392 aa overlap (1-392:1-369) 10 20 30 40 50 60 pF1KB4 MKWLGESKNMVVNGRRNGGKLSNDHQQNQSKLQHTGKDTLKAGKNAVERRSNRCNGNSGF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 MKWLGESKNMVVNGRRNGGKLSNDHQQNQSKLQHTGKDTLKAGKNAVERRSNRCNGNSGF 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB4 EGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMNTSAFPSRSSRHFSKSDSFS :::::::::::::::::::::::::::::::::::::::::: CCDS76 EGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMNT------------------ 70 80 90 100 130 140 150 160 170 180 pF1KB4 HNNPVRFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGEWARHYFLNKNKMQEKLFKE ::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 -----RFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGEWARHYFLNKNKMQEKLFKE 110 120 130 140 150 190 200 210 220 230 240 pF1KB4 HVFIYLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRNDKIELLVGCIAELSEIEEN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 HVFIYLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRNDKIELLVGCIAELSEIEEN 160 170 180 190 200 210 250 260 270 280 290 300 pF1KB4 MLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 MLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPG 220 230 240 250 260 270 310 320 330 340 350 360 pF1KB4 EEISCYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAPAPVINSKYGLRETDKRLNR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS76 EEISCYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAPAPVINSKYGLRETDKRLNR 280 290 300 310 320 330 370 380 390 400 410 420 pF1KB4 LKKLGDSSKNSDSQSVSSNTDADTTQEKNNATSNRKSSVGVKKNSKSRTLTRQSMSRIPA :::::::::::::::::::::::::::::::. CCDS76 LKKLGDSSKNSDSQSVSSNTDADTTQEKNNASK 340 350 360 370 430 440 450 460 470 480 pF1KB4 SSNSTSSKLTHINNSRVPKKLKKPAKPLLSKIKLRNHCKRLEQKNASRKLEMGNLVLKEP >>CCDS12922.1 KMT5C gene_id:84787|Hs108|chr19 (462 aa) initn: 835 init1: 598 opt: 983 Z-score: 849.9 bits: 167.5 E(32554): 5.1e-41 Smith-Waterman score: 1124; 58.1% identity (76.1% similar) in 289 aa overlap (72-360:6-270) 50 60 70 80 90 100 pF1KB4 AGKNAVERRSNRCNGNSGFEGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMN ..:.::::::::::::::::::::.::::: CCDS12 MGPDRVTARELCENDDLATSLVLDPYLGFRTHKMN 10 20 30 110 120 130 140 150 160 pF1KB4 TSAFPSRSSRHFSKSDSFSHNNPVRFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGE .: : :.: ::..:. ..: : ... :: :.. :: : CCDS12 VSPVP-----------------PLR------RQQHLRSALETFLRQRDLEAAYRALTLGG 40 50 60 70 170 180 190 200 210 220 pF1KB4 WARHYFLNKNKMQEKLFKEHVFIYLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRN :. .:: ... :: .: ::. ::: : .::: ::::.::: : ::::::.:. ::.: CCDS12 WTARYFQSRGPRQEAALKTHVYRYLRAFLPESGFTILPCTRYSMETNGAKIVSTRAWKKN 80 90 100 110 120 130 230 240 250 260 270 280 pF1KB4 DKIELLVGCIAELSEIEENMLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKF .:.:::::::::: : .:. ::: ::::::.:::::: :::::::::::::::.::::: CCDS12 EKLELLVGCIAELREADEG-LLRAGENDFSIMYSTRKRSAQLWLGPAAFINHDCKPNCKF 140 150 160 170 180 190 290 300 310 320 330 340 pF1KB4 VSTGRDTACVKALRDIEPGEEISCYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAP : . ..::::.:::::::.:..:.::.:::::.:: :::.::::.: :::..: :: CCDS12 VPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECHTCERKGEGAFRTRPREPAL 200 210 220 230 240 250 350 360 370 380 390 400 pF1KB4 APVINSKYGLRETDKRLNRLKKLGDSSKNSDSQSVSSNTDADTTQEKNNATSNRKSSVGV : .:: :::: .::.. CCDS12 PPRPLDKYQLRETKRRLQQGLDSGSRQGLLGPRACVHPSPLRRDPFCAACQPLRLPACSA 260 270 280 290 300 310 885 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Nov 3 15:11:06 2016 done: Thu Nov 3 15:11:07 2016 Total Scan time: 4.410 Total Display time: 0.060 Function used was FASTA [36.3.4 Apr, 2011]