FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB4575, 885 aa
1>>>pF1KB4575 885 - 885 aa - 885 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.6289+/-0.00103; mu= 10.5189+/- 0.062
mean_var=137.0924+/-27.012, 0's: 0 Z-trim(107.8): 25 B-trim: 0 in 0/50
Lambda= 0.109539
statistics sampled from 9789 (9801) to 9789 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.652), E-opt: 0.2 (0.301), width: 16
Scan time: 4.410
The best scores are:                                opt  bits E(32554)
CCDS31623.1 KMT5B gene_id:51111|Hs108|chr11 ( 885) 5897 944.2        0
CCDS44660.1 KMT5B gene_id:51111|Hs108|chr11 ( 393) 2664 433.1 4.8e-121
CCDS76444.1 KMT5B gene_id:51111|Hs108|chr11 ( 370) 1827 300.8    3e-81
CCDS12922.1 KMT5C gene_id:84787|Hs108|chr19 ( 462)  983 167.5  5.1e-41
>>CCDS31623.1 KMT5B gene_id:51111|Hs108|chr11 (885 aa)
initn: 5897 init1: 5897 opt: 5897 Z-score: 5042.6 bits: 944.2 E(32554): 0
Smith-Waterman score: 5897; 100.0% identity (100.0% similar) in 885 aa overlap (1-885:1-885)
10 20 30 40 50 60
pF1KB4 MKWLGESKNMVVNGRRNGGKLSNDHQQNQSKLQHTGKDTLKAGKNAVERRSNRCNGNSGF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MKWLGESKNMVVNGRRNGGKLSNDHQQNQSKLQHTGKDTLKAGKNAVERRSNRCNGNSGF
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB4 EGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMNTSAFPSRSSRHFSKSDSFS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 EGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMNTSAFPSRSSRHFSKSDSFS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB4 HNNPVRFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGEWARHYFLNKNKMQEKLFKE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 HNNPVRFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGEWARHYFLNKNKMQEKLFKE
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB4 HVFIYLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRNDKIELLVGCIAELSEIEEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 HVFIYLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRNDKIELLVGCIAELSEIEEN
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB4 MLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPG
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB4 EEISCYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAPAPVINSKYGLRETDKRLNR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 EEISCYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAPAPVINSKYGLRETDKRLNR
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB4 LKKLGDSSKNSDSQSVSSNTDADTTQEKNNATSNRKSSVGVKKNSKSRTLTRQSMSRIPA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 LKKLGDSSKNSDSQSVSSNTDADTTQEKNNATSNRKSSVGVKKNSKSRTLTRQSMSRIPA
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB4 SSNSTSSKLTHINNSRVPKKLKKPAKPLLSKIKLRNHCKRLEQKNASRKLEMGNLVLKEP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 SSNSTSSKLTHINNSRVPKKLKKPAKPLLSKIKLRNHCKRLEQKNASRKLEMGNLVLKEP
430 440 450 460 470 480
490 500 510 520 530 540
pF1KB4 KVVLYKNLPIKKDKEPEGPAQAAVASGCLTRHAAREHRQNPVRGAHSQGESSPCTYITRR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 KVVLYKNLPIKKDKEPEGPAQAAVASGCLTRHAAREHRQNPVRGAHSQGESSPCTYITRR
490 500 510 520 530 540
550 560 570 580 590 600
pF1KB4 SVRTRTNLKEASDIKLEPNTLNGYKSSVTEPCPDSGEQLQPAPVLQEEELAHETAQKGEA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 SVRTRTNLKEASDIKLEPNTLNGYKSSVTEPCPDSGEQLQPAPVLQEEELAHETAQKGEA
550 560 570 580 590 600
610 620 630 640 650 660
pF1KB4 KCHKSDTGMSKKKSRQGKLVKQFAKIEESTPVHDSPGKDDAVPDLMGPHSDQGEHSGTVG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 KCHKSDTGMSKKKSRQGKLVKQFAKIEESTPVHDSPGKDDAVPDLMGPHSDQGEHSGTVG
610 620 630 640 650 660
670 680 690 700 710 720
pF1KB4 VPVSYTDCAPSPVGCSVVTSDSFKTKDSFRTAKSKKKRRITRYDAQLILENNSGIPKLTL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 VPVSYTDCAPSPVGCSVVTSDSFKTKDSFRTAKSKKKRRITRYDAQLILENNSGIPKLTL
670 680 690 700 710 720
730 740 750 760 770 780
pF1KB4 RRRHDSSSKTNDQENDGMNSSKISIKLSKDHDNDNNLYVAKLNNGFNSGSGSSSTKLKIQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 RRRHDSSSKTNDQENDGMNSSKISIKLSKDHDNDNNLYVAKLNNGFNSGSGSSSTKLKIQ
730 740 750 760 770 780
790 800 810 820 830 840
pF1KB4 LKRDEENRGSYTEGLHENGVCCSDPLSLLESRMEVDDYSQYEEESTDDSSSSEGDEEEDD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 LKRDEENRGSYTEGLHENGVCCSDPLSLLESRMEVDDYSQYEEESTDDSSSSEGDEEEDD
790 800 810 820 830 840
850 860 870 880
pF1KB4 YDDDFEDDFIPLPPAKRLRLIVGKDSIDIDISSRRREDQSLRLNA
:::::::::::::::::::::::::::::::::::::::::::::
CCDS31 YDDDFEDDFIPLPPAKRLRLIVGKDSIDIDISSRRREDQSLRLNA
850 860 870 880
>>CCDS44660.1 KMT5B gene_id:51111|Hs108|chr11 (393 aa)
initn: 2664 init1: 2664 opt: 2664 Z-score: 2286.7 bits: 433.1 E(32554): 4.8e-121
Smith-Waterman score: 2664; 99.7% identity (100.0% similar) in 392 aa overlap (1-392:1-392)
10 20 30 40 50 60
pF1KB4 MKWLGESKNMVVNGRRNGGKLSNDHQQNQSKLQHTGKDTLKAGKNAVERRSNRCNGNSGF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MKWLGESKNMVVNGRRNGGKLSNDHQQNQSKLQHTGKDTLKAGKNAVERRSNRCNGNSGF
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB4 EGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMNTSAFPSRSSRHFSKSDSFS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 EGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMNTSAFPSRSSRHFSKSDSFS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB4 HNNPVRFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGEWARHYFLNKNKMQEKLFKE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 HNNPVRFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGEWARHYFLNKNKMQEKLFKE
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB4 HVFIYLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRNDKIELLVGCIAELSEIEEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 HVFIYLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRNDKIELLVGCIAELSEIEEN
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB4 MLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPG
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB4 EEISCYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAPAPVINSKYGLRETDKRLNR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 EEISCYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAPAPVINSKYGLRETDKRLNR
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB4 LKKLGDSSKNSDSQSVSSNTDADTTQEKNNATSNRKSSVGVKKNSKSRTLTRQSMSRIPA
:::::::::::::::::::::::::::::::.
CCDS44 LKKLGDSSKNSDSQSVSSNTDADTTQEKNNASK
370 380 390
430 440 450 460 470 480
pF1KB4 SSNSTSSKLTHINNSRVPKKLKKPAKPLLSKIKLRNHCKRLEQKNASRKLEMGNLVLKEP
>>CCDS76444.1 KMT5B gene_id:51111|Hs108|chr11 (370 aa)
initn: 2494 init1: 1819 opt: 1827 Z-score: 1572.2 bits: 300.8 E(32554): 3e-81
Smith-Waterman score: 2452; 93.9% identity (94.1% similar) in 392 aa overlap (1-392:1-369)
10 20 30 40 50 60
pF1KB4 MKWLGESKNMVVNGRRNGGKLSNDHQQNQSKLQHTGKDTLKAGKNAVERRSNRCNGNSGF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 MKWLGESKNMVVNGRRNGGKLSNDHQQNQSKLQHTGKDTLKAGKNAVERRSNRCNGNSGF
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB4 EGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMNTSAFPSRSSRHFSKSDSFS
::::::::::::::::::::::::::::::::::::::::::
CCDS76 EGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMNT------------------
70 80 90 100
130 140 150 160 170 180
pF1KB4 HNNPVRFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGEWARHYFLNKNKMQEKLFKE
:::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 -----RFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGEWARHYFLNKNKMQEKLFKE
110 120 130 140 150
190 200 210 220 230 240
pF1KB4 HVFIYLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRNDKIELLVGCIAELSEIEEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 HVFIYLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRNDKIELLVGCIAELSEIEEN
160 170 180 190 200 210
250 260 270 280 290 300
pF1KB4 MLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 MLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKALRDIEPG
220 230 240 250 260 270
310 320 330 340 350 360
pF1KB4 EEISCYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAPAPVINSKYGLRETDKRLNR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS76 EEISCYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAPAPVINSKYGLRETDKRLNR
280 290 300 310 320 330
370 380 390 400 410 420
pF1KB4 LKKLGDSSKNSDSQSVSSNTDADTTQEKNNATSNRKSSVGVKKNSKSRTLTRQSMSRIPA
:::::::::::::::::::::::::::::::.
CCDS76 LKKLGDSSKNSDSQSVSSNTDADTTQEKNNASK
340 350 360 370
430 440 450 460 470 480
pF1KB4 SSNSTSSKLTHINNSRVPKKLKKPAKPLLSKIKLRNHCKRLEQKNASRKLEMGNLVLKEP
>>CCDS12922.1 KMT5C gene_id:84787|Hs108|chr19 (462 aa)
initn: 835 init1: 598 opt: 983 Z-score: 849.9 bits: 167.5 E(32554): 5.1e-41
Smith-Waterman score: 1124; 58.1% identity (76.1% similar) in 289 aa overlap (72-360:6-270)
50 60 70 80 90 100
pF1KB4 AGKNAVERRSNRCNGNSGFEGQSRYVPSSGMSAKELCENDDLATSLVLDPYLGFQTHKMN
..:.::::::::::::::::::::.:::::
CCDS12 MGPDRVTARELCENDDLATSLVLDPYLGFRTHKMN
10 20 30
110 120 130 140 150 160
pF1KB4 TSAFPSRSSRHFSKSDSFSHNNPVRFRPIKGRQEELKEVIERFKKDEHLEKAFKCLTSGE
.: : :.: ::..:. ..: : ... :: :.. :: :
CCDS12 VSPVP-----------------PLR------RQQHLRSALETFLRQRDLEAAYRALTLGG
40 50 60 70
170 180 190 200 210 220
pF1KB4 WARHYFLNKNKMQEKLFKEHVFIYLRMFATDSGFEILPCNRYSSEQNGAKIVATKEWKRN
:. .:: ... :: .: ::. ::: : .::: ::::.::: : ::::::.:. ::.:
CCDS12 WTARYFQSRGPRQEAALKTHVYRYLRAFLPESGFTILPCTRYSMETNGAKIVSTRAWKKN
80 90 100 110 120 130
230 240 250 260 270 280
pF1KB4 DKIELLVGCIAELSEIEENMLLRHGENDFSVMYSTRKNCAQLWLGPAAFINHDCRPNCKF
.:.:::::::::: : .:. ::: ::::::.:::::: :::::::::::::::.:::::
CCDS12 EKLELLVGCIAELREADEG-LLRAGENDFSIMYSTRKRSAQLWLGPAAFINHDCKPNCKF
140 150 160 170 180 190
290 300 310 320 330 340
pF1KB4 VSTGRDTACVKALRDIEPGEEISCYYGDGFFGENNEFCECYTCERRGTGAFKSRVGLPAP
: . ..::::.:::::::.:..:.::.:::::.:: :::.::::.: :::..: ::
CCDS12 VPADGNAACVKVLRDIEPGDEVTCFYGEGFFGEKNEHCECHTCERKGEGAFRTRPREPAL
200 210 220 230 240 250
350 360 370 380 390 400
pF1KB4 APVINSKYGLRETDKRLNRLKKLGDSSKNSDSQSVSSNTDADTTQEKNNATSNRKSSVGV
: .:: :::: .::..
CCDS12 PPRPLDKYQLRETKRRLQQGLDSGSRQGLLGPRACVHPSPLRRDPFCAACQPLRLPACSA
260 270 280 290 300 310
885 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 15:11:06 2016 done: Thu Nov 3 15:11:07 2016
Total Scan time: 4.410 Total Display time: 0.060
Function used was FASTA [36.3.4 Apr, 2011]
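
The "best scores" summary lines in this report follow a regular layout: identifier, description, library sequence length in parentheses, then the opt score, bit score, and E() value. The following is a minimal sketch of reading those lines into records, assuming that layout holds for every hit line. The names Hit, HIT_RE, and parse_best_scores are hypothetical helpers introduced for illustration and are not part of the FASTA package.

```python
import re
from typing import List, NamedTuple


class Hit(NamedTuple):
    """One row of the 'best scores' summary (hypothetical record type)."""
    accession: str
    description: str
    length: int
    opt: int
    bits: float
    evalue: float


# Assumed layout: "<id> <description> ( <len>) <opt> <bits> <E>"
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<len>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<e>\S+)\s*$"
)


def parse_best_scores(lines) -> List[Hit]:
    """Collect every line that matches the summary layout; skip the rest."""
    hits = []
    for line in lines:
        m = HIT_RE.match(line.strip())
        if m is None:
            continue  # header, alignment, or statistics line
        hits.append(
            Hit(
                accession=m["acc"],
                description=m["desc"],
                length=int(m["len"]),
                opt=int(m["opt"]),
                bits=float(m["bits"]),
                evalue=float(m["e"]),
            )
        )
    return hits


# Summary lines copied from the report above.
report = """\
The best scores are: opt bits E(32554)
CCDS31623.1 KMT5B gene_id:51111|Hs108|chr11 ( 885) 5897 944.2 0
CCDS44660.1 KMT5B gene_id:51111|Hs108|chr11 ( 393) 2664 433.1 4.8e-121
CCDS76444.1 KMT5B gene_id:51111|Hs108|chr11 ( 370) 1827 300.8 3e-81
CCDS12922.1 KMT5C gene_id:84787|Hs108|chr19 ( 462) 983 167.5 5.1e-41
>>CCDS31623.1 KMT5B gene_id:51111|Hs108|chr11 (885 aa)
"""

hits = parse_best_scores(report.splitlines())
for h in hits:
    print(h.accession, h.length, h.opt, h.bits, h.evalue)
```

The pattern requires the parenthesized length to be followed directly by three score fields, so the ">>" alignment headers, which end in "(885 aa)", are skipped. Other FASTA output modes may lay these lines out differently, so treat the pattern as specific to this report.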