FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5154, 628 aa
  1>>>pF1KB5154 628 - 628 aa - 628 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.3231+/-0.000839; mu= 11.5444+/- 0.051
 mean_var=143.6798+/-29.289, 0's: 0 Z-trim(112.0): 13 B-trim: 5 in 1/51
 Lambda= 0.106998
 statistics sampled from 12829 (12840) to 12829 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.729), E-opt: 0.2 (0.394), width: 16
 Scan time: 4.040

The best scores are:                                       opt  bits E(32554)
CCDS6101.1 ASH2L gene_id:9070|Hs108|chr8           ( 628) 4339 681.5 9.1e-196
CCDS47840.1 ASH2L gene_id:9070|Hs108|chr8          ( 534) 3733 587.9 1.2e-167
CCDS64872.1 ASH2L gene_id:9070|Hs108|chr8          ( 489) 3416 538.9 5.8e-153
CCDS59100.1 ASH2L gene_id:9070|Hs108|chr8          ( 501) 3112 492.0 7.9e-139

>>CCDS6101.1 ASH2L gene_id:9070|Hs108|chr8  (628 aa)
 initn: 4339 init1: 4339 opt: 4339 Z-score: 3628.1 bits: 681.5 E(32554): 9.1e-196
Smith-Waterman score: 4339; 100.0% identity (100.0% similar) in 628 aa overlap (1-628:1-628)

               10        20        30        40        50        60
pF1KB5 MAAAGAGPGQEAGAGPGPGAVANATGAEEGEMKPVAAGAAAPPGEGISAAPTVEPSSGEA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 MAAAGAGPGQEAGAGPGPGAVANATGAEEGEMKPVAAGAAAPPGEGISAAPTVEPSSGEA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 EGGEANLVDVSGGLETESSNGKDTLEGAGDTSEVMDTQAGSVDEENGRQLGEVELQCGIC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 EGGEANLVDVSGGLETESSNGKDTLEGAGDTSEVMDTQAGSVDEENGRQLGEVELQCGIC
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 TKWFTADTFGIDTSSCLPFMTNYSFHCNVCHHSGNTYFLRKQANLKEMCLSALANLTWQS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 TKWFTADTFGIDTSSCLPFMTNYSFHCNVCHHSGNTYFLRKQANLKEMCLSALANLTWQS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 RTQDEHPKTMFSKDKDIIPFIDKYWECMTTRQRPGKMTWPNNIVKTMSKERDVFLVKEHP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 RTQDEHPKTMFSKDKDIIPFIDKYWECMTTRQRPGKMTWPNNIVKTMSKERDVFLVKEHP
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB5 DPGSKDPEEDYPKFGLLDQDLSNIGPAYDNQKQSSAVSTSGNLNGGIAAGSSGKGRGAKR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 DPGSKDPEEDYPKFGLLDQDLSNIGPAYDNQKQSSAVSTSGNLNGGIAAGSSGKGRGAKR
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB5 KQQDGGTTGTTKKARSDPLFSAQRLPPHGYPLEHPFNKDGYRYILAEPDPHAPDPEKLEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 KQQDGGTTGTTKKARSDPLFSAQRLPPHGYPLEHPFNKDGYRYILAEPDPHAPDPEKLEL
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB5 DCWAGKPIPGDLYRACLYERVLLALHDRAPQLKISDDRLTVVGEKGYSMVRASHGVRKGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 DCWAGKPIPGDLYRACLYERVLLALHDRAPQLKISDDRLTVVGEKGYSMVRASHGVRKGA
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB5 WYFEITVDEMPPDTAARLGWSQPLGNLQAPLGYDKFSYSWRSKKGTKFHQSIGKHYSSGY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 WYFEITVDEMPPDTAARLGWSQPLGNLQAPLGYDKFSYSWRSKKGTKFHQSIGKHYSSGY
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KB5 GQGDVLGFYINLPEDTETAKSLPDTYKDKALIKFKSYLYFEEKDFVDKAEKSLKQTPHSE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 GQGDVLGFYINLPEDTETAKSLPDTYKDKALIKFKSYLYFEEKDFVDKAEKSLKQTPHSE
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KB5 IIFYKNGVNQGVAYKDIFEGVYFPAISLYKSCTVSINFGPCFKYPPKDLTYRPMSDMGWG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS61 IIFYKNGVNQGVAYKDIFEGVYFPAISLYKSCTVSINFGPCFKYPPKDLTYRPMSDMGWG
              550       560       570       580       590       600

              610       620
pF1KB5 AVVEHTLADVLYHVETEVDGRRSPPWEP
       ::::::::::::::::::::::::::::
CCDS61 AVVEHTLADVLYHVETEVDGRRSPPWEP
              610       620

>>CCDS47840.1 ASH2L gene_id:9070|Hs108|chr8  (534 aa)
 initn: 3733 init1: 3733 opt: 3733 Z-score: 3123.6 bits: 587.9 E(32554): 1.2e-167
Smith-Waterman score: 3733; 100.0% identity (100.0% similar) in 534 aa overlap (95-628:1-534)

           70        80        90       100       110       120
pF1KB5 ANLVDVSGGLETESSNGKDTLEGAGDTSEVMDTQAGSVDEENGRQLGEVELQCGICTKWF
                                     ::::::::::::::::::::::::::::::
CCDS47                               MDTQAGSVDEENGRQLGEVELQCGICTKWF
                                             10        20        30

          130       140       150       160       170       180
pF1KB5 TADTFGIDTSSCLPFMTNYSFHCNVCHHSGNTYFLRKQANLKEMCLSALANLTWQSRTQD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 TADTFGIDTSSCLPFMTNYSFHCNVCHHSGNTYFLRKQANLKEMCLSALANLTWQSRTQD
               40        50        60        70        80        90

          190       200       210       220       230       240
pF1KB5 EHPKTMFSKDKDIIPFIDKYWECMTTRQRPGKMTWPNNIVKTMSKERDVFLVKEHPDPGS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 EHPKTMFSKDKDIIPFIDKYWECMTTRQRPGKMTWPNNIVKTMSKERDVFLVKEHPDPGS
              100       110       120       130       140       150

          250       260       270       280       290       300
pF1KB5 KDPEEDYPKFGLLDQDLSNIGPAYDNQKQSSAVSTSGNLNGGIAAGSSGKGRGAKRKQQD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KDPEEDYPKFGLLDQDLSNIGPAYDNQKQSSAVSTSGNLNGGIAAGSSGKGRGAKRKQQD
              160       170       180       190       200       210

          310       320       330       340       350       360
pF1KB5 GGTTGTTKKARSDPLFSAQRLPPHGYPLEHPFNKDGYRYILAEPDPHAPDPEKLELDCWA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 GGTTGTTKKARSDPLFSAQRLPPHGYPLEHPFNKDGYRYILAEPDPHAPDPEKLELDCWA
              220       230       240       250       260       270

          370       380       390       400       410       420
pF1KB5 GKPIPGDLYRACLYERVLLALHDRAPQLKISDDRLTVVGEKGYSMVRASHGVRKGAWYFE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 GKPIPGDLYRACLYERVLLALHDRAPQLKISDDRLTVVGEKGYSMVRASHGVRKGAWYFE
              280       290       300       310       320       330

          430       440       450       460       470       480
pF1KB5 ITVDEMPPDTAARLGWSQPLGNLQAPLGYDKFSYSWRSKKGTKFHQSIGKHYSSGYGQGD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 ITVDEMPPDTAARLGWSQPLGNLQAPLGYDKFSYSWRSKKGTKFHQSIGKHYSSGYGQGD
              340       350       360       370       380       390

          490       500       510       520       530       540
pF1KB5 VLGFYINLPEDTETAKSLPDTYKDKALIKFKSYLYFEEKDFVDKAEKSLKQTPHSEIIFY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 VLGFYINLPEDTETAKSLPDTYKDKALIKFKSYLYFEEKDFVDKAEKSLKQTPHSEIIFY
              400       410       420       430       440       450

          550       560       570       580       590       600
pF1KB5 KNGVNQGVAYKDIFEGVYFPAISLYKSCTVSINFGPCFKYPPKDLTYRPMSDMGWGAVVE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KNGVNQGVAYKDIFEGVYFPAISLYKSCTVSINFGPCFKYPPKDLTYRPMSDMGWGAVVE
              460       470       480       490       500       510

          610       620
pF1KB5 HTLADVLYHVETEVDGRRSPPWEP
       ::::::::::::::::::::::::
CCDS47 HTLADVLYHVETEVDGRRSPPWEP
              520       530

>>CCDS64872.1 ASH2L gene_id:9070|Hs108|chr8  (489 aa)
 initn: 3416 init1: 3416 opt: 3416 Z-score: 2859.6 bits: 538.9 E(32554): 5.8e-153
Smith-Waterman score: 3416; 100.0% identity (100.0% similar) in 489 aa overlap (140-628:1-489)

     110       120       130       140       150       160
pF1KB5 LGEVELQCGICTKWFTADTFGIDTSSCLPFMTNYSFHCNVCHHSGNTYFLRKQANLKEMC
                                     ::::::::::::::::::::::::::::::
CCDS64                               MTNYSFHCNVCHHSGNTYFLRKQANLKEMC
                                             10        20        30

     170       180       190       200       210       220
pF1KB5 LSALANLTWQSRTQDEHPKTMFSKDKDIIPFIDKYWECMTTRQRPGKMTWPNNIVKTMSK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 LSALANLTWQSRTQDEHPKTMFSKDKDIIPFIDKYWECMTTRQRPGKMTWPNNIVKTMSK
               40        50        60        70        80        90

     230       240       250       260       270       280
pF1KB5 ERDVFLVKEHPDPGSKDPEEDYPKFGLLDQDLSNIGPAYDNQKQSSAVSTSGNLNGGIAA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 ERDVFLVKEHPDPGSKDPEEDYPKFGLLDQDLSNIGPAYDNQKQSSAVSTSGNLNGGIAA
              100       110       120       130       140       150

     290       300       310       320       330       340
pF1KB5 GSSGKGRGAKRKQQDGGTTGTTKKARSDPLFSAQRLPPHGYPLEHPFNKDGYRYILAEPD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 GSSGKGRGAKRKQQDGGTTGTTKKARSDPLFSAQRLPPHGYPLEHPFNKDGYRYILAEPD
              160       170       180       190       200       210

     350       360       370       380       390       400
pF1KB5 PHAPDPEKLELDCWAGKPIPGDLYRACLYERVLLALHDRAPQLKISDDRLTVVGEKGYSM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 PHAPDPEKLELDCWAGKPIPGDLYRACLYERVLLALHDRAPQLKISDDRLTVVGEKGYSM
              220       230       240       250       260       270

     410       420       430       440       450       460
pF1KB5 VRASHGVRKGAWYFEITVDEMPPDTAARLGWSQPLGNLQAPLGYDKFSYSWRSKKGTKFH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 VRASHGVRKGAWYFEITVDEMPPDTAARLGWSQPLGNLQAPLGYDKFSYSWRSKKGTKFH
              280       290       300       310       320       330

     470       480       490       500       510       520
pF1KB5 QSIGKHYSSGYGQGDVLGFYINLPEDTETAKSLPDTYKDKALIKFKSYLYFEEKDFVDKA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 QSIGKHYSSGYGQGDVLGFYINLPEDTETAKSLPDTYKDKALIKFKSYLYFEEKDFVDKA
              340       350       360       370       380       390

     530       540       550       560       570       580
pF1KB5 EKSLKQTPHSEIIFYKNGVNQGVAYKDIFEGVYFPAISLYKSCTVSINFGPCFKYPPKDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS64 EKSLKQTPHSEIIFYKNGVNQGVAYKDIFEGVYFPAISLYKSCTVSINFGPCFKYPPKDL
              400       410       420       430       440       450

     590       600       610       620
pF1KB5 TYRPMSDMGWGAVVEHTLADVLYHVETEVDGRRSPPWEP
       :::::::::::::::::::::::::::::::::::::::
CCDS64 TYRPMSDMGWGAVVEHTLADVLYHVETEVDGRRSPPWEP
              460       470       480

>>CCDS59100.1 ASH2L gene_id:9070|Hs108|chr8  (501 aa)
 initn: 3148 init1: 3111 opt: 3112 Z-score: 2605.9 bits: 492.0 E(32554): 7.9e-139
Smith-Waterman score: 3436; 93.8% identity (93.8% similar) in 534 aa overlap (95-628:1-501)

           70        80        90       100       110       120
pF1KB5 ANLVDVSGGLETESSNGKDTLEGAGDTSEVMDTQAGSVDEENGRQLGEVELQCGICTKWF
                                     ::::::::::::::::::::::::::::::
CCDS59                               MDTQAGSVDEENGRQLGEVELQCGICTKWF
                                             10        20        30

          130       140       150       160       170       180
pF1KB5 TADTFGIDTSSCLPFMTNYSFHCNVCHHSGNTYFLRKQANLKEMCLSALANLTWQSRTQD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 TADTFGIDTSSCLPFMTNYSFHCNVCHHSGNTYFLRKQANLKEMCLSALANLTWQSRTQD
               40        50        60        70        80        90

          190       200       210       220       230       240
pF1KB5 EHPKTMFSKDKDIIPFIDKYWECMTTRQRPGKMTWPNNIVKTMSKERDVFLVKEHPDPGS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 EHPKTMFSKDKDIIPFIDKYWECMTTRQRPGKMTWPNNIVKTMSKERDVFLVKEHPDPGS
              100       110       120       130       140       150

          250       260       270       280       290       300
pF1KB5 KDPEEDYPKFGLLDQDLSNIGPAYDNQKQSSAVSTSGNLNGGIAAGSSGKGRGAKRKQQD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 KDPEEDYPKFGLLDQDLSNIGPAYDNQKQSSAVSTSGNLNGGIAAGSSGKGRGAKRKQQD
              160       170       180       190       200       210

          310       320       330       340       350       360
pF1KB5 GGTTGTTKKARSDPLFSAQRLPPHGYPLEHPFNKDGYRYILAEPDPHAPDPEKLELDCWA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 GGTTGTTKKARSDPLFSAQRLPPHGYPLEHPFNKDGYRYILAEPDPHAPDPEKLELDCWA
              220       230       240       250       260       270

          370       380       390       400       410       420
pF1KB5 GKPIPGDLYRACLYERVLLALHDRAPQLKISDDRLTVVGEKGYSMVRASHGVRKGAWYFE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 GKPIPGDLYRACLYERVLLALHDRAPQLKISDDRLTVVGEKGYSMVRASHGVRKGAWYFE
              280       290       300       310       320       330

          430       440       450       460       470       480
pF1KB5 ITVDEMPPDTAARLGWSQPLGNLQAPLGYDKFSYSWRSKKGTKFHQSIGKHYSSGYGQGD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 ITVDEMPPDTAARLGWSQPLGNLQAPLGYDKFSYSWRSKKGTKFHQSIGKHYSSGYGQGD
              340       350       360       370       380       390

          490       500       510       520       530       540
pF1KB5 VLGFYINLPEDTETAKSLPDTYKDKALIKFKSYLYFEEKDFVDKAEKSLKQTPHSEIIFY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 VLGFYINLPEDTETAKSLPDTYKDKALIKFKSYLYFEEKDFVDKAEKSLKQTPHSE----
              400       410       420       430       440

          550       560       570       580       590       600
pF1KB5 KNGVNQGVAYKDIFEGVYFPAISLYKSCTVSINFGPCFKYPPKDLTYRPMSDMGWGAVVE
                                    :::::::::::::::::::::::::::::::
CCDS59 -----------------------------VSINFGPCFKYPPKDLTYRPMSDMGWGAVVE
                                     450       460       470

          610       620
pF1KB5 HTLADVLYHVETEVDGRRSPPWEP
       ::::::::::::::::::::::::
CCDS59 HTLADVLYHVETEVDGRRSPPWEP
       480       490       500

628 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 16:13:04 2016 done: Thu Nov 3 16:13:04 2016
 Total Scan time: 4.040 Total Display time: 0.050

Function used was FASTA [36.3.4 Apr, 2011]