FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KA0783, 888 aa
  1>>>pF1KA0783 888 - 888 aa - 888 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 9.9460+/-0.00106; mu= 1.5811+/- 0.064
 mean_var=319.4070+/-63.799, 0's: 0 Z-trim(113.4): 41  B-trim: 149 in 1/53
 Lambda= 0.071763
 statistics sampled from 14033 (14063) to 14033 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.74), E-opt: 0.2 (0.432), width:  16
 Scan time:  4.420

The best scores are:                                       opt  bits E(32554)
CCDS47542.1 PHF14 gene_id:9678|Hs108|chr7          ( 888) 5985 634.1 3.3e-181
CCDS7135.1 MLLT10 gene_id:8028|Hs108|chr10         (1027)  522  68.6 6.7e-11
CCDS55708.1 MLLT10 gene_id:8028|Hs108|chr10        (1068)  522  68.6 6.9e-11

>>CCDS47542.1 PHF14 gene_id:9678|Hs108|chr7             (888 aa)
 initn: 5985 init1: 5985 opt: 5985  Z-score: 3366.7  bits: 634.1 E(32554): 3.3e-181
Smith-Waterman score: 5985; 99.9% identity (100.0% similar) in 888 aa overlap (1-888:1-888)

               10        20        30        40        50        60
pF1KA0 MDRSSKRRQVKPLAASLLEALDYDSSDDSDFKVGDASDSEGSGNGSEDASKDSGEGSCSD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 MDRSSKRRQVKPLAASLLEALDYDSSDDSDFKVGDASDSEGSGNGSEDASKDSGEGSCSD
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KA0 SEENILEEELNEDIKVKEEQLKNSAEEEVLSSEKQLIKMEKKEEEENGERPRKKREKEKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::.:::::
CCDS47 SEENILEEELNEDIKVKEEQLKNSAEEEVLSSEKQLIKMEKKEEEENGERPRKKKEKEKE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KA0 KEKEKEKEKEREKEKEKATVSENVAASAAATTPATSPPAVNTSPSVPTTTTATEEQVSEP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KEKEKEKEKEREKEKEKATVSENVAASAAATTPATSPPAVNTSPSVPTTTTATEEQVSEP
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KA0 KKWNLRRNRPLLDFVSMEELNDMDDYDSEDDNDWRPTVVKRKGRSASQKEGSDGDNEDDE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KKWNLRRNRPLLDFVSMEELNDMDDYDSEDDNDWRPTVVKRKGRSASQKEGSDGDNEDDE
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KA0 DEGSGSDEDENDEGNDEDHSSPASEGGCKKKKSKVLSRNSADDEELTNDSLTLSQSKSNE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 DEGSGSDEDENDEGNDEDHSSPASEGGCKKKKSKVLSRNSADDEELTNDSLTLSQSKSNE
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KA0 DSLILEKSQNWSSQKMDHILICCVCLGDNSEDADEIIQCDNCGITVHEGCYGVDGESDSI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 DSLILEKSQNWSSQKMDHILICCVCLGDNSEDADEIIQCDNCGITVHEGCYGVDGESDSI
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KA0 MSSASENSTEPWFCDACKCGVSPSCELCPNQDGIFKETDAGRWVHIVCALYVPGVAFGDI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 MSSASENSTEPWFCDACKCGVSPSCELCPNQDGIFKETDAGRWVHIVCALYVPGVAFGDI
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KA0 DKLRPVTLTEMNYSKYGAKECSFCEDPRFARTGVCISCDAGMCRAYFHVTCAQKEGLLSE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 DKLRPVTLTEMNYSKYGAKECSFCEDPRFARTGVCISCDAGMCRAYFHVTCAQKEGLLSE
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KA0 AAAEEDIADPFFAYCKQHADRLDRKWKRKNYLALQSYCKMSLQEREKQLSPEAQARINAR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 AAAEEDIADPFFAYCKQHADRLDRKWKRKNYLALQSYCKMSLQEREKQLSPEAQARINAR
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KA0 LQQYRAKAELARSTRPQAWVPREKLPRPLTSSASAIRKLMRKAELMGISTDIFPVDNSDT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LQQYRAKAELARSTRPQAWVPREKLPRPLTSSASAIRKLMRKAELMGISTDIFPVDNSDT
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KA0 SSSVDGRRKHKQPALTADFVNYYFERNMRMIQIQENMAEQKNIKDKLENEQEKLHVEYNK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 SSSVDGRRKHKQPALTADFVNYYFERNMRMIQIQENMAEQKNIKDKLENEQEKLHVEYNK
              610       620       630       640       650       660

              670       680       690       700       710       720
pF1KA0 LCESLEELQNLNGKLRSEGQGIWALLGRITGQKLNIPAILRAPKERKPSKKEGGTQKTST
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LCESLEELQNLNGKLRSEGQGIWALLGRITGQKLNIPAILRAPKERKPSKKEGGTQKTST
              670       680       690       700       710       720

              730       740       750       760       770       780
pF1KA0 LPAVLYSCGICKKNHDQHLLLLCDTCKLHYHLGCLDPPLTRMPRKTKNSYWQCSECDQAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LPAVLYSCGICKKNHDQHLLLLCDTCKLHYHLGCLDPPLTRMPRKTKNSYWQCSECDQAG
              730       740       750       760       770       780

              790       800       810       820       830       840
pF1KA0 SSDMEADMAMETLPDGTKRSRRQIKEPVKFVPQDVPPEPKKIPIRNTRTRGRKRSFVPEE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 SSDMEADMAMETLPDGTKRSRRQIKEPVKFVPQDVPPEPKKIPIRNTRTRGRKRSFVPEE
              790       800       810       820       830       840

              850       860       870       880
pF1KA0 EKHEERVPRERRQRQSVLQKKPKAEDLRTECATCKGTGDNENLVRYPS
       ::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 EKHEERVPRERRQRQSVLQKKPKAEDLRTECATCKGTGDNENLVRYPS
              850       860       870       880

>>CCDS7135.1 MLLT10 gene_id:8028|Hs108|chr10            (1027 aa)
 initn: 463 init1: 219 opt: 522  Z-score: 309.1  bits: 68.6 E(32554): 6.7e-11
Smith-Waterman score: 522; 31.5% identity (57.0% similar) in 321 aa overlap (298-605:3-306)

       270       280       290       300       310       320
pF1KA0 CKKKKSKVLSRNSADDEELTNDSLTLSQSKSNEDSLILEKSQNWSSQKMDHILICCVCLG
                                     :..  . ::   . : ..:  :  ::::
CCDS71                             MVSSDRPVSLEDEVSHSMKEM--IGGCCVCSD
                                           10        20          30

       330       340         350       360       370          380
pF1KA0 DNSEDADEIIQCDN--CGITVHEGCYGVDGESDSIMSSASENSTEPWFCDACKC---GVS
       . .   . .. ::.  :...::..:::.            .  : ::::  :.    ..
CCDS71 ERGWAENPLVYCDGHGCSVAVHQACYGI-----------VQVPTGPWFCRKCESQERAAR
               40        50                   60        70

            390       400       410       420       430       440
pF1KA0 PSCELCPNQDGIFKETDAGRWVHIVCALYVPGVAFGDIDKLRPVTLTEMNYSKYGAKECS
         :::::..:: .:.:: : :.:.:::::.: : :.... ..:..:  . ...:. : :
CCDS71 VRCELCPHKDGALKRTDNGGWAHVVCALYIPEVQFANVSTMEPIVLQSVPHDRYN-KTCY
      80        90       100       110       120       130

             450         460       470       480         490
pF1KA0 FC-EDPRFAR--TGVCISCDAGMCRAYFHVTCAQKEGLLSEAAAEEDIAD--PFFAYCKQ
       .: :. : ..  ::.:..:.   ::  :::::::  ::: :   : . ::   . .:::
CCDS71 ICDEQGRESKAATGACMTCNKHGCRQAFHVTCAQFAGLLCEE--EGNGADNVQYCGYCKY
      140       150       160       170       180         190

       500       510       520       530         540       550
pF1KA0 HADRLDRKWKRKNYLALQSYCKMSLQEREKQLSPEAQA--RINARLQQYRAKAELARSTR
       : ..: .. . .:    ::    : . ..:.   : .   . . . :... . : . .
CCDS71 HFSKLKKSKRGSNRSYDQSLSDSSSHSQDKHHEKEKKKYKEKDKHKQKHKKQPEPSPALV
        200       210       220       230       240       250

         560       570       580       590        600       610
pF1KA0 PQAWVPREKLPRPLTSSASAIRKLMRKAELMGISTDI-FPVDNSDTSSSVDGRRKHKQPA
       :.  :  ::     ::. :   .: :  .  .  :.  :   .. :::. :
CCDS71 PSLTVTTEKT-YTSTSNNSISGSLKRLEDTTARFTNANFQEVSAHTSSGKDVSETRGSEG
        260        270       280       290       300       310

          620       630       640       650       660       670
pF1KA0 LTADFVNYYFERNMRMIQIQENMAEQKNIKDKLENEQEKLHVEYNKLCESLEELQNLNGK

CCDS71 KGKKSSAHSSGQRGRKPGGGRNPGTTVSAASPFPQGSFSGTPGSVKSSSGSSVQSPQDFL
         320       330       340       350       360       370

>>CCDS55708.1 MLLT10 gene_id:8028|Hs108|chr10           (1068 aa)
 initn: 463 init1: 219 opt: 522  Z-score: 308.9  bits: 68.6 E(32554): 6.9e-11
Smith-Waterman score: 522; 31.5% identity (57.0% similar) in 321 aa overlap (298-605:3-306)

       270       280       290       300       310       320
pF1KA0 CKKKKSKVLSRNSADDEELTNDSLTLSQSKSNEDSLILEKSQNWSSQKMDHILICCVCLG
                                     :..  . ::   . : ..:  :  ::::
CCDS55                             MVSSDRPVSLEDEVSHSMKEM--IGGCCVCSD
                                           10        20          30

       330       340         350       360       370          380
pF1KA0 DNSEDADEIIQCDN--CGITVHEGCYGVDGESDSIMSSASENSTEPWFCDACKC---GVS
       . .   . .. ::.  :...::..:::.            .  : ::::  :.    ..
CCDS55 ERGWAENPLVYCDGHGCSVAVHQACYGI-----------VQVPTGPWFCRKCESQERAAR
               40        50                   60        70

            390       400       410       420       430       440
pF1KA0 PSCELCPNQDGIFKETDAGRWVHIVCALYVPGVAFGDIDKLRPVTLTEMNYSKYGAKECS
         :::::..:: .:.:: : :.:.:::::.: : :.... ..:..:  . ...:. : :
CCDS55 VRCELCPHKDGALKRTDNGGWAHVVCALYIPEVQFANVSTMEPIVLQSVPHDRYN-KTCY
      80        90       100       110       120       130

             450         460       470       480         490
pF1KA0 FC-EDPRFAR--TGVCISCDAGMCRAYFHVTCAQKEGLLSEAAAEEDIAD--PFFAYCKQ
       .: :. : ..  ::.:..:.   ::  :::::::  ::: :   : . ::   . .:::
CCDS55 ICDEQGRESKAATGACMTCNKHGCRQAFHVTCAQFAGLLCEE--EGNGADNVQYCGYCKY
      140       150       160       170       180         190

       500       510       520       530         540       550
pF1KA0 HADRLDRKWKRKNYLALQSYCKMSLQEREKQLSPEAQA--RINARLQQYRAKAELARSTR
       : ..: .. . .:    ::    : . ..:.   : .   . . . :... . : . .
CCDS55 HFSKLKKSKRGSNRSYDQSLSDSSSHSQDKHHEKEKKKYKEKDKHKQKHKKQPEPSPALV
        200       210       220       230       240       250

         560       570       580       590        600       610
pF1KA0 PQAWVPREKLPRPLTSSASAIRKLMRKAELMGISTDI-FPVDNSDTSSSVDGRRKHKQPA
       :.  :  ::     ::. :   .: :  .  .  :.  :   .. :::. :
CCDS55 PSLTVTTEKT-YTSTSNNSISGSLKRLEDTTARFTNANFQEVSAHTSSGKDVSETRGSEG
        260        270       280       290       300       310

          620       630       640       650       660       670
pF1KA0 LTADFVNYYFERNMRMIQIQENMAEQKNIKDKLENEQEKLHVEYNKLCESLEELQNLNGK

CCDS55 KGKKSSAHSSGQRGRKPGGGRNPGTTVSAASPFPQGSFSGTPGSVKSSSGSSVQSPQDFL
         320       330       340       350       360       370

888 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov  4 01:00:33 2016 done: Fri Nov  4 01:00:34 2016
 Total Scan time:  4.420 Total Display time:  0.020

Function used was FASTA [36.3.4 Apr, 2011]