FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KA0783, 888 aa
1>>>pF1KA0783 888 - 888 aa - 888 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 9.9460+/-0.00106; mu= 1.5811+/- 0.064
mean_var=319.4070+/-63.799, 0's: 0 Z-trim(113.4): 41 B-trim: 149 in 1/53
Lambda= 0.071763
statistics sampled from 14033 (14063) to 14033 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.74), E-opt: 0.2 (0.432), width: 16
Scan time: 4.420
The best scores are: opt bits E(32554)
CCDS47542.1 PHF14 gene_id:9678|Hs108|chr7 ( 888) 5985 634.1 3.3e-181
CCDS7135.1 MLLT10 gene_id:8028|Hs108|chr10 (1027) 522 68.6 6.7e-11
CCDS55708.1 MLLT10 gene_id:8028|Hs108|chr10 (1068) 522 68.6 6.9e-11
>>CCDS47542.1 PHF14 gene_id:9678|Hs108|chr7 (888 aa)
initn: 5985 init1: 5985 opt: 5985 Z-score: 3366.7 bits: 634.1 E(32554): 3.3e-181
Smith-Waterman score: 5985; 99.9% identity (100.0% similar) in 888 aa overlap (1-888:1-888)
10 20 30 40 50 60
pF1KA0 MDRSSKRRQVKPLAASLLEALDYDSSDDSDFKVGDASDSEGSGNGSEDASKDSGEGSCSD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 MDRSSKRRQVKPLAASLLEALDYDSSDDSDFKVGDASDSEGSGNGSEDASKDSGEGSCSD
10 20 30 40 50 60
70 80 90 100 110 120
pF1KA0 SEENILEEELNEDIKVKEEQLKNSAEEEVLSSEKQLIKMEKKEEEENGERPRKKREKEKE
::::::::::::::::::::::::::::::::::::::::::::::::::::::.:::::
CCDS47 SEENILEEELNEDIKVKEEQLKNSAEEEVLSSEKQLIKMEKKEEEENGERPRKKKEKEKE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KA0 KEKEKEKEKEREKEKEKATVSENVAASAAATTPATSPPAVNTSPSVPTTTTATEEQVSEP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KEKEKEKEKEREKEKEKATVSENVAASAAATTPATSPPAVNTSPSVPTTTTATEEQVSEP
130 140 150 160 170 180
190 200 210 220 230 240
pF1KA0 KKWNLRRNRPLLDFVSMEELNDMDDYDSEDDNDWRPTVVKRKGRSASQKEGSDGDNEDDE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KKWNLRRNRPLLDFVSMEELNDMDDYDSEDDNDWRPTVVKRKGRSASQKEGSDGDNEDDE
190 200 210 220 230 240
250 260 270 280 290 300
pF1KA0 DEGSGSDEDENDEGNDEDHSSPASEGGCKKKKSKVLSRNSADDEELTNDSLTLSQSKSNE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 DEGSGSDEDENDEGNDEDHSSPASEGGCKKKKSKVLSRNSADDEELTNDSLTLSQSKSNE
250 260 270 280 290 300
310 320 330 340 350 360
pF1KA0 DSLILEKSQNWSSQKMDHILICCVCLGDNSEDADEIIQCDNCGITVHEGCYGVDGESDSI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 DSLILEKSQNWSSQKMDHILICCVCLGDNSEDADEIIQCDNCGITVHEGCYGVDGESDSI
310 320 330 340 350 360
370 380 390 400 410 420
pF1KA0 MSSASENSTEPWFCDACKCGVSPSCELCPNQDGIFKETDAGRWVHIVCALYVPGVAFGDI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 MSSASENSTEPWFCDACKCGVSPSCELCPNQDGIFKETDAGRWVHIVCALYVPGVAFGDI
370 380 390 400 410 420
430 440 450 460 470 480
pF1KA0 DKLRPVTLTEMNYSKYGAKECSFCEDPRFARTGVCISCDAGMCRAYFHVTCAQKEGLLSE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 DKLRPVTLTEMNYSKYGAKECSFCEDPRFARTGVCISCDAGMCRAYFHVTCAQKEGLLSE
430 440 450 460 470 480
490 500 510 520 530 540
pF1KA0 AAAEEDIADPFFAYCKQHADRLDRKWKRKNYLALQSYCKMSLQEREKQLSPEAQARINAR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 AAAEEDIADPFFAYCKQHADRLDRKWKRKNYLALQSYCKMSLQEREKQLSPEAQARINAR
490 500 510 520 530 540
550 560 570 580 590 600
pF1KA0 LQQYRAKAELARSTRPQAWVPREKLPRPLTSSASAIRKLMRKAELMGISTDIFPVDNSDT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LQQYRAKAELARSTRPQAWVPREKLPRPLTSSASAIRKLMRKAELMGISTDIFPVDNSDT
550 560 570 580 590 600
610 620 630 640 650 660
pF1KA0 SSSVDGRRKHKQPALTADFVNYYFERNMRMIQIQENMAEQKNIKDKLENEQEKLHVEYNK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 SSSVDGRRKHKQPALTADFVNYYFERNMRMIQIQENMAEQKNIKDKLENEQEKLHVEYNK
610 620 630 640 650 660
670 680 690 700 710 720
pF1KA0 LCESLEELQNLNGKLRSEGQGIWALLGRITGQKLNIPAILRAPKERKPSKKEGGTQKTST
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LCESLEELQNLNGKLRSEGQGIWALLGRITGQKLNIPAILRAPKERKPSKKEGGTQKTST
670 680 690 700 710 720
730 740 750 760 770 780
pF1KA0 LPAVLYSCGICKKNHDQHLLLLCDTCKLHYHLGCLDPPLTRMPRKTKNSYWQCSECDQAG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LPAVLYSCGICKKNHDQHLLLLCDTCKLHYHLGCLDPPLTRMPRKTKNSYWQCSECDQAG
730 740 750 760 770 780
790 800 810 820 830 840
pF1KA0 SSDMEADMAMETLPDGTKRSRRQIKEPVKFVPQDVPPEPKKIPIRNTRTRGRKRSFVPEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 SSDMEADMAMETLPDGTKRSRRQIKEPVKFVPQDVPPEPKKIPIRNTRTRGRKRSFVPEE
790 800 810 820 830 840
850 860 870 880
pF1KA0 EKHEERVPRERRQRQSVLQKKPKAEDLRTECATCKGTGDNENLVRYPS
::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 EKHEERVPRERRQRQSVLQKKPKAEDLRTECATCKGTGDNENLVRYPS
850 860 870 880
>>CCDS7135.1 MLLT10 gene_id:8028|Hs108|chr10 (1027 aa)
initn: 463 init1: 219 opt: 522 Z-score: 309.1 bits: 68.6 E(32554): 6.7e-11
Smith-Waterman score: 522; 31.5% identity (57.0% similar) in 321 aa overlap (298-605:3-306)
270 280 290 300 310 320
pF1KA0 CKKKKSKVLSRNSADDEELTNDSLTLSQSKSNEDSLILEKSQNWSSQKMDHILICCVCLG
:.. . :: . : ..: : ::::
CCDS71 MVSSDRPVSLEDEVSHSMKEM--IGGCCVCSD
10 20 30
330 340 350 360 370 380
pF1KA0 DNSEDADEIIQCDN--CGITVHEGCYGVDGESDSIMSSASENSTEPWFCDACKC---GVS
. . . .. ::. :...::..:::. . : :::: :. ..
CCDS71 ERGWAENPLVYCDGHGCSVAVHQACYGI-----------VQVPTGPWFCRKCESQERAAR
40 50 60 70
390 400 410 420 430 440
pF1KA0 PSCELCPNQDGIFKETDAGRWVHIVCALYVPGVAFGDIDKLRPVTLTEMNYSKYGAKECS
:::::..:: .:.:: : :.:.:::::.: : :.... ..:..: . ...:. : :
CCDS71 VRCELCPHKDGALKRTDNGGWAHVVCALYIPEVQFANVSTMEPIVLQSVPHDRYN-KTCY
80 90 100 110 120 130
450 460 470 480 490
pF1KA0 FC-EDPRFAR--TGVCISCDAGMCRAYFHVTCAQKEGLLSEAAAEEDIAD--PFFAYCKQ
.: :. : .. ::.:..:. :: ::::::: ::: : : . :: . .:::
CCDS71 ICDEQGRESKAATGACMTCNKHGCRQAFHVTCAQFAGLLCEE--EGNGADNVQYCGYCKY
140 150 160 170 180 190
500 510 520 530 540 550
pF1KA0 HADRLDRKWKRKNYLALQSYCKMSLQEREKQLSPEAQA--RINARLQQYRAKAELARSTR
: ..: .. . .: :: : . ..:. : . . . . :... . : . .
CCDS71 HFSKLKKSKRGSNRSYDQSLSDSSSHSQDKHHEKEKKKYKEKDKHKQKHKKQPEPSPALV
200 210 220 230 240 250
560 570 580 590 600 610
pF1KA0 PQAWVPREKLPRPLTSSASAIRKLMRKAELMGISTDI-FPVDNSDTSSSVDGRRKHKQPA
:. : :: ::. : .: : . . :. : .. :::. :
CCDS71 PSLTVTTEKT-YTSTSNNSISGSLKRLEDTTARFTNANFQEVSAHTSSGKDVSETRGSEG
260 270 280 290 300 310
620 630 640 650 660 670
pF1KA0 LTADFVNYYFERNMRMIQIQENMAEQKNIKDKLENEQEKLHVEYNKLCESLEELQNLNGK
CCDS71 KGKKSSAHSSGQRGRKPGGGRNPGTTVSAASPFPQGSFSGTPGSVKSSSGSSVQSPQDFL
320 330 340 350 360 370
>>CCDS55708.1 MLLT10 gene_id:8028|Hs108|chr10 (1068 aa)
initn: 463 init1: 219 opt: 522 Z-score: 308.9 bits: 68.6 E(32554): 6.9e-11
Smith-Waterman score: 522; 31.5% identity (57.0% similar) in 321 aa overlap (298-605:3-306)
270 280 290 300 310 320
pF1KA0 CKKKKSKVLSRNSADDEELTNDSLTLSQSKSNEDSLILEKSQNWSSQKMDHILICCVCLG
:.. . :: . : ..: : ::::
CCDS55 MVSSDRPVSLEDEVSHSMKEM--IGGCCVCSD
10 20 30
330 340 350 360 370 380
pF1KA0 DNSEDADEIIQCDN--CGITVHEGCYGVDGESDSIMSSASENSTEPWFCDACKC---GVS
. . . .. ::. :...::..:::. . : :::: :. ..
CCDS55 ERGWAENPLVYCDGHGCSVAVHQACYGI-----------VQVPTGPWFCRKCESQERAAR
40 50 60 70
390 400 410 420 430 440
pF1KA0 PSCELCPNQDGIFKETDAGRWVHIVCALYVPGVAFGDIDKLRPVTLTEMNYSKYGAKECS
:::::..:: .:.:: : :.:.:::::.: : :.... ..:..: . ...:. : :
CCDS55 VRCELCPHKDGALKRTDNGGWAHVVCALYIPEVQFANVSTMEPIVLQSVPHDRYN-KTCY
80 90 100 110 120 130
450 460 470 480 490
pF1KA0 FC-EDPRFAR--TGVCISCDAGMCRAYFHVTCAQKEGLLSEAAAEEDIAD--PFFAYCKQ
.: :. : .. ::.:..:. :: ::::::: ::: : : . :: . .:::
CCDS55 ICDEQGRESKAATGACMTCNKHGCRQAFHVTCAQFAGLLCEE--EGNGADNVQYCGYCKY
140 150 160 170 180 190
500 510 520 530 540 550
pF1KA0 HADRLDRKWKRKNYLALQSYCKMSLQEREKQLSPEAQA--RINARLQQYRAKAELARSTR
: ..: .. . .: :: : . ..:. : . . . . :... . : . .
CCDS55 HFSKLKKSKRGSNRSYDQSLSDSSSHSQDKHHEKEKKKYKEKDKHKQKHKKQPEPSPALV
200 210 220 230 240 250
560 570 580 590 600 610
pF1KA0 PQAWVPREKLPRPLTSSASAIRKLMRKAELMGISTDI-FPVDNSDTSSSVDGRRKHKQPA
:. : :: ::. : .: : . . :. : .. :::. :
CCDS55 PSLTVTTEKT-YTSTSNNSISGSLKRLEDTTARFTNANFQEVSAHTSSGKDVSETRGSEG
260 270 280 290 300 310
620 630 640 650 660 670
pF1KA0 LTADFVNYYFERNMRMIQIQENMAEQKNIKDKLENEQEKLHVEYNKLCESLEELQNLNGK
CCDS55 KGKKSSAHSSGQRGRKPGGGRNPGTTVSAASPFPQGSFSGTPGSVKSSSGSSVQSPQDFL
320 330 340 350 360 370
888 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 01:00:33 2016 done: Fri Nov 4 01:00:34 2016
Total Scan time: 4.420 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]