FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KA0244, 2039 aa
1>>>pF1KA0244 2039 - 2039 aa - 2039 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 11.8857+/-0.00139; mu= -3.4977+/- 0.084
mean_var=431.9312+/-86.992, 0's: 0 Z-trim(110.8): 38 B-trim: 0 in 0/53
Lambda= 0.061712
statistics sampled from 11868 (11892) to 11868 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.68), E-opt: 0.2 (0.365), width: 16
Scan time: 5.750
The best scores are: opt bits E(32554)
CCDS4966.1 PHF3 gene_id:23469|Hs108|chr6 (2039) 13591 1226.5 0
CCDS78155.1 PHF3 gene_id:23469|Hs108|chr6 ( 736) 4828 446.0 2.7e-124
CCDS33506.1 DIDO1 gene_id:11083|Hs108|chr20 (2240) 743 82.7 1.9e-14
CCDS13508.2 DIDO1 gene_id:11083|Hs108|chr20 (1189) 644 73.6 5.3e-12
>>CCDS4966.1 PHF3 gene_id:23469|Hs108|chr6 (2039 aa)
initn: 13591 init1: 13591 opt: 13591 Z-score: 6555.4 bits: 1226.5 E(32554): 0
Smith-Waterman score: 13591; 100.0% identity (100.0% similar) in 2039 aa overlap (1-2039:1-2039)
10 20 30 40 50 60
pF1KA0 MDIVDTFNHLIPTEHLDDALFLGSNLENEVCEDFSASQNVLEDSLKNMLSDKDPMLGSAS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 MDIVDTFNHLIPTEHLDDALFLGSNLENEVCEDFSASQNVLEDSLKNMLSDKDPMLGSAS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KA0 NQFCLPVLDSNDPNFQMPCSTVVGLDDIMDEGVVKESGNDTIDEEELILPNRNLRDKVEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 NQFCLPVLDSNDPNFQMPCSTVVGLDDIMDEGVVKESGNDTIDEEELILPNRNLRDKVEE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KA0 NSVRSPRKSPRLMAQEQVRSLRQSTIAKRSNAAPLSNTKKASGKTVSTAKAGVKQPERSQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 NSVRSPRKSPRLMAQEQVRSLRQSTIAKRSNAAPLSNTKKASGKTVSTAKAGVKQPERSQ
130 140 150 160 170 180
190 200 210 220 230 240
pF1KA0 VKEEVCMSLKPEYHKENRRCSRNSGQIEVVPEVSVSSSHSSVSSCLEMKDEDGLDSKHKC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 VKEEVCMSLKPEYHKENRRCSRNSGQIEVVPEVSVSSSHSSVSSCLEMKDEDGLDSKHKC
190 200 210 220 230 240
250 260 270 280 290 300
pF1KA0 NNPGEIDVPSHELNCSLLSETCVTIGEKKNEALMECKAKPVGSPLFKFSDKEEHEQNDSI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 NNPGEIDVPSHELNCSLLSETCVTIGEKKNEALMECKAKPVGSPLFKFSDKEEHEQNDSI
250 260 270 280 290 300
310 320 330 340 350 360
pF1KA0 SGKTGETVVEEMIATRKVEQDSKETVKLSHEDDHILEDAGSSDISSDAACTNPNKTENSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 SGKTGETVVEEMIATRKVEQDSKETVKLSHEDDHILEDAGSSDISSDAACTNPNKTENSL
310 320 330 340 350 360
370 380 390 400 410 420
pF1KA0 VGLPSCVDEVTECNLELKDTMGIADKTENTLERNKIEPLGYCEDAESNRQLESTEFNKSN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 VGLPSCVDEVTECNLELKDTMGIADKTENTLERNKIEPLGYCEDAESNRQLESTEFNKSN
370 380 390 400 410 420
430 440 450 460 470 480
pF1KA0 LEVVDTSTFGPESNILENAICDVPDQNSKQLNAIESTKIESHETANLQDDRNSQSSSVSY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 LEVVDTSTFGPESNILENAICDVPDQNSKQLNAIESTKIESHETANLQDDRNSQSSSVSY
430 440 450 460 470 480
490 500 510 520 530 540
pF1KA0 LESKSVKSKHTKPVIHSKQNMTTDAPKKIVAAKYEVIHSKTKVNVKSVKRNTDVPESQQN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 LESKSVKSKHTKPVIHSKQNMTTDAPKKIVAAKYEVIHSKTKVNVKSVKRNTDVPESQQN
490 500 510 520 530 540
550 560 570 580 590 600
pF1KA0 FHRPVKVRKKQIDKEPKIQSCNSGVKSVKNQAHSVLKKTLQDQTLVQIFKPLTHSLSDKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 FHRPVKVRKKQIDKEPKIQSCNSGVKSVKNQAHSVLKKTLQDQTLVQIFKPLTHSLSDKS
550 560 570 580 590 600
610 620 630 640 650 660
pF1KA0 HAHPGCLKEPHHPAQTGHVSHSSQKQCHKPQQQAPAMKTNSHVKEELEHPGVEHFKEEDK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 HAHPGCLKEPHHPAQTGHVSHSSQKQCHKPQQQAPAMKTNSHVKEELEHPGVEHFKEEDK
610 620 630 640 650 660
670 680 690 700 710 720
pF1KA0 LKLKKPEKNLQPRQRRSSKSFSLDEPPLFIPDNIATIRREGSDHSSSFESKYMWTPSKQC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 LKLKKPEKNLQPRQRRSSKSFSLDEPPLFIPDNIATIRREGSDHSSSFESKYMWTPSKQC
670 680 690 700 710 720
730 740 750 760 770 780
pF1KA0 GFCKKPHGNRFMVGCGRCDDWFHGDCVGLSLSQAQQMGEEDKEYVCVKCCAEEDKKTEIL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 GFCKKPHGNRFMVGCGRCDDWFHGDCVGLSLSQAQQMGEEDKEYVCVKCCAEEDKKTEIL
730 740 750 760 770 780
790 800 810 820 830 840
pF1KA0 DPDTLENQATVEFHSGDKTMECEKLGLSKHTTNDRTKYIDDTVKHKVKILKRESGEGRNS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 DPDTLENQATVEFHSGDKTMECEKLGLSKHTTNDRTKYIDDTVKHKVKILKRESGEGRNS
790 800 810 820 830 840
850 860 870 880 890 900
pF1KA0 SDCRDNEIKKWQLAPLRKMGQPVLPRRSSEEKSEKIPKESTTVTCTGEKASKPGTHEKQE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 SDCRDNEIKKWQLAPLRKMGQPVLPRRSSEEKSEKIPKESTTVTCTGEKASKPGTHEKQE
850 860 870 880 890 900
910 920 930 940 950 960
pF1KA0 MKKKKVEKGVLNVHPAASASKPSADQIRQSVRHSLKDILMKRLTDSNLKVPEEKAAKVAT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 MKKKKVEKGVLNVHPAASASKPSADQIRQSVRHSLKDILMKRLTDSNLKVPEEKAAKVAT
910 920 930 940 950 960
970 980 990 1000 1010 1020
pF1KA0 KIEKELFSFFRDTDAKYKNKYRSLMFNLKDPKNNILFKKVLKGEVTPDHLIRMSPEELAS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 KIEKELFSFFRDTDAKYKNKYRSLMFNLKDPKNNILFKKVLKGEVTPDHLIRMSPEELAS
970 980 990 1000 1010 1020
1030 1040 1050 1060 1070 1080
pF1KA0 KELAAWRRRENRHTIEMIEKEQREVERRPITKITHKGEIEIESDAPMKEQEAAMEIQEPA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 KELAAWRRRENRHTIEMIEKEQREVERRPITKITHKGEIEIESDAPMKEQEAAMEIQEPA
1030 1040 1050 1060 1070 1080
1090 1100 1110 1120 1130 1140
pF1KA0 ANKSLEKPEGSEKQKEEVDSMSKDTTSQHRQHLFDLNCKICIGRMAPPVDDLSPKKVKVV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 ANKSLEKPEGSEKQKEEVDSMSKDTTSQHRQHLFDLNCKICIGRMAPPVDDLSPKKVKVV
1090 1100 1110 1120 1130 1140
1150 1160 1170 1180 1190 1200
pF1KA0 VGVARKHSDNEAESIADALSSTSNILASEFFEEEKQESPKSTFSPAPRPEMPGTVEVEST
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 VGVARKHSDNEAESIADALSSTSNILASEFFEEEKQESPKSTFSPAPRPEMPGTVEVEST
1150 1160 1170 1180 1190 1200
1210 1220 1230 1240 1250 1260
pF1KA0 FLARLNFIWKGFINMPSVAKFVTKAYPVSGSPEYLTEDLPDSIQVGGRISPQTVWDYVEK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 FLARLNFIWKGFINMPSVAKFVTKAYPVSGSPEYLTEDLPDSIQVGGRISPQTVWDYVEK
1210 1220 1230 1240 1250 1260
1270 1280 1290 1300 1310 1320
pF1KA0 IKASGTKEICVVRFTPVTEEDQISYTLLFAYFSSRKRYGVAANNMKQVKDMYLIPLGATD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 IKASGTKEICVVRFTPVTEEDQISYTLLFAYFSSRKRYGVAANNMKQVKDMYLIPLGATD
1270 1280 1290 1300 1310 1320
1330 1340 1350 1360 1370 1380
pF1KA0 KIPHPLVPFDGPGLELHRPNLLLGLIIRQKLKRQHSACASTSHIAETPESAPPIALPPDK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 KIPHPLVPFDGPGLELHRPNLLLGLIIRQKLKRQHSACASTSHIAETPESAPPIALPPDK
1330 1340 1350 1360 1370 1380
1390 1400 1410 1420 1430 1440
pF1KA0 KSKIEVSTEEAPEEENDFFNSFTTVLHKQRNKPQQNLQEDLPTAVEPLMEVTKQEPPKPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 KSKIEVSTEEAPEEENDFFNSFTTVLHKQRNKPQQNLQEDLPTAVEPLMEVTKQEPPKPL
1390 1400 1410 1420 1430 1440
1450 1460 1470 1480 1490 1500
pF1KA0 RFLPGVLIGWENQPTTLELANKPLPVDDILQSLLGTTGQVYDQAQSVMEQNTVKEIPFLN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 RFLPGVLIGWENQPTTLELANKPLPVDDILQSLLGTTGQVYDQAQSVMEQNTVKEIPFLN
1450 1460 1470 1480 1490 1500
1510 1520 1530 1540 1550 1560
pF1KA0 EQTNSKIEKTDNVEVTDGENKEIKVKVDNISESTDKSAEIETSVVGSSSISAGSLTSLSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 EQTNSKIEKTDNVEVTDGENKEIKVKVDNISESTDKSAEIETSVVGSSSISAGSLTSLSL
1510 1520 1530 1540 1550 1560
1570 1580 1590 1600 1610 1620
pF1KA0 RGKPPDVSTEAFLTNLSIQSKQEETVESKEKTLKRQLQEDQENNLQDNQTSNSSPCRSNV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 RGKPPDVSTEAFLTNLSIQSKQEETVESKEKTLKRQLQEDQENNLQDNQTSNSSPCRSNV
1570 1580 1590 1600 1610 1620
1630 1640 1650 1660 1670 1680
pF1KA0 GKGNIDGNVSCSENLVANTARSPQFINLKRDPRQAAGRSQPVTTSESKDGDSCRNGEKHM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 GKGNIDGNVSCSENLVANTARSPQFINLKRDPRQAAGRSQPVTTSESKDGDSCRNGEKHM
1630 1640 1650 1660 1670 1680
1690 1700 1710 1720 1730 1740
pF1KA0 LPGLSHNKEHLTEQINVEEKLCSAEKNSCVQQSDNLKVAQNSPSVENIQTSQAEQAKPLQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 LPGLSHNKEHLTEQINVEEKLCSAEKNSCVQQSDNLKVAQNSPSVENIQTSQAEQAKPLQ
1690 1700 1710 1720 1730 1740
1750 1760 1770 1780 1790 1800
pF1KA0 EDILMQNIETVHPFRRGSAVATSHFEVGNTCPSEFPSKSITFTSRSTSPRTSTNFSPMRP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 EDILMQNIETVHPFRRGSAVATSHFEVGNTCPSEFPSKSITFTSRSTSPRTSTNFSPMRP
1750 1760 1770 1780 1790 1800
1810 1820 1830 1840 1850 1860
pF1KA0 QQPNLQHLKSSPPGFPFPGPPNFPPQSMFGFPPHLPPPLLPPPGFGFAQNPMVPWPPVVH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 QQPNLQHLKSSPPGFPFPGPPNFPPQSMFGFPPHLPPPLLPPPGFGFAQNPMVPWPPVVH
1810 1820 1830 1840 1850 1860
1870 1880 1890 1900 1910 1920
pF1KA0 LPGQPQRMMGPLSQASRYIGPQNFYQVKDIRRPERRHSDPWGRQDQQQLDRPFNRGKGDR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 LPGQPQRMMGPLSQASRYIGPQNFYQVKDIRRPERRHSDPWGRQDQQQLDRPFNRGKGDR
1870 1880 1890 1900 1910 1920
1930 1940 1950 1960 1970 1980
pF1KA0 QRFYSDSHHLKRERHEKEWEQESERHRRRDRSQDKDRDRKSREEGHKDKERARLSHGDRG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 QRFYSDSHHLKRERHEKEWEQESERHRRRDRSQDKDRDRKSREEGHKDKERARLSHGDRG
1930 1940 1950 1960 1970 1980
1990 2000 2010 2020 2030
pF1KA0 TDGKASRDSRNVDKKPDKPKSEDYEKDKEREKSKHREGEKDRDRYHKDRDHTDRTKSKR
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS49 TDGKASRDSRNVDKKPDKPKSEDYEKDKEREKSKHREGEKDRDRYHKDRDHTDRTKSKR
1990 2000 2010 2020 2030
>>CCDS78155.1 PHF3 gene_id:23469|Hs108|chr6 (736 aa)
initn: 4820 init1: 4820 opt: 4828 Z-score: 2344.8 bits: 446.0 E(32554): 2.7e-124
Smith-Waterman score: 4828; 99.5% identity (99.6% similar) in 735 aa overlap (1-735:1-735)
10 20 30 40 50 60
pF1KA0 MDIVDTFNHLIPTEHLDDALFLGSNLENEVCEDFSASQNVLEDSLKNMLSDKDPMLGSAS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 MDIVDTFNHLIPTEHLDDALFLGSNLENEVCEDFSASQNVLEDSLKNMLSDKDPMLGSAS
10 20 30 40 50 60
70 80 90 100 110 120
pF1KA0 NQFCLPVLDSNDPNFQMPCSTVVGLDDIMDEGVVKESGNDTIDEEELILPNRNLRDKVEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 NQFCLPVLDSNDPNFQMPCSTVVGLDDIMDEGVVKESGNDTIDEEELILPNRNLRDKVEE
70 80 90 100 110 120
130 140 150 160 170 180
pF1KA0 NSVRSPRKSPRLMAQEQVRSLRQSTIAKRSNAAPLSNTKKASGKTVSTAKAGVKQPERSQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 NSVRSPRKSPRLMAQEQVRSLRQSTIAKRSNAAPLSNTKKASGKTVSTAKAGVKQPERSQ
130 140 150 160 170 180
190 200 210 220 230 240
pF1KA0 VKEEVCMSLKPEYHKENRRCSRNSGQIEVVPEVSVSSSHSSVSSCLEMKDEDGLDSKHKC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 VKEEVCMSLKPEYHKENRRCSRNSGQIEVVPEVSVSSSHSSVSSCLEMKDEDGLDSKHKC
190 200 210 220 230 240
250 260 270 280 290 300
pF1KA0 NNPGEIDVPSHELNCSLLSETCVTIGEKKNEALMECKAKPVGSPLFKFSDKEEHEQNDSI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 NNPGEIDVPSHELNCSLLSETCVTIGEKKNEALMECKAKPVGSPLFKFSDKEEHEQNDSI
250 260 270 280 290 300
310 320 330 340 350 360
pF1KA0 SGKTGETVVEEMIATRKVEQDSKETVKLSHEDDHILEDAGSSDISSDAACTNPNKTENSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 SGKTGETVVEEMIATRKVEQDSKETVKLSHEDDHILEDAGSSDISSDAACTNPNKTENSL
310 320 330 340 350 360
370 380 390 400 410 420
pF1KA0 VGLPSCVDEVTECNLELKDTMGIADKTENTLERNKIEPLGYCEDAESNRQLESTEFNKSN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 VGLPSCVDEVTECNLELKDTMGIADKTENTLERNKIEPLGYCEDAESNRQLESTEFNKSN
370 380 390 400 410 420
430 440 450 460 470 480
pF1KA0 LEVVDTSTFGPESNILENAICDVPDQNSKQLNAIESTKIESHETANLQDDRNSQSSSVSY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 LEVVDTSTFGPESNILENAICDVPDQNSKQLNAIESTKIESHETANLQDDRNSQSSSVSY
430 440 450 460 470 480
490 500 510 520 530 540
pF1KA0 LESKSVKSKHTKPVIHSKQNMTTDAPKKIVAAKYEVIHSKTKVNVKSVKRNTDVPESQQN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 LESKSVKSKHTKPVIHSKQNMTTDAPKKIVAAKYEVIHSKTKVNVKSVKRNTDVPESQQN
490 500 510 520 530 540
550 560 570 580 590 600
pF1KA0 FHRPVKVRKKQIDKEPKIQSCNSGVKSVKNQAHSVLKKTLQDQTLVQIFKPLTHSLSDKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 FHRPVKVRKKQIDKEPKIQSCNSGVKSVKNQAHSVLKKTLQDQTLVQIFKPLTHSLSDKS
550 560 570 580 590 600
610 620 630 640 650 660
pF1KA0 HAHPGCLKEPHHPAQTGHVSHSSQKQCHKPQQQAPAMKTNSHVKEELEHPGVEHFKEEDK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 HAHPGCLKEPHHPAQTGHVSHSSQKQCHKPQQQAPAMKTNSHVKEELEHPGVEHFKEEDK
610 620 630 640 650 660
670 680 690 700 710 720
pF1KA0 LKLKKPEKNLQPRQRRSSKSFSLDEPPLFIPDNIATIRREGSDHSSSFESKYMWTPSKQC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS78 LKLKKPEKNLQPRQRRSSKSFSLDEPPLFIPDNIATIRREGSDHSSSFESKYMWTPSKQC
670 680 690 700 710 720
730 740 750 760 770 780
pF1KA0 GFCKKPHGNRFMVGCGRCDDWFHGDCVGLSLSQAQQMGEEDKEYVCVKCCAEEDKKTEIL
:::::::::: . :
CCDS78 GFCKKPHGNRCVCMCV
730
>>CCDS33506.1 DIDO1 gene_id:11083|Hs108|chr20 (2240 aa)
initn: 1653 init1: 728 opt: 743 Z-score: 372.9 bits: 82.7 E(32554): 1.9e-14
Smith-Waterman score: 1276; 29.1% identity (51.5% similar) in 1310 aa overlap (862-1923:595-1870)
840 850 860 870 880 890
pF1KA0 RESGEGRNSSDCRDNEIKKWQLAPLRKMGQPVLPRRSSEEKS-EKIPKESTTVTCTGEKA
: : :. .: . ... . .. :
CCDS33 NLVPKKSSFANVAAATPAIKKPPSGFKGTIPKRPWLSATPSSGASAARQAGPAPAAATAA
570 580 590 600 610 620
900 910 920 930 940
pF1KA0 SK--PGTHEKQEMKKKKVEKGVLNVHPA--------ASASKPSADQIRQSVRHSLKDILM
:: ::. .: : .: . :: :. :.:.. ::::..:.:::.::
CCDS33 SKKFPGSAALVGAVRKPVVPSVPMASPAPGRLGAMSAAPSQPNS-QIRQNIRRSLKEILW
630 640 650 660 670 680
950 960 970 980 990
pF1KA0 KRLTDSN-LKVPEEKAAKVATKIEKELFSFFRDTDAKYKNKYRSLMFNLKDPKNNILFKK
::..::. : . :....:.: .::::.:..:. :: .::.::::.:::::::::. ::..
CCDS33 KRVNDSDDLIMTENEVGKIALHIEKEMFNLFQVTDNRYKSKYRSIMFNLKDPKNQGLFHR
690 700 710 720 730 740
1000 1010 1020 1030 1040 1050
pF1KA0 VLKGEVTPDHLIRMSPEELASKELAAWRRRENRHTIEMIEKEQREVERRPITKITHKGEI
::. :.. .:.:..::::.::::..:..: : ..: : . : .. : ...
CCDS33 VLREEISLAKLVRLKPEELVSKELSTWKERPARSVMESRTKLHNESKK---TAPRQEAIP
750 760 770 780 790 800
1060 1070 1080 1090 1100 1110
pF1KA0 EIESDAPMKEQEAAMEIQEPAANKSLEKPEGSEKQKEEV-DSMSKDTTSQHRQHLFDLNC
..:.. :....: : :: : ... :: : .: .:: :::::::: :::::::
CCDS33 DLEDSPPVSDSE---EQQESA--RAV--PEKSTAPLLDVFSSMLKDTTSQHRAHLFDLNC
810 820 830 840 850
1120 1130 1140 1150
pF1KA0 KICIGRMAPPVDDLSPKKVKVVVGVAR-----KHS--------DNEAESIADAL------
::: :.. :. .::: :. ..: . ::. :. : . .:.
CCDS33 KICTGQVPSAEDEPAPKKQKLSASVKKEDLKSKHDSSAPDPAPDSADEVMPEAVPEVASE
860 870 880 890 900 910
1160
pF1KA0 -----------------------------------------------------------S
:
CCDS33 PGLESASHPNVDRTYFPGPPGDGHPEPSPLEDLSPCPASCGSGVVTTVTVSGRDPRTAPS
920 930 940 950 960 970
1170 1180 1190
pF1KA0 STSNILASE-------FFEEEKQESPKSTF---------------SPAPR----PEMPGT
:. . .:: . : .:. :: .. :: :: : :.
CCDS33 SSCTAVASAASRPDSTHMVEARQDVPKPVLTSVMVPKSILAKPSSSPDPRYLSVPPSPNI
980 990 1000 1010 1020 1030
1200 1210 1220 1230 1240
pF1KA0 VEVEST---------FLARLNFIWKGFINMPSVAKFVTKAYPVSGSPEYLTEDLPDSIQV
:: ::.::. :::::::: :::::::::::::: .::.:::::.:..
CCDS33 STSESRSPPEGDTTLFLSRLSTIWKGFINMQSVAKFVTKAYPVSGCFDYLSEDLPDTIHI
1040 1050 1060 1070 1080 1090
1250 1260 1270 1280 1290 1300
pF1KA0 GGRISPQTVWDYVEKIKASGTKEICVVRFTPVTEEDQISYTLLFAYFSSRKRYGVAANNM
::::.:.:::::: :.:.: .::.:..:: :.:::....: :..::::: :.::.:::
CCDS33 GGRIAPKTVWDYVGKLKSSVSKELCLIRFHPATEEEEVAYISLYSYFSSRGRFGVVANNN
1100 1110 1120 1130 1140 1150
1310 1320 1330 1340 1350
pF1KA0 KQVKDMYLIPLGATDKIPHPLVPFDGPGLELHRPNLLLGLIIRQKLKR------------
..:::.:::::.: : .: :.::.::::: :::..:::.: ::.::
CCDS33 RHVKDLYLIPLSAQDPVPSKLLPFEGPGLESPRPNIILGLVICQKIKRPANSGELDKMDE
1160 1170 1180 1190 1200 1210
1360 1370 1380
pF1KA0 -----------------------------QHSACASTSHIAETPESAPPIALPPDKKSKI
.. :.. . .. :: ..:: : . .
CCDS33 KRTRLQPEEADVPAYPKVATVPQSEKKPSKYPLCSADAAVSTTPPGSPPPPPPLPEPPVL
1220 1230 1240 1250 1260 1270
1390 1400 1410 1420 1430 1440
pF1KA0 EVSTEEAPEEENDFFNSFTTVLHKQRNKPQQNLQEDLPTAVEPLMEVT--KQEPPKP-LR
.: . : . . :.. . . . . .: .... :.. : :
CCDS33 KVLSSLKPAAPSPATAATTAAAASTAASSTASSASKTASPLEHILQTLFGKKKSFDPSAR
1280 1290 1300 1310 1320 1330
1450 1460 1470 1480 1490 1500
pF1KA0 FLPGVLIGWENQPTTLELANKPLPVDDILQSLLGTTGQVYDQAQSVMEQNTVKEIPFLNE
:: : ..: : :. .:. .:. .. :: ... ... :.. . :. :
CCDS33 EPPGSTAGLPQEPKT--TAEDGVPAPPLLDPIVQQFGQ-FSKDKALEEEEDDR--PYDPE
1340 1350 1360 1370 1380
1510 1520 1530 1540 1550
pF1KA0 QTNSKIEKTDNVEVTDGENKEIKVKVDNISESTDKSA---EIETSVVGSSSISAGSLTS-
. . . :. : :. .:.. . . .. : : :: .. ..... .: .
CCDS33 EEYDPERAFDTQLVERGRRHEVERAPEAAAAEREEVAYDPEDET-ILEEAKVTVDDLPNR
1390 1400 1410 1420 1430 1440
1560 1570 1580 1590 1600 1610
pF1KA0 --LSLRG----KPPDVSTEAFLTNLSIQSKQEETVESKEKTLKRQLQEDQENNLQDNQTS
..: .: . . : .: :.:. : .... . :::: :.::. :......
CCDS33 MCADVRRNSVERPAEPVAGAATPSLVEQQKMLEELNKQIEEQKRQL-EEQEEALRQQRAA
1450 1460 1470 1480 1490 1500
1620 1630 1640 1650
pF1KA0 --------------NSSPCRSNVGKGNIDGNVSCSENLVANTARSPQFINLKRDPRQAAG
: : .:.. :... . . : . :. . : : .::::::
CCDS33 VGVSMAHFSVSDALMSPPPKSSLPKAELFQQEQQSADKPASLPPASQASN-HRDPRQA--
1510 1520 1530 1540 1550 1560
1660 1670 1680 1690 1700
pF1KA0 RSQPVTTSESKDGDSCR---NGEKHMLPGLSHNKEHLTEQINV---EEK------LCSAE
: . :.:.. : : . :: . .. :. : . ::: :.:
CCDS33 RRLATETGEGEGEPLSRLSARGAQGALPERDASRGGLVGQAPMPVPEEKEPASSPWASGE
1570 1580 1590 1600 1610 1620
1710 1720 1730 1740 1750
pF1KA0 KNSCVQQSDNLKVAQNSPSVE-NIQTSQAEQAK----P------LQEDILMQNIETVHPF
: ...:. :. . . .. :.:. :. : :: . .:. ::
CCDS33 KPPAGSEQDGWKAEPGEGTRPATVGDSSARPARRVLLPTPPCGALQPGFPLQHDGERDPF
1630 1640 1650 1660 1670 1680
1760 1770 1780 1790 1800
pF1KA0 R------RGSAVATSHFEVGNTCPSEFPSKSI---TFTSRSTSPRTSTNFSPMRPQQPNL
. .:......: . : :.: : .: . : . . .:. : :.
CCDS33 TCPGFASQDKALGSAQYEDPRNLHSAGRSSSPAGETEGDREPQARPGEGTAPLPP--PGQ
1690 1700 1710 1720 1730 1740
1810 1820 1830 1840
pF1KA0 QHLKSSPP--GFPFPGP----------PNFP-PQSMFGFPPHLPP--------PLLPPPG
. :.:: : ::: :::: :. : : .: : :::.
CCDS33 KVGGSQPPFQGQREPGPHALGMSGLHGPNFPGPR---GPAPPFPEENIASNDGPRGPPPA
1750 1760 1770 1780 1790
1850 1860 1870 1880 1890 1900
pF1KA0 -FGFAQNPMVPWPPVVHLPGQPQRMMGPLSQASRYIGPQNFYQVKDIRRPERRHSDPWGR
:: ..:. : : :: . : ::.. : . ::. :: :.
CCDS33 RFGAQKGPIPSLFSGQHGPPPYGDSRGP--SPSYLGGPRG---VAPSQFEERK--DPHGE
1800 1810 1820 1830 1840 1850
1910 1920 1930 1940 1950 1960
pF1KA0 QDQQQLDRPFNRGKGDRQRFYSDSHHLKRERHEKEWEQESERHRRRDRSQDKDRDRKSRE
. . : : :.:. : .:
CCDS33 KREFQ-DAPYNEVTGAPAQFEGTEQAPFLGSRGGAPFQFGGQRRPLLSQLKGPRGGPPPS
1860 1870 1880 1890 1900 1910
>>CCDS13508.2 DIDO1 gene_id:11083|Hs108|chr20 (1189 aa)
initn: 1390 init1: 636 opt: 644 Z-score: 328.9 bits: 73.6 E(32554): 5.3e-12
Smith-Waterman score: 1068; 37.8% identity (59.0% similar) in 598 aa overlap (862-1333:595-1181)
840 850 860 870 880 890
pF1KA0 RESGEGRNSSDCRDNEIKKWQLAPLRKMGQPVLPRRSSEEKS-EKIPKESTTVTCTGEKA
: : :. .: . ... . .. :
CCDS13 NLVPKKSSFANVAAATPAIKKPPSGFKGTIPKRPWLSATPSSGASAARQAGPAPAAATAA
570 580 590 600 610 620
900 910 920 930 940
pF1KA0 SK--PGTHEKQEMKKKKVEKGVLNVHPA--------ASASKPSADQIRQSVRHSLKDILM
:: ::. .: : .: . :: :. :.:.. ::::..:.:::.::
CCDS13 SKKFPGSAALVGAVRKPVVPSVPMASPAPGRLGAMSAAPSQPNS-QIRQNIRRSLKEILW
630 640 650 660 670 680
950 960 970 980 990
pF1KA0 KRLTDSN-LKVPEEKAAKVATKIEKELFSFFRDTDAKYKNKYRSLMFNLKDPKNNILFKK
::..::. : . :....:.: .::::.:..:. :: .::.::::.:::::::::. ::..
CCDS13 KRVNDSDDLIMTENEVGKIALHIEKEMFNLFQVTDNRYKSKYRSIMFNLKDPKNQGLFHR
690 700 710 720 730 740
1000 1010 1020 1030 1040 1050
pF1KA0 VLKGEVTPDHLIRMSPEELASKELAAWRRRENRHTIEMIEKEQREVERRPITKITHKGEI
::. :.. .:.:..::::.::::..:..: : ..: : . : .. : ...
CCDS13 VLREEISLAKLVRLKPEELVSKELSTWKERPARSVMESRTKLHNESKK---TAPRQEAIP
750 760 770 780 790 800
1060 1070 1080 1090 1100 1110
pF1KA0 EIESDAPMKEQEAAMEIQEPAANKSLEKPEGSEKQKEEV-DSMSKDTTSQHRQHLFDLNC
..:.. :....: : :: : ... :: : .: .:: :::::::: :::::::
CCDS13 DLEDSPPVSDSE---EQQESA--RAV--PEKSTAPLLDVFSSMLKDTTSQHRAHLFDLNC
810 820 830 840 850
1120 1130 1140 1150
pF1KA0 KICIGRMAPPVDDLSPKKVKVVVGVAR-----KHS--------DNEAESIADAL------
::: :.. :. .::: :. ..: . ::. :. : . .:.
CCDS13 KICTGQVPSAEDEPAPKKQKLSASVKKEDLKSKHDSSAPDPAPDSADEVMPEAVPEVASE
860 870 880 890 900 910
1160
pF1KA0 -----------------------------------------------------------S
:
CCDS13 PGLESASHPNVDRTYFPGPPGDGHPEPSPLEDLSPCPASCGSGVVTTVTVSGRDPRTAPS
920 930 940 950 960 970
1170 1180 1190
pF1KA0 STSNILASE-------FFEEEKQESPKSTF---------------SPAPR----PEMPGT
:. . .:: . : .:. :: .. :: :: : :.
CCDS13 SSCTAVASAASRPDSTHMVEARQDVPKPVLTSVMVPKSILAKPSSSPDPRYLSVPPSPNI
980 990 1000 1010 1020 1030
1200 1210 1220 1230 1240
pF1KA0 VEVEST---------FLARLNFIWKGFINMPSVAKFVTKAYPVSGSPEYLTEDLPDSIQV
:: ::.::. :::::::: :::::::::::::: .::.:::::.:..
CCDS13 STSESRSPPEGDTTLFLSRLSTIWKGFINMQSVAKFVTKAYPVSGCFDYLSEDLPDTIHI
1040 1050 1060 1070 1080 1090
1250 1260 1270 1280 1290 1300
pF1KA0 GGRISPQTVWDYVEKIKASGTKEICVVRFTPVTEEDQISYTLLFAYFSSRKRYGVAANNM
::::.:.:::::: :.:.: .::.:..:: :.:::....: :..::::: :.::.:::
CCDS13 GGRIAPKTVWDYVGKLKSSVSKELCLIRFHPATEEEEVAYISLYSYFSSRGRFGVVANNN
1100 1110 1120 1130 1140 1150
1310 1320 1330 1340 1350 1360
pF1KA0 KQVKDMYLIPLGATDKIPHPLVPFDGPGLELHRPNLLLGLIIRQKLKRQHSACASTSHIA
..:::.:::::.: : .: :.::.:::
CCDS13 RHVKDLYLIPLSAQDPVPSKLLPFEGPGKRRLSGWR
1160 1170 1180
2039 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Wed Nov 2 18:25:58 2016 done: Wed Nov 2 18:25:59 2016
Total Scan time: 5.750 Total Display time: 0.320
Function used was FASTA [36.3.4 Apr, 2011]