FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE2399, 2285 aa
1>>>pF1KE2399 2285 - 2285 aa - 2285 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 16.4492+/-0.00138; mu= -23.4850+/- 0.083
mean_var=736.8565+/-152.284, 0's: 0 Z-trim(116.3): 147 B-trim: 0 in 0/54
Lambda= 0.047248
statistics sampled from 16789 (16927) to 16789 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.767), E-opt: 0.2 (0.52), width: 16
Scan time: 9.190
The best scores are: opt bits E(32554)
CCDS285.1 ARID1A gene_id:8289|Hs108|chr1 (2285) 15878 1099.3 0
CCDS44091.1 ARID1A gene_id:8289|Hs108|chr1 (2068) 9644 674.4 1.5e-192
CCDS5251.2 ARID1B gene_id:57492|Hs108|chr6 (2236) 3795 275.7 1.7e-72
CCDS55072.1 ARID1B gene_id:57492|Hs108|chr6 (2249) 3736 271.7 2.8e-71
>>CCDS285.1 ARID1A gene_id:8289|Hs108|chr1 (2285 aa)
initn: 15878 init1: 15878 opt: 15878 Z-score: 5866.2 bits: 1099.3 E(32554): 0
Smith-Waterman score: 15878; 100.0% identity (100.0% similar) in 2285 aa overlap (1-2285:1-2285)
10 20 30 40 50 60
pF1KE2 MAAQVAPAAASSLGNPPPPPPSELKKAEQQQREEAGGEAAAAAAAERGEMKAAAGQESEG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 MAAQVAPAAASSLGNPPPPPPSELKKAEQQQREEAGGEAAAAAAAERGEMKAAAGQESEG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE2 PAVGPPQPLGKELQDGAESNGGGGGGGAGSGGGPGAEPDLKNSNGNAGPRPALNNNLTEP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 PAVGPPQPLGKELQDGAESNGGGGGGGAGSGGGPGAEPDLKNSNGNAGPRPALNNNLTEP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE2 PGGGGGGSSDGVGAPPHSAAAALPPPAYGFGQPYGRSPSAVAAAAAAVFHQQHGGQQSPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 PGGGGGGSSDGVGAPPHSAAAALPPPAYGFGQPYGRSPSAVAAAAAAVFHQQHGGQQSPG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE2 LAALQSGGGGGLEPYAGPQQNSHDHGFPNHQYNSYYPNRSAYPPPAPAYALSSPRGGTPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 LAALQSGGGGGLEPYAGPQQNSHDHGFPNHQYNSYYPNRSAYPPPAPAYALSSPRGGTPG
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE2 SGAAAAAGSKPPPSSSASASSSSSSFAQQRFGAMGGGGPSAAGGGTPQPTATPTLNQLLT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 SGAAAAAGSKPPPSSSASASSSSSSFAQQRFGAMGGGGPSAAGGGTPQPTATPTLNQLLT
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE2 SPSSARGYQGYPGGDYSGGPQDGGAGKGPADMASQCWGAAAAAAAAAAASGGAQQRSHHA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 SPSSARGYQGYPGGDYSGGPQDGGAGKGPADMASQCWGAAAAAAAAAAASGGAQQRSHHA
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE2 PMSPGSSGGGGQPLARTPQPSSPMDQMGKMRPQPYGGTNPYSQQQGPPSGPQQGHGYPGQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 PMSPGSSGGGGQPLARTPQPSSPMDQMGKMRPQPYGGTNPYSQQQGPPSGPQQGHGYPGQ
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE2 PYGSQTPQRYPMTMQGRAQSAMGGLSYTQQIPPYGQQGPSGYGQQGQTPYYNQQSPHPQQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 PYGSQTPQRYPMTMQGRAQSAMGGLSYTQQIPPYGQQGPSGYGQQGQTPYYNQQSPHPQQ
430 440 450 460 470 480
490 500 510 520 530 540
pF1KE2 QQPPYSQQPPSQTPHAQPSYQQQPQSQPPQLQSSQPPYSQQPSQPPHQQSPAPYPSQQST
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 QQPPYSQQPPSQTPHAQPSYQQQPQSQPPQLQSSQPPYSQQPSQPPHQQSPAPYPSQQST
490 500 510 520 530 540
550 560 570 580 590 600
pF1KE2 TQQHPQSQPPYSQPQAQSPYQQQQPQQPAPSTLSQQAAYPQPQSQQSQQTAYSQQRFPPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 TQQHPQSQPPYSQPQAQSPYQQQQPQQPAPSTLSQQAAYPQPQSQQSQQTAYSQQRFPPP
550 560 570 580 590 600
610 620 630 640 650 660
pF1KE2 QELSQDSFGSQASSAPSMTSSKGGQEDMNLSLQSRPSSLPDLSGSIDDLPMGTEGALSPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 QELSQDSFGSQASSAPSMTSSKGGQEDMNLSLQSRPSSLPDLSGSIDDLPMGTEGALSPG
610 620 630 640 650 660
670 680 690 700 710 720
pF1KE2 VSTSGISSSQGEQSNPAQSPFSPHTSPHLPGIRGPSPSPVGSPASVAQSRSGPLSPAAVP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 VSTSGISSSQGEQSNPAQSPFSPHTSPHLPGIRGPSPSPVGSPASVAQSRSGPLSPAAVP
670 680 690 700 710 720
730 740 750 760 770 780
pF1KE2 GNQMPPRPPSGQSDSIMHPSMNQSSIAQDRGYMQRNPQMPQYSSPQPGSALSPRQPSGGQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 GNQMPPRPPSGQSDSIMHPSMNQSSIAQDRGYMQRNPQMPQYSSPQPGSALSPRQPSGGQ
730 740 750 760 770 780
790 800 810 820 830 840
pF1KE2 IHTGMGSYQQNSMGSYGPQGGQYGPQGGYPRQPNYNALPNANYPSAGMAGGINPMGAGGQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 IHTGMGSYQQNSMGSYGPQGGQYGPQGGYPRQPNYNALPNANYPSAGMAGGINPMGAGGQ
790 800 810 820 830 840
850 860 870 880 890 900
pF1KE2 MHGQPGIPPYGTLPPGRMSHASMGNRPYGPNMANMPPQVGSGMCPPPGGMNRKTQETAVA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 MHGQPGIPPYGTLPPGRMSHASMGNRPYGPNMANMPPQVGSGMCPPPGGMNRKTQETAVA
850 860 870 880 890 900
910 920 930 940 950 960
pF1KE2 MHVAANSIQNRPPGYPNMNQGGMMGTGPPYGQGINSMAGMINPQGPPYSMGGTMANNSAG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 MHVAANSIQNRPPGYPNMNQGGMMGTGPPYGQGINSMAGMINPQGPPYSMGGTMANNSAG
910 920 930 940 950 960
970 980 990 1000 1010 1020
pF1KE2 MAASPEMMGLGDVKLTPATKMNNKADGTPKTESKSKKSSSSTTTNEKITKLYELGGEPER
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 MAASPEMMGLGDVKLTPATKMNNKADGTPKTESKSKKSSSSTTTNEKITKLYELGGEPER
970 980 990 1000 1010 1020
1030 1040 1050 1060 1070 1080
pF1KE2 KMWVDRYLAFTEEKAMGMTNLPAVGRKPLDLYRLYVSVKEIGGLTQVNKNKKWRELATNL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 KMWVDRYLAFTEEKAMGMTNLPAVGRKPLDLYRLYVSVKEIGGLTQVNKNKKWRELATNL
1030 1040 1050 1060 1070 1080
1090 1100 1110 1120 1130 1140
pF1KE2 NVGTSSSAASSLKKQYIQCLYAFECKIERGEDPPPDIFAAADSKKSQPKIQPPSPAGSGS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 NVGTSSSAASSLKKQYIQCLYAFECKIERGEDPPPDIFAAADSKKSQPKIQPPSPAGSGS
1090 1100 1110 1120 1130 1140
1150 1160 1170 1180 1190 1200
pF1KE2 MQGPQTPQSTSSSMAEGGDLKPPTPASTPHSQIPPLPGMSRSNSVGIQDAFNDGSDSTFQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 MQGPQTPQSTSSSMAEGGDLKPPTPASTPHSQIPPLPGMSRSNSVGIQDAFNDGSDSTFQ
1150 1160 1170 1180 1190 1200
1210 1220 1230 1240 1250 1260
pF1KE2 KRNSMTPNPGYQPSMNTSDMMGRMSYEPNKDPYGSMRKAPGSDPFMSSGQGPNGGMGDPY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 KRNSMTPNPGYQPSMNTSDMMGRMSYEPNKDPYGSMRKAPGSDPFMSSGQGPNGGMGDPY
1210 1220 1230 1240 1250 1260
1270 1280 1290 1300 1310 1320
pF1KE2 SRAAGPGLGNVAMGPRQHYPYGGPYDRVRTEPGIGPEGNMSTGAPQPNLMPSNPDSGMYS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 SRAAGPGLGNVAMGPRQHYPYGGPYDRVRTEPGIGPEGNMSTGAPQPNLMPSNPDSGMYS
1270 1280 1290 1300 1310 1320
1330 1340 1350 1360 1370 1380
pF1KE2 PSRYPPQQQQQQQQRHDSYGNQFSTQGTPSGSPFPSQQTTMYQQQQQNYKRPMDGTYGPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 PSRYPPQQQQQQQQRHDSYGNQFSTQGTPSGSPFPSQQTTMYQQQQQNYKRPMDGTYGPP
1330 1340 1350 1360 1370 1380
1390 1400 1410 1420 1430 1440
pF1KE2 AKRHEGEMYSVPYSTGQGQPQQQQLPPAQPQPASQQQAAQPSPQQDVYNQYGNAYPATAT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 AKRHEGEMYSVPYSTGQGQPQQQQLPPAQPQPASQQQAAQPSPQQDVYNQYGNAYPATAT
1390 1400 1410 1420 1430 1440
1450 1460 1470 1480 1490 1500
pF1KE2 AATERRPAGGPQNQFPFQFGRDRVSAPPGTNAQQNMPPQMMGGPIQASAEVAQQGTMWQG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 AATERRPAGGPQNQFPFQFGRDRVSAPPGTNAQQNMPPQMMGGPIQASAEVAQQGTMWQG
1450 1460 1470 1480 1490 1500
1510 1520 1530 1540 1550 1560
pF1KE2 RNDMTYNYANRQSTGSAPQGPAYHGVNRTDEMLHTDQRANHEGSWPSHGTRQPPYGPSAP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 RNDMTYNYANRQSTGSAPQGPAYHGVNRTDEMLHTDQRANHEGSWPSHGTRQPPYGPSAP
1510 1520 1530 1540 1550 1560
1570 1580 1590 1600 1610 1620
pF1KE2 VPPMTRPPPSNYQPPPSMQNHIPQVSSPAPLPRPMENRTSPSKSPFLHSGMKMQKAGPPV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 VPPMTRPPPSNYQPPPSMQNHIPQVSSPAPLPRPMENRTSPSKSPFLHSGMKMQKAGPPV
1570 1580 1590 1600 1610 1620
1630 1640 1650 1660 1670 1680
pF1KE2 PASHIAPAPVQPPMIRRDITFPPGSVEATQPVLKQRRRLTMKDIGTPEAWRVMMSLKSGL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 PASHIAPAPVQPPMIRRDITFPPGSVEATQPVLKQRRRLTMKDIGTPEAWRVMMSLKSGL
1630 1640 1650 1660 1670 1680
1690 1700 1710 1720 1730 1740
pF1KE2 LAESTWALDTINILLYDDNSIMTFNLSQLPGLLELLVEYFRRCLIEIFGILKEYEVGDPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 LAESTWALDTINILLYDDNSIMTFNLSQLPGLLELLVEYFRRCLIEIFGILKEYEVGDPG
1690 1700 1710 1720 1730 1740
1750 1760 1770 1780 1790 1800
pF1KE2 QRTLLDPGRFSKVSSPAPMEGGEEEEELLGPKLEEEEEEEVVENDEEIAFSGKDKPASEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 QRTLLDPGRFSKVSSPAPMEGGEEEEELLGPKLEEEEEEEVVENDEEIAFSGKDKPASEN
1750 1760 1770 1780 1790 1800
1810 1820 1830 1840 1850 1860
pF1KE2 SEEKLISKFDKLPVKIVQKNDPFVVDCSDKLGRVQEFDSGLLHWRIGGGDTTEHIQTHFE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 SEEKLISKFDKLPVKIVQKNDPFVVDCSDKLGRVQEFDSGLLHWRIGGGDTTEHIQTHFE
1810 1820 1830 1840 1850 1860
1870 1880 1890 1900 1910 1920
pF1KE2 SKTELLPSRPHAPCPPAPRKHVTTAEGTPGTTDQEGPPPDGPPEKRITATMDDMLSTRSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 SKTELLPSRPHAPCPPAPRKHVTTAEGTPGTTDQEGPPPDGPPEKRITATMDDMLSTRSS
1870 1880 1890 1900 1910 1920
1930 1940 1950 1960 1970 1980
pF1KE2 TLTEDGAKSSEAIKESSKFPFGISPAQSHRNIKILEDEPHSKDETPLCTLLDWQDSLAKR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 TLTEDGAKSSEAIKESSKFPFGISPAQSHRNIKILEDEPHSKDETPLCTLLDWQDSLAKR
1930 1940 1950 1960 1970 1980
1990 2000 2010 2020 2030 2040
pF1KE2 CVCVSNTIRSLSFVPGNDFEMSKHPGLLLILGKLILLHHKHPERKQAPLTYEKEEEQDQG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 CVCVSNTIRSLSFVPGNDFEMSKHPGLLLILGKLILLHHKHPERKQAPLTYEKEEEQDQG
1990 2000 2010 2020 2030 2040
2050 2060 2070 2080 2090 2100
pF1KE2 VSCNKVEWWWDCLEMLRENTLVTLANISGQLDLSPYPESICLPVLDGLLHWAVCPSAEAQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 VSCNKVEWWWDCLEMLRENTLVTLANISGQLDLSPYPESICLPVLDGLLHWAVCPSAEAQ
2050 2060 2070 2080 2090 2100
2110 2120 2130 2140 2150 2160
pF1KE2 DPFSTLGPNAVLSPQRLVLETLSKLSIQDNNVDLILATPPFSRLEKLYSTMVRFLSDRKN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 DPFSTLGPNAVLSPQRLVLETLSKLSIQDNNVDLILATPPFSRLEKLYSTMVRFLSDRKN
2110 2120 2130 2140 2150 2160
2170 2180 2190 2200 2210 2220
pF1KE2 PVCREMAVVLLANLAQGDSLAARAIAVQKGSIGNLLGFLEDSLAATQFQQSQASLLHMQN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 PVCREMAVVLLANLAQGDSLAARAIAVQKGSIGNLLGFLEDSLAATQFQQSQASLLHMQN
2170 2180 2190 2200 2210 2220
2230 2240 2250 2260 2270 2280
pF1KE2 PPFEPTSVDMMRRAARALLALAKVDENHSEFTLYESRLLDISVSPLMNSLVSQVICDVLF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS28 PPFEPTSVDMMRRAARALLALAKVDENHSEFTLYESRLLDISVSPLMNSLVSQVICDVLF
2230 2240 2250 2260 2270 2280
pF1KE2 LIGQS
:::::
CCDS28 LIGQS
>>CCDS44091.1 ARID1A gene_id:8289|Hs108|chr1 (2068 aa)
initn: 9618 init1: 9618 opt: 9644 Z-score: 3570.2 bits: 674.4 E(32554): 1.5e-192
Smith-Waterman score: 13854; 90.5% identity (90.5% similar) in 2285 aa overlap (1-2285:1-2068)
10 20 30 40 50 60
pF1KE2 MAAQVAPAAASSLGNPPPPPPSELKKAEQQQREEAGGEAAAAAAAERGEMKAAAGQESEG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MAAQVAPAAASSLGNPPPPPPSELKKAEQQQREEAGGEAAAAAAAERGEMKAAAGQESEG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE2 PAVGPPQPLGKELQDGAESNGGGGGGGAGSGGGPGAEPDLKNSNGNAGPRPALNNNLTEP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 PAVGPPQPLGKELQDGAESNGGGGGGGAGSGGGPGAEPDLKNSNGNAGPRPALNNNLTEP
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE2 PGGGGGGSSDGVGAPPHSAAAALPPPAYGFGQPYGRSPSAVAAAAAAVFHQQHGGQQSPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 PGGGGGGSSDGVGAPPHSAAAALPPPAYGFGQPYGRSPSAVAAAAAAVFHQQHGGQQSPG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE2 LAALQSGGGGGLEPYAGPQQNSHDHGFPNHQYNSYYPNRSAYPPPAPAYALSSPRGGTPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 LAALQSGGGGGLEPYAGPQQNSHDHGFPNHQYNSYYPNRSAYPPPAPAYALSSPRGGTPG
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE2 SGAAAAAGSKPPPSSSASASSSSSSFAQQRFGAMGGGGPSAAGGGTPQPTATPTLNQLLT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 SGAAAAAGSKPPPSSSASASSSSSSFAQQRFGAMGGGGPSAAGGGTPQPTATPTLNQLLT
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE2 SPSSARGYQGYPGGDYSGGPQDGGAGKGPADMASQCWGAAAAAAAAAAASGGAQQRSHHA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 SPSSARGYQGYPGGDYSGGPQDGGAGKGPADMASQCWGAAAAAAAAAAASGGAQQRSHHA
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE2 PMSPGSSGGGGQPLARTPQPSSPMDQMGKMRPQPYGGTNPYSQQQGPPSGPQQGHGYPGQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 PMSPGSSGGGGQPLARTPQPSSPMDQMGKMRPQPYGGTNPYSQQQGPPSGPQQGHGYPGQ
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE2 PYGSQTPQRYPMTMQGRAQSAMGGLSYTQQIPPYGQQGPSGYGQQGQTPYYNQQSPHPQQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 PYGSQTPQRYPMTMQGRAQSAMGGLSYTQQIPPYGQQGPSGYGQQGQTPYYNQQSPHPQQ
430 440 450 460 470 480
490 500 510 520 530 540
pF1KE2 QQPPYSQQPPSQTPHAQPSYQQQPQSQPPQLQSSQPPYSQQPSQPPHQQSPAPYPSQQST
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 QQPPYSQQPPSQTPHAQPSYQQQPQSQPPQLQSSQPPYSQQPSQPPHQQSPAPYPSQQST
490 500 510 520 530 540
550 560 570 580 590 600
pF1KE2 TQQHPQSQPPYSQPQAQSPYQQQQPQQPAPSTLSQQAAYPQPQSQQSQQTAYSQQRFPPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 TQQHPQSQPPYSQPQAQSPYQQQQPQQPAPSTLSQQAAYPQPQSQQSQQTAYSQQRFPPP
550 560 570 580 590 600
610 620 630 640 650 660
pF1KE2 QELSQDSFGSQASSAPSMTSSKGGQEDMNLSLQSRPSSLPDLSGSIDDLPMGTEGALSPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 QELSQDSFGSQASSAPSMTSSKGGQEDMNLSLQSRPSSLPDLSGSIDDLPMGTEGALSPG
610 620 630 640 650 660
670 680 690 700 710 720
pF1KE2 VSTSGISSSQGEQSNPAQSPFSPHTSPHLPGIRGPSPSPVGSPASVAQSRSGPLSPAAVP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 VSTSGISSSQGEQSNPAQSPFSPHTSPHLPGIRGPSPSPVGSPASVAQSRSGPLSPAAVP
670 680 690 700 710 720
730 740 750 760 770 780
pF1KE2 GNQMPPRPPSGQSDSIMHPSMNQSSIAQDRGYMQRNPQMPQYSSPQPGSALSPRQPSGGQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 GNQMPPRPPSGQSDSIMHPSMNQSSIAQDRGYMQRNPQMPQYSSPQPGSALSPRQPSGGQ
730 740 750 760 770 780
790 800 810 820 830 840
pF1KE2 IHTGMGSYQQNSMGSYGPQGGQYGPQGGYPRQPNYNALPNANYPSAGMAGGINPMGAGGQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 IHTGMGSYQQNSMGSYGPQGGQYGPQGGYPRQPNYNALPNANYPSAGMAGGINPMGAGGQ
790 800 810 820 830 840
850 860 870 880 890 900
pF1KE2 MHGQPGIPPYGTLPPGRMSHASMGNRPYGPNMANMPPQVGSGMCPPPGGMNRKTQETAVA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MHGQPGIPPYGTLPPGRMSHASMGNRPYGPNMANMPPQVGSGMCPPPGGMNRKTQETAVA
850 860 870 880 890 900
910 920 930 940 950 960
pF1KE2 MHVAANSIQNRPPGYPNMNQGGMMGTGPPYGQGINSMAGMINPQGPPYSMGGTMANNSAG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MHVAANSIQNRPPGYPNMNQGGMMGTGPPYGQGINSMAGMINPQGPPYSMGGTMANNSAG
910 920 930 940 950 960
970 980 990 1000 1010 1020
pF1KE2 MAASPEMMGLGDVKLTPATKMNNKADGTPKTESKSKKSSSSTTTNEKITKLYELGGEPER
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MAASPEMMGLGDVKLTPATKMNNKADGTPKTESKSKKSSSSTTTNEKITKLYELGGEPER
970 980 990 1000 1010 1020
1030 1040 1050 1060 1070 1080
pF1KE2 KMWVDRYLAFTEEKAMGMTNLPAVGRKPLDLYRLYVSVKEIGGLTQVNKNKKWRELATNL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 KMWVDRYLAFTEEKAMGMTNLPAVGRKPLDLYRLYVSVKEIGGLTQVNKNKKWRELATNL
1030 1040 1050 1060 1070 1080
1090 1100 1110 1120 1130 1140
pF1KE2 NVGTSSSAASSLKKQYIQCLYAFECKIERGEDPPPDIFAAADSKKSQPKIQPPSPAGSGS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 NVGTSSSAASSLKKQYIQCLYAFECKIERGEDPPPDIFAAADSKKSQPKIQPPSPAGSGS
1090 1100 1110 1120 1130 1140
1150 1160 1170 1180 1190 1200
pF1KE2 MQGPQTPQSTSSSMAEGGDLKPPTPASTPHSQIPPLPGMSRSNSVGIQDAFNDGSDSTFQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 MQGPQTPQSTSSSMAEGGDLKPPTPASTPHSQIPPLPGMSRSNSVGIQDAFNDGSDSTFQ
1150 1160 1170 1180 1190 1200
1210 1220 1230 1240 1250 1260
pF1KE2 KRNSMTPNPGYQPSMNTSDMMGRMSYEPNKDPYGSMRKAPGSDPFMSSGQGPNGGMGDPY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 KRNSMTPNPGYQPSMNTSDMMGRMSYEPNKDPYGSMRKAPGSDPFMSSGQGPNGGMGDPY
1210 1220 1230 1240 1250 1260
1270 1280 1290 1300 1310 1320
pF1KE2 SRAAGPGLGNVAMGPRQHYPYGGPYDRVRTEPGIGPEGNMSTGAPQPNLMPSNPDSGMYS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 SRAAGPGLGNVAMGPRQHYPYGGPYDRVRTEPGIGPEGNMSTGAPQPNLMPSNPDSGMYS
1270 1280 1290 1300 1310 1320
1330 1340 1350 1360 1370 1380
pF1KE2 PSRYPPQQQQQQQQRHDSYGNQFSTQGTPSGSPFPSQQTTMYQQQQQNYKRPMDGTYGPP
:::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 PSRYPPQQQQQQQQRHDSYGNQFSTQGTPSGSPFPSQQTTMYQQQQQ-------------
1330 1340 1350 1360
1390 1400 1410 1420 1430 1440
pF1KE2 AKRHEGEMYSVPYSTGQGQPQQQQLPPAQPQPASQQQAAQPSPQQDVYNQYGNAYPATAT
CCDS44 ------------------------------------------------------------
1450 1460 1470 1480 1490 1500
pF1KE2 AATERRPAGGPQNQFPFQFGRDRVSAPPGTNAQQNMPPQMMGGPIQASAEVAQQGTMWQG
CCDS44 ------------------------------------------------------------
1510 1520 1530 1540 1550 1560
pF1KE2 RNDMTYNYANRQSTGSAPQGPAYHGVNRTDEMLHTDQRANHEGSWPSHGTRQPPYGPSAP
CCDS44 ------------------------------------------------------------
1570 1580 1590 1600 1610 1620
pF1KE2 VPPMTRPPPSNYQPPPSMQNHIPQVSSPAPLPRPMENRTSPSKSPFLHSGMKMQKAGPPV
::::::::::::::::::::::::::::::::::::
CCDS44 ------------------------VSSPAPLPRPMENRTSPSKSPFLHSGMKMQKAGPPV
1370 1380 1390 1400
1630 1640 1650 1660 1670 1680
pF1KE2 PASHIAPAPVQPPMIRRDITFPPGSVEATQPVLKQRRRLTMKDIGTPEAWRVMMSLKSGL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 PASHIAPAPVQPPMIRRDITFPPGSVEATQPVLKQRRRLTMKDIGTPEAWRVMMSLKSGL
1410 1420 1430 1440 1450 1460
1690 1700 1710 1720 1730 1740
pF1KE2 LAESTWALDTINILLYDDNSIMTFNLSQLPGLLELLVEYFRRCLIEIFGILKEYEVGDPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 LAESTWALDTINILLYDDNSIMTFNLSQLPGLLELLVEYFRRCLIEIFGILKEYEVGDPG
1470 1480 1490 1500 1510 1520
1750 1760 1770 1780 1790 1800
pF1KE2 QRTLLDPGRFSKVSSPAPMEGGEEEEELLGPKLEEEEEEEVVENDEEIAFSGKDKPASEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 QRTLLDPGRFSKVSSPAPMEGGEEEEELLGPKLEEEEEEEVVENDEEIAFSGKDKPASEN
1530 1540 1550 1560 1570 1580
1810 1820 1830 1840 1850 1860
pF1KE2 SEEKLISKFDKLPVKIVQKNDPFVVDCSDKLGRVQEFDSGLLHWRIGGGDTTEHIQTHFE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 SEEKLISKFDKLPVKIVQKNDPFVVDCSDKLGRVQEFDSGLLHWRIGGGDTTEHIQTHFE
1590 1600 1610 1620 1630 1640
1870 1880 1890 1900 1910 1920
pF1KE2 SKTELLPSRPHAPCPPAPRKHVTTAEGTPGTTDQEGPPPDGPPEKRITATMDDMLSTRSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 SKTELLPSRPHAPCPPAPRKHVTTAEGTPGTTDQEGPPPDGPPEKRITATMDDMLSTRSS
1650 1660 1670 1680 1690 1700
1930 1940 1950 1960 1970 1980
pF1KE2 TLTEDGAKSSEAIKESSKFPFGISPAQSHRNIKILEDEPHSKDETPLCTLLDWQDSLAKR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 TLTEDGAKSSEAIKESSKFPFGISPAQSHRNIKILEDEPHSKDETPLCTLLDWQDSLAKR
1710 1720 1730 1740 1750 1760
1990 2000 2010 2020 2030 2040
pF1KE2 CVCVSNTIRSLSFVPGNDFEMSKHPGLLLILGKLILLHHKHPERKQAPLTYEKEEEQDQG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 CVCVSNTIRSLSFVPGNDFEMSKHPGLLLILGKLILLHHKHPERKQAPLTYEKEEEQDQG
1770 1780 1790 1800 1810 1820
2050 2060 2070 2080 2090 2100
pF1KE2 VSCNKVEWWWDCLEMLRENTLVTLANISGQLDLSPYPESICLPVLDGLLHWAVCPSAEAQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 VSCNKVEWWWDCLEMLRENTLVTLANISGQLDLSPYPESICLPVLDGLLHWAVCPSAEAQ
1830 1840 1850 1860 1870 1880
2110 2120 2130 2140 2150 2160
pF1KE2 DPFSTLGPNAVLSPQRLVLETLSKLSIQDNNVDLILATPPFSRLEKLYSTMVRFLSDRKN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 DPFSTLGPNAVLSPQRLVLETLSKLSIQDNNVDLILATPPFSRLEKLYSTMVRFLSDRKN
1890 1900 1910 1920 1930 1940
2170 2180 2190 2200 2210 2220
pF1KE2 PVCREMAVVLLANLAQGDSLAARAIAVQKGSIGNLLGFLEDSLAATQFQQSQASLLHMQN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 PVCREMAVVLLANLAQGDSLAARAIAVQKGSIGNLLGFLEDSLAATQFQQSQASLLHMQN
1950 1960 1970 1980 1990 2000
2230 2240 2250 2260 2270 2280
pF1KE2 PPFEPTSVDMMRRAARALLALAKVDENHSEFTLYESRLLDISVSPLMNSLVSQVICDVLF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS44 PPFEPTSVDMMRRAARALLALAKVDENHSEFTLYESRLLDISVSPLMNSLVSQVICDVLF
2010 2020 2030 2040 2050 2060
pF1KE2 LIGQS
:::::
CCDS44 LIGQS
>>CCDS5251.2 ARID1B gene_id:57492|Hs108|chr6 (2236 aa)
initn: 5030 init1: 1919 opt: 3795 Z-score: 1415.1 bits: 275.7 E(32554): 1.7e-72
Smith-Waterman score: 7243; 52.3% identity (70.1% similar) in 2357 aa overlap (25-2284:116-2235)
10 20 30 40 50
pF1KE2 MAAQVAPAAASSLGNPPPPPPSELKKAEQQQREEAGGEAAAAAAAERGEMKAAA
.. .:::... . . . . :..
CCDS52 HHHHAHHHHHHAHHLHHHHALQQQLNQFQQQQQQQQQQQQQQQQQQHPISNNNSLGGAGG
90 100 110 120 130 140
60 70 80 90 100 110
pF1KE2 GQESEGPAVGPPQPLG-KELQDGAESNGGGGGGGAGSGGGPGAEPDLKNSNGNAGPRPAL
: . :: . :: : :. :.... : . : : : . . :. .:.:
CCDS52 GAPQPGPDMEQPQHGGAKDSAAGGQADPPGPPLLSKPGDEDDAPPKMGEPAGGRYEHPGL
150 160 170 180 190 200
120 130 140 150 160
pF1KE2 NN-NLTEPP---GGGGGGSSDGVGAPPHSAAAALPPPAYGFGQPYGRSPSAVAAAAAAVF
. . .:: ::::: . ..: . . :: : : : :: :. :
CCDS52 GALGTQQPPVAVPGGGGGPA---AVPEFNNYYGSAAPASG-G-PGGR--------AGPCF
210 220 230 240 250
170 180 190 200 210 220
pF1KE2 HQQHGGQQSPGLAALQSGGGG--GLEPYAGPQQNSHDHGFPNHQYNSYYPNRS---AYPP
.:::::::::.. ..:.... : : ::::. :.:: : : .::. : :
CCDS52 -DQHGGQQSPGMGMMHSASAAAAGAPGSMDPLQNSHE-GYPNSQCN-HYPGYSRPGAGGG
260 270 280 290 300
230 240 250 260
pF1KE2 PAPAYALSSPRGGTPGSGAAAAAGSKPPPSSSASASSSS--------------------S
. . . .. :: :.:.:.:.:. ..:.:.... :
CCDS52 GGGGGGGGGGSGGGGGGGGAGAGGAGAGAVAAAAAAAAAAAGGGGGGGYGGSSAGYGVLS
310 320 330 340 350 360
270 280 290 300
pF1KE2 SFAQQRFGAM---GGGGP---------SAAGG-----GTPQ-PT-ATPTLNQLLTSPSSA
: :: : : :::: ::::: : : :. :::::::::::::
CCDS52 SPRQQGGGMMMGPGGGGAASLSKAAAGSAAGGFQRFAGQNQHPSGATPTLNQLLTSPSPM
370 380 390 400 410 420
310 320 330 340
pF1KE2 -RGYQG-YPGGDYSGG---------PQDGGAGKGPADMASQCWGA-------AAAAAAAA
:.: : :: .::. ::. .:. : : ..: .. .: :::.
CCDS52 MRSYGGSYP--EYSSPSAPPPPPSQPQSQAAAAGAAAGGQQAAAGMGLGKDMGAQYAAAS
430 440 450 460 470 480
350 360 370 380 390 400
pF1KE2 AASGGAQQRSHHAPMSPGSSGGGGQPLARTPQPSSPMDQMGKMRPQPYG-GTNPYSQQQG
: ..:::::: : ::::. : : : .:::: : ::: :: :.::.::
CCDS52 PAWAAAQQRSHPA-MSPGTPG----PTMGRSQ-GSPMDPMVMKRPQLYGMGSNPHSQ---
490 500 510 520 530
410 420 430 440 450 460
pF1KE2 PPSGPQQGHGYPGQPYGSQTPQRYPMTMQGRAQSAMGGLSYTQQ-IPP-YGQQGPSGYGQ
:::. ::: :: :::::. .:::. .::.:..: :: .:: ::::: ::: :
CCDS52 ----PQQSSPYPGGSYGPPGPQRYPIGIQGRTPGAMAGMQYPQQQMPPQYGQQGVSGYCQ
540 550 560 570 580 590
470 480 490 500 510 520
pF1KE2 QGQTPYYNQQSPHPQQQQPPYSQQPPSQTPHAQPSYQQQPQSQPPQLQSSQPPYSQQPSQ
::: ::: :::: :::.: :: .:
CCDS52 QGQQPYY--------------SQQP-----------------QPPHL----PPQAQY---
600 610
530 540 550 560 570 580
pF1KE2 PPHQQSPAPYPSQQSTTQQHPQSQPPYSQPQAQSPYQQQQPQQPAPSTLSQQAAYPQPQS
:::
CCDS52 ---------LPSQ-----------------------------------------------
620
590 600 610 620 630 640
pF1KE2 QQSQQTAYSQQRFPPPQELSQDSFGSQASSAPSMTSSKGGQEDMNLSLQSRPSSLPDLSG
::::. : :..::...:.. : : .. .: ..::.:: : ::::::::::
CCDS52 --------SQQRYQPQQDMSQEGYGTR--SQPPLAPGKPNHEDLNLIQQERPSSLPDLSG
630 640 650 660 670
650 660 670 680 690 700
pF1KE2 SIDDLPMGTEGALSPGVSTSGISSSQGEQSNPAQSPFSPHTSPHLPGIRG-PSPSPVGSP
:::::: :::..:: .::.:: .::::.::::::::::::.:::: .: : :::::::::
CCDS52 SIDDLPTGTEATLSSAVSASGSTSSQGDQSNPAQSPFSPHASPHLSSIPGGPSPSPVGSP
680 690 700 710 720 730
710 720 730 740 750 760
pF1KE2 ASVAQSRSGPLSPAAVPGNQMPPRPPSGQSDSIMHPSMNQSSIAQDRGYM---QRNPQMP
.. ::::::.:::..::.::::.::..::.: ::...:: . :.::.: ::::::
CCDS52 VGSNQSRSGPISPASIPGSQMPPQPPGSQSESSSHPALSQSPMPQERGFMAGTQRNPQMA
740 750 760 770 780 790
770 780 790 800 810
pF1KE2 QYSSPQPGSALSPRQPSGGQIHTGMGSYQQ-NSMGSYGPQGGQYGPQGGYPRQPNYNALP
::. : : ..::. :::.:.:..:.:: :: :.:::: .::::::.: : : :...:
CCDS52 QYGPQQTGPSMSPHPSPGGQMHAGISSFQQSNSSGTYGPQMSQYGPQGNYSRPPAYSGVP
800 810 820 830 840 850
820 830 840 850 860 870
pF1KE2 NANYPSAGMAGGINPMGAGGQMHGQPGIPPYGTLPPGRMSHASMGNRPYGPNMANMPP--
.:.: . : . ::. :..::::: : :..: ::: :.: :::. ::..: :
CCDS52 SASYSGPGPGMGIS---ANNQMHGQGPSQPCGAVPLGRMPSAGMQNRPFPGNMSSMTPSS
860 870 880 890 900
880 890 900 910 920 930
pF1KE2 -----QVGSGMCPPPGGMNRKTQETAVA-MHVAANSIQNRPPGYPNMNQGGMMGTGPPYG
: : :: :: .:::.::.:.: :..:::: :.: ..:.:::.:.:... ::.
CCDS52 PGMSQQGGPGMGPPMPTVNRKAQEAAAAVMQAAANSAQSRQGSFPGMNQSGLMASSSPYS
910 920 930 940 950 960
940 950 960 970 980 990
pF1KE2 QGINSMAGMINPQGPPYSMGGTMANNSAGMAASPEMMGLGDVKLTPATKMNNKADGTPKT
: .:. ....: :.:::::. .:.:.::. .. .::. :. :: : ..: .:::.
CCDS52 QPMNNSSSLMNTQAPPYSMAPAMVNSSAASVGLADMMSPGESKLPLPLKADGKEEGTPQP
970 980 990 1000 1010 1020
1000 1010 1020 1030 1040 1050
pF1KE2 ESKSKKSSSSTTTNEKITKLYELGGEPERKMWVDRYLAFTEEKAMGMTNLPAVGRKPLDL
:::::::::::::.:::::.::::.:::::.::::::.: ::.. ...:::::.:::::
CCDS52 ESKSKKSSSSTTTGEKITKVYELGNEPERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDL
1030 1040 1050 1060 1070 1080
1060 1070 1080 1090 1100 1110
pF1KE2 YRLYVSVKEIGGLTQVNKNKKWRELATNLNVGTSSSAASSLKKQYIQCLYAFECKIERGE
.:::: :::::::.::::::::::::::::::::::::::::::::: :.::::::::::
CCDS52 FRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAASSLKKQYIQYLFAFECKIERGE
1090 1100 1110 1120 1130 1140
1120 1130 1140 1150 1160
pF1KE2 DPPPDIFAAADSKKSQPKIQPPSPAGSGSMQGPQTPQST-SSSMAE-GGDLKPPTPASTP
.:::..:...:.:: :::.::::::.:::.::::::::: :.:::: ::::::::::::
CCDS52 EPPPEVFSTGDTKK-QPKLQPPSPANSGSLQGPQTPQSTGSNSMAEVPGDLKPPTPASTP
1150 1160 1170 1180 1190 1200
1170 1180 1190 1200 1210 1220
pF1KE2 HSQIPPLPGMSRSNSVGIQDAFNDGSDSTFQKRNSMTPNPGYQPSMNTSDMMGRMSYEPN
:.:. :. : .::......: :.: :::.: :::::::: :: .:. :.:::: ::::
CCDS52 HGQMTPMQG-GRSSTISVHDPFSDVSDSSFPKRNSMTPNAPYQQGMSMPDVMGRMPYEPN
1210 1220 1230 1240 1250 1260
1230 1240 1250 1260 1270 1280
pF1KE2 KDPYGSMRKAPGS-DPFMSSGQGPNGGMGDPYSRAAGPGLGNVAMGPRQHYPYGGPYDRV
:::.:.:::.::: .:::..:: ::..: : :... . ...:..:: ::..:::. :::
CCDS52 KDPFGGMRKVPGSSEPFMTQGQMPNSSMQDMYNQSPSGAMSNLGMGQRQQFPYGASYDR-
1270 1280 1290 1300 1310 1320
1290 1300 1310 1320 1330 1340
pF1KE2 RTEPGIGPEGNMSTGAPQPNLMPSNPDSGMYSPSRYPPQQQQQQQQRHDSYGNQFSTQGT
::. ::.:. ::
CCDS52 ----------------------------------------------RHEPYGQQYPGQGP
1330
1350 1360 1370 1380 1390 1400
pF1KE2 PSGSP-FPSQQTTMYQQQQQNYKRPMDGTYGPPAKRHEGEMYSVPYSTGQGQPQQQQLPP
:::.: . ..: .: :: :::: ::: ::::::::::.::.. ::
CCDS52 PSGQPPYGGHQPGLYPQQP-NYKRHMDGMYGPPAKRHEGDMYNMQYS-------------
1340 1350 1360 1370 1380
1410 1420 1430 1440 1450 1460
pF1KE2 AQPQPASQQQAAQPSPQQDVYNQYGNAYPATATAATERRPAGGPQNQFPFQFGRDRVSAP
: ::..:::::..: .. .::: :.:.:. ..:.:...:
CCDS52 --------------SQQQEMYNQYGGSY-----SGPDRRPI---QGQYPYPYSRERMQGP
1390 1400 1410 1420
1470 1480 1490 1500 1510 1520
pF1KE2 PGTNAQQNMPPQMMGGPIQASAEVAQQGTMWQGRNDMTYNYANRQSTGSAPQGPAYHGVN
: ...::::::::.:.:. . : .:: .:::: : : :::. :. :.: : :.:
CCDS52 -GQIQTHGIPPQMMGGPLQSSSSEGPQQNMWAARNDMPYPYQNRQGPGGPTQAPPYPGMN
1430 1440 1450 1460 1470 1480
1530 1540 1550 1560 1570 1580
pF1KE2 RTDEMLHTDQRANHEGSWPSH-GTRQPPYGPSAPVPPMTRPPPSNYQPPPSMQNHIPQVS
:::.:. ::: :::..:::: . ::: .. :: . :.:::: .:: :::. ::: ..
CCDS52 RTDDMMVPDQRINHESQWPSHVSQRQPYMSSSASMQPITRPPQPSYQTPPSLPNHISRAP
1490 1500 1510 1520 1530 1540
1590 1600 1610 1620 1630 1640
pF1KE2 SPAPLPRPMENRTSPSKSPFLHSGMKMQKAGPPVPASHIAPAPVQPPMIRRDITFPPGSV
::: . : .::: :::::::: : :::::. : ::.:... : ::: :::.::::::::
CCDS52 SPASFQRSLENRMSPSKSPFLPS-MKMQKVMPTVPTSQVTGPPPQPPPIRREITFPPGSV
1550 1560 1570 1580 1590 1600
1650 1660 1670 1680 1690 1700
pF1KE2 EATQPVLKQRRRLTMKDIGTPEAWRVMMSLKSGLLAESTWALDTINILLYDDNSIMTFNL
::.::::::::..: ::: :::::::::::::::::::::::::::::::::... ::::
CCDS52 EASQPVLKQRRKITSKDIVTPEAWRVMMSLKSGLLAESTWALDTINILLYDDSTVATFNL
1610 1620 1630 1640 1650 1660
1710 1720 1730 1740 1750 1760
pF1KE2 SQLPGLLELLVEYFRRCLIEIFGILKEYEVGDPGQRTL-LDPGRFSKVSSPAPMEGGEEE
::: :.:::::::::.:::.::::: :::::::.:..: . .: . .: : : :::
CCDS52 SQLSGFLELLVEYFRKCLIDIFGILMEYEVGDPSQKALDHNAARKDDSQSLADDSGKEEE
1670 1680 1690 1700 1710 1720
1770 1780 1790 1800 1810
pF1KE2 E-ELLGPKLEEEEEEEV----VENDEE--IAFSGKDKPASENSEEKLISKFDKLPVKIVQ
. : . :.::.:: .:.::. ::... : :. . . : :::::::.:::.
CCDS52 DAECIDDDEEDEEDEEEDSEKTESDEKSSIALTAPDAAADPKEKPKQASKFDKLPIKIVK
1730 1740 1750 1760 1770 1780
1820 1830 1840 1850 1860 1870
pF1KE2 KNDPFVVDCSDKLGRVQEFDSGLLHWRIGGGDTTEHIQTHFESKTELLPSR-PHAPCPPA
::. :::: ::::::::::.::::::..:::::::::::::::: :. : : : : :
CCDS52 KNNLFVVDRSDKLGRVQEFNSGLLHWQLGGGDTTEHIQTHFESKMEIPPRRRPPPPLSSA
1790 1800 1810 1820 1830 1840
1880 1890 1900 1910 1920 1930
pF1KE2 PRKHVTTAEGTPGTTDQEGPPPDGPPEKRITATMDDMLSTRSSTLTEDGAKSSEAIKESS
::. :: . .:. :: : ::.::.::.: ..: ::. . .. :::
CCDS52 GRKK--EQEGKGDSEEQQ--------EKSIIATIDDVLSARPGALPEDANPGPQT--ESS
1850 1860 1870 1880
1940 1950 1960 1970 1980 1990
pF1KE2 KFPFGISPAQSHRNIKILEDEPHSKDETPLCTLLDWQDSLAKRCVCVSNTIRSLSFVPGN
::::::. :.::::::.:::::.:.:::::::. :::::::::.:::: .:::::::::
CCDS52 KFPFGIQQAKSHRNIKLLEDEPRSRDETPLCTIAHWQDSLAKRCICVSNIVRSLSFVPGN
1890 1900 1910 1920 1930 1940
2000 2010 2020 2030 2040 2050
pF1KE2 DFEMSKHPGLLLILGKLILLHHKHPERKQAPLTYEKEEEQDQGVSCNKVEWWWDCLEMLR
: ::::::::.:::::::::::.:::::.:: ::::::..:.::.:.: ::::::::.::
CCDS52 DAEMSKHPGLVLILGKLILLHHEHPERKRAPQTYEKEEDEDKGVACSKDEWWWDCLEVLR
1950 1960 1970 1980 1990 2000
2060 2070 2080 2090 2100 2110
pF1KE2 ENTLVTLANISGQLDLSPYPESICLPVLDGLLHWAVCPSAEAQDPFSTLGPNAVLSPQRL
.:::::::::::::::: : ::::::.::::::: ::::::::::: :.:::.:::::::
CCDS52 DNTLVTLANISGQLDLSAYTESICLPILDGLLHWMVCPSAEAQDPFPTVGPNSVLSPQRL
2010 2020 2030 2040 2050 2060
2120 2130 2140 2150 2160 2170
pF1KE2 VLETLSKLSIQDNNVDLILATPPFSRLEKLYSTMVRFLSDRKNPVCREMAVVLLANLAQG
::::: :::::::::::::::::::: ::.:.:.::...::::::::::...::.:::::
CCDS52 VLETLCKLSIQDNNVDLILATPPFSRQEKFYATLVRYVGDRKNPVCREMSMALLSNLAQG
2070 2080 2090 2100 2110 2120
2180 2190 2200 2210 2220 2230
pF1KE2 DSLAARAIAVQKGSIGNLLGFLEDSLAATQFQQSQASLLHMQNPPFEPTSVDMMRRAARA
:.::::::::::::::::..::::... .:.:::: .:.::: ::.:: ::::: :::.:
CCDS52 DALAARAIAVQKGSIGNLISFLEDGVTMAQYQQSQHNLMHMQPPPLEPPSVDMMCRAAKA
2130 2140 2150 2160 2170 2180
2240 2250 2260 2270 2280
pF1KE2 LLALAKVDENHSEFTLYESRLLDISVSPLMNSLVSQVICDVLFLIGQS
:::.:.::::.::: :.:.::::::.: ..::::..::::::: :::
CCDS52 LLAMARVDENRSEFLLHEGRLLDISISAVLNSLVASVICDVLFQIGQL
2190 2200 2210 2220 2230
>>CCDS55072.1 ARID1B gene_id:57492|Hs108|chr6 (2249 aa)
initn: 5031 init1: 1919 opt: 3736 Z-score: 1393.3 bits: 271.7 E(32554): 2.8e-71
Smith-Waterman score: 7221; 52.4% identity (70.3% similar) in 2355 aa overlap (25-2284:116-2248)
10 20 30 40 50
pF1KE2 MAAQVAPAAASSLGNPPPPPPSELKKAEQQQREEAGGEAAAAAAAERGEMKAAA
.. .:::... . . . . :..
CCDS55 HHHHAHHHHHHAHHLHHHHALQQQLNQFQQQQQQQQQQQQQQQQQQHPISNNNSLGGAGG
90 100 110 120 130 140
60 70 80 90 100 110
pF1KE2 GQESEGPAVGPPQPLG-KELQDGAESNGGGGGGGAGSGGGPGAEPDLKNSNGNAGPRPAL
: . :: . :: : :. :.... : . : : : . . :. .:.:
CCDS55 GAPQPGPDMEQPQHGGAKDSAAGGQADPPGPPLLSKPGDEDDAPPKMGEPAGGRYEHPGL
150 160 170 180 190 200
120 130 140 150 160
pF1KE2 NN-NLTEPP---GGGGGGSSDGVGAPPHSAAAALPPPAYGFGQPYGRSPSAVAAAAAAVF
. . .:: ::::: . ..: . . :: : : : :: :. :
CCDS55 GALGTQQPPVAVPGGGGGPA---AVPEFNNYYGSAAPASG-G-PGGR--------AGPCF
210 220 230 240 250
170 180 190 200 210 220
pF1KE2 HQQHGGQQSPGLAALQSGGGG--GLEPYAGPQQNSHDHGFPNHQYNSYYPNRS---AYPP
.:::::::::.. ..:.... : : ::::. :.:: : : .::. : :
CCDS55 -DQHGGQQSPGMGMMHSASAAAAGAPGSMDPLQNSHE-GYPNSQCN-HYPGYSRPGAGGG
260 270 280 290 300
230 240 250 260
pF1KE2 PAPAYALSSPRGGTPGSGAAAAAGSKPPPSSSASASSSS--------------------S
. . . .. :: :.:.:.:.:. ..:.:.... :
CCDS55 GGGGGGGGGGSGGGGGGGGAGAGGAGAGAVAAAAAAAAAAAGGGGGGGYGGSSAGYGVLS
310 320 330 340 350 360
270 280 290 300
pF1KE2 SFAQQRFGAM---GGGGP---------SAAGG-----GTPQ-PT-ATPTLNQLLTSPSSA
: :: : : :::: ::::: : : :. :::::::::::::
CCDS55 SPRQQGGGMMMGPGGGGAASLSKAAAGSAAGGFQRFAGQNQHPSGATPTLNQLLTSPSPM
370 380 390 400 410 420
310 320 330 340
pF1KE2 -RGYQG-YPGGDYSGG---------PQDGGAGKGPADMASQCWGA-------AAAAAAAA
:.: : :: .::. ::. .:. : : ..: .. .: :::.
CCDS55 MRSYGGSYP--EYSSPSAPPPPPSQPQSQAAAAGAAAGGQQAAAGMGLGKDMGAQYAAAS
430 440 450 460 470 480
350 360 370 380 390 400
pF1KE2 AASGGAQQRSHHAPMSPGSSGGGGQPLARTPQPSSPMDQMGKMRPQPYG-GTNPYSQQQG
: ..:::::: : ::::. : : : .:::: : ::: :: :.::.::
CCDS55 PAWAAAQQRSHPA-MSPGTPG----PTMGRSQ-GSPMDPMVMKRPQLYGMGSNPHSQ---
490 500 510 520 530
410 420 430 440 450 460
pF1KE2 PPSGPQQGHGYPGQPYGSQTPQRYPMTMQGRAQSAMGGLSYTQQIPPYGQQGPSGYGQQG
:::. ::: :: :::::. .:::. .::.:..: : :: :: .
CCDS55 ----PQQSSPYPGGSYGPPGPQRYPIGIQGRTPGAMAGMQY----P---QQQDSGDATWK
540 550 560 570 580
470 480 490 500 510 520
pF1KE2 QTPYYNQQSPHPQQQQPPYSQQPPSQTPHAQPSYQQQPQSQPPQLQSSQPPYSQQPSQPP
.: . : : :.:: : .: :: : :: ::::: :::
CCDS55 ETFWL-----MP----PQYGQQGVS-------GYCQQGQ---------QPYYSQQP-QPP
590 600 610 620
530 540 550 560 570 580
pF1KE2 HQQSPAPYPSQQSTTQQHPQSQPPYSQPQAQSPYQQQQPQQPAPSTLSQQAAYPQPQSQQ
: : ::: : :.:
CCDS55 H------LP------------------PQA-----QYLPSQ-------------------
630
590 600 610 620 630 640
pF1KE2 SQQTAYSQQRFPPPQELSQDSFGSQASSAPSMTSSKGGQEDMNLSLQSRPSSLPDLSGSI
::::. : :..::...:.. : : .. .: ..::.:: : ::::::::::::
CCDS55 ------SQQRYQPQQDMSQEGYGTR--SQPPLAPGKPNHEDLNLIQQERPSSLPDLSGSI
640 650 660 670 680
650 660 670 680 690 700
pF1KE2 DDLPMGTEGALSPGVSTSGISSSQGEQSNPAQSPFSPHTSPHLPGIRG-PSPSPVGSPAS
:::: :::..:: .::.:: .::::.::::::::::::.:::: .: : :::::::::..
CCDS55 DDLPTGTEATLSSAVSASGSTSSQGDQSNPAQSPFSPHASPHLSSIPGGPSPSPVGSPVG
690 700 710 720 730 740
710 720 730 740 750 760
pF1KE2 VAQSRSGPLSPAAVPGNQMPPRPPSGQSDSIMHPSMNQSSIAQDRGYM---QRNPQMPQY
::::::.:::..::.::::.::..::.: ::...:: . :.::.: :::::: ::
CCDS55 SNQSRSGPISPASIPGSQMPPQPPGSQSESSSHPALSQSPMPQERGFMAGTQRNPQMAQY
750 760 770 780 790 800
770 780 790 800 810 820
pF1KE2 SSPQPGSALSPRQPSGGQIHTGMGSYQQ-NSMGSYGPQGGQYGPQGGYPRQPNYNALPNA
. : : ..::. :::.:.:..:.:: :: :.:::: .::::::.: : : :...:.:
CCDS55 GPQQTGPSMSPHPSPGGQMHAGISSFQQSNSSGTYGPQMSQYGPQGNYSRPPAYSGVPSA
810 820 830 840 850 860
830 840 850 860 870
pF1KE2 NYPSAGMAGGINPMGAGGQMHGQPGIPPYGTLPPGRMSHASMGNRPYGPNMANMPP----
.: . : . ::. :..::::: : :..: ::: :.: :::. ::..: :
CCDS55 SYSGPGPGMGIS---ANNQMHGQGPSQPCGAVPLGRMPSAGMQNRPFPGNMSSMTPSSPG
870 880 890 900 910 920
880 890 900 910 920 930
pF1KE2 ---QVGSGMCPPPGGMNRKTQETAVA-MHVAANSIQNRPPGYPNMNQGGMMGTGPPYGQG
: : :: :: .:::.::.:.: :..:::: :.: ..:.:::.:.:... ::.:
CCDS55 MSQQGGPGMGPPMPTVNRKAQEAAAAVMQAAANSAQSRQGSFPGMNQSGLMASSSPYSQP
930 940 950 960 970 980
940 950 960 970 980 990
pF1KE2 INSMAGMINPQGPPYSMGGTMANNSAGMAASPEMMGLGDVKLTPATKMNNKADGTPKTES
.:. ....: :.:::::. .:.:.::. .. .::. :. :: : ..: .:::. ::
CCDS55 MNNSSSLMNTQAPPYSMAPAMVNSSAASVGLADMMSPGESKLPLPLKADGKEEGTPQPES
990 1000 1010 1020 1030 1040
1000 1010 1020 1030 1040 1050
pF1KE2 KSKKSSSSTTTNEKITKLYELGGEPERKMWVDRYLAFTEEKAMGMTNLPAVGRKPLDLYR
:::::::::::.:::::.::::.:::::.::::::.: ::.. ...:::::.:::::.:
CCDS55 KSKKSSSSTTTGEKITKVYELGNEPERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFR
1050 1060 1070 1080 1090 1100
1060 1070 1080 1090 1100 1110
pF1KE2 LYVSVKEIGGLTQVNKNKKWRELATNLNVGTSSSAASSLKKQYIQCLYAFECKIERGEDP
::: :::::::.::::::::::::::::::::::::::::::::: :.::::::::::.:
CCDS55 LYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAASSLKKQYIQYLFAFECKIERGEEP
1110 1120 1130 1140 1150 1160
1120 1130 1140 1150 1160 1170
pF1KE2 PPDIFAAADSKKSQPKIQPPSPAGSGSMQGPQTPQST-SSSMAE-GGDLKPPTPASTPHS
::..:...:.:: :::.::::::.:::.::::::::: :.:::: :::::::::::::.
CCDS55 PPEVFSTGDTKK-QPKLQPPSPANSGSLQGPQTPQSTGSNSMAEVPGDLKPPTPASTPHG
1170 1180 1190 1200 1210 1220
1180 1190 1200 1210 1220 1230
pF1KE2 QIPPLPGMSRSNSVGIQDAFNDGSDSTFQKRNSMTPNPGYQPSMNTSDMMGRMSYEPNKD
:. :. : .::......: :.: :::.: :::::::: :: .:. :.:::: ::::::
CCDS55 QMTPMQG-GRSSTISVHDPFSDVSDSSFPKRNSMTPNAPYQQGMSMPDVMGRMPYEPNKD
1230 1240 1250 1260 1270 1280
1240 1250 1260 1270 1280 1290
pF1KE2 PYGSMRKAPGS-DPFMSSGQGPNGGMGDPYSRAAGPGLGNVAMGPRQHYPYGGPYDRVRT
:.:.:::.::: .:::..:: ::..: : :... . ...:..:: ::..:::. :::
CCDS55 PFGGMRKVPGSSEPFMTQGQMPNSSMQDMYNQSPSGAMSNLGMGQRQQFPYGASYDR---
1290 1300 1310 1320 1330
1300 1310 1320 1330 1340 1350
pF1KE2 EPGIGPEGNMSTGAPQPNLMPSNPDSGMYSPSRYPPQQQQQQQQRHDSYGNQFSTQGTPS
::. ::.:. :: ::
CCDS55 --------------------------------------------RHEPYGQQYPGQGPPS
1340 1350
1360 1370 1380 1390 1400
pF1KE2 GSP-FPSQQTTMYQQQQQNYKRPMDGTYGPPAKRHEGEMYSVPYSTGQGQPQQQQLPPAQ
:.: . ..: .: :: :::: ::: ::::::::::.::.. ::
CCDS55 GQPPYGGHQPGLYPQQP-NYKRHMDGMYGPPAKRHEGDMYNMQYS---------------
1360 1370 1380 1390
1410 1420 1430 1440 1450 1460
pF1KE2 PQPASQQQAAQPSPQQDVYNQYGNAYPATATAATERRPAGGPQNQFPFQFGRDRVSAPPG
: ::..:::::..: .. .::: :.:.:. ..:.:...: :
CCDS55 ------------SQQQEMYNQYGGSY-----SGPDRRPI---QGQYPYPYSRERMQGP-G
1400 1410 1420 1430
1470 1480 1490 1500 1510 1520
pF1KE2 TNAQQNMPPQMMGGPIQASAEVAQQGTMWQGRNDMTYNYANRQSTGSAPQGPAYHGVNRT
...::::::::.:.:. . : .:: .:::: : : :::. :. :.: : :.:::
CCDS55 QIQTHGIPPQMMGGPLQSSSSEGPQQNMWAARNDMPYPYQNRQGPGGPTQAPPYPGMNRT
1440 1450 1460 1470 1480 1490
1530 1540 1550 1560 1570 1580
pF1KE2 DEMLHTDQRANHEGSWPSH-GTRQPPYGPSAPVPPMTRPPPSNYQPPPSMQNHIPQVSSP
:.:. ::: :::..:::: . ::: .. :: . :.:::: .:: :::. ::: .. ::
CCDS55 DDMMVPDQRINHESQWPSHVSQRQPYMSSSASMQPITRPPQPSYQTPPSLPNHISRAPSP
1500 1510 1520 1530 1540 1550
1590 1600 1610 1620 1630 1640
pF1KE2 APLPRPMENRTSPSKSPFLHSGMKMQKAGPPVPASHIAPAPVQPPMIRRDITFPPGSVEA
: . : .::: :::::::: : :::::. : ::.:... : ::: :::.::::::::::
CCDS55 ASFQRSLENRMSPSKSPFLPS-MKMQKVMPTVPTSQVTGPPPQPPPIRREITFPPGSVEA
1560 1570 1580 1590 1600 1610
1650 1660 1670 1680 1690 1700
pF1KE2 TQPVLKQRRRLTMKDIGTPEAWRVMMSLKSGLLAESTWALDTINILLYDDNSIMTFNLSQ
.::::::::..: ::: :::::::::::::::::::::::::::::::::... ::::::
CCDS55 SQPVLKQRRKITSKDIVTPEAWRVMMSLKSGLLAESTWALDTINILLYDDSTVATFNLSQ
1620 1630 1640 1650 1660 1670
1710 1720 1730 1740 1750 1760
pF1KE2 LPGLLELLVEYFRRCLIEIFGILKEYEVGDPGQRTL-LDPGRFSKVSSPAPMEGGEEEE-
: :.:::::::::.:::.::::: :::::::.:..: . .: . .: : : :::.
CCDS55 LSGFLELLVEYFRKCLIDIFGILMEYEVGDPSQKALDHNAARKDDSQSLADDSGKEEEDA
1680 1690 1700 1710 1720 1730
1770 1780 1790 1800 1810 1820
pF1KE2 ELLGPKLEEEEEEEV----VENDEE--IAFSGKDKPASENSEEKLISKFDKLPVKIVQKN
: . :.::.:: .:.::. ::... : :. . . : :::::::.:::.::
CCDS55 ECIDDDEEDEEDEEEDSEKTESDEKSSIALTAPDAAADPKEKPKQASKFDKLPIKIVKKN
1740 1750 1760 1770 1780 1790
1830 1840 1850 1860 1870
pF1KE2 DPFVVDCSDKLGRVQEFDSGLLHWRIGGGDTTEHIQTHFESKTELLPSR-PHAPCPPAPR
. :::: ::::::::::.::::::..:::::::::::::::: :. : : : : : :
CCDS55 NLFVVDRSDKLGRVQEFNSGLLHWQLGGGDTTEHIQTHFESKMEIPPRRRPPPPLSSAGR
1800 1810 1820 1830 1840 1850
1880 1890 1900 1910 1920 1930
pF1KE2 KHVTTAEGTPGTTDQEGPPPDGPPEKRITATMDDMLSTRSSTLTEDGAKSSEAIKESSKF
:. :: . .:. :: : ::.::.::.: ..: ::. . .. :::::
CCDS55 KK--EQEGKGDSEEQQ--------EKSIIATIDDVLSARPGALPEDANPGPQT--ESSKF
1860 1870 1880 1890 1900
1940 1950 1960 1970 1980 1990
pF1KE2 PFGISPAQSHRNIKILEDEPHSKDETPLCTLLDWQDSLAKRCVCVSNTIRSLSFVPGNDF
::::. :.::::::.:::::.:.:::::::. :::::::::.:::: .::::::::::
CCDS55 PFGIQQAKSHRNIKLLEDEPRSRDETPLCTIAHWQDSLAKRCICVSNIVRSLSFVPGNDA
1910 1920 1930 1940 1950 1960
2000 2010 2020 2030 2040 2050
pF1KE2 EMSKHPGLLLILGKLILLHHKHPERKQAPLTYEKEEEQDQGVSCNKVEWWWDCLEMLREN
::::::::.:::::::::::.:::::.:: ::::::..:.::.:.: ::::::::.::.:
CCDS55 EMSKHPGLVLILGKLILLHHEHPERKRAPQTYEKEEDEDKGVACSKDEWWWDCLEVLRDN
1970 1980 1990 2000 2010 2020
2060 2070 2080 2090 2100 2110
pF1KE2 TLVTLANISGQLDLSPYPESICLPVLDGLLHWAVCPSAEAQDPFSTLGPNAVLSPQRLVL
::::::::::::::: : ::::::.::::::: ::::::::::: :.:::.:::::::::
CCDS55 TLVTLANISGQLDLSAYTESICLPILDGLLHWMVCPSAEAQDPFPTVGPNSVLSPQRLVL
2030 2040 2050 2060 2070 2080
2120 2130 2140 2150 2160 2170
pF1KE2 ETLSKLSIQDNNVDLILATPPFSRLEKLYSTMVRFLSDRKNPVCREMAVVLLANLAQGDS
::: :::::::::::::::::::: ::.:.:.::...::::::::::...::.::::::.
CCDS55 ETLCKLSIQDNNVDLILATPPFSRQEKFYATLVRYVGDRKNPVCREMSMALLSNLAQGDA
2090 2100 2110 2120 2130 2140
2180 2190 2200 2210 2220 2230
pF1KE2 LAARAIAVQKGSIGNLLGFLEDSLAATQFQQSQASLLHMQNPPFEPTSVDMMRRAARALL
::::::::::::::::..::::... .:.:::: .:.::: ::.:: ::::: :::.:::
CCDS55 LAARAIAVQKGSIGNLISFLEDGVTMAQYQQSQHNLMHMQPPPLEPPSVDMMCRAAKALL
2150 2160 2170 2180 2190 2200
2240 2250 2260 2270 2280
pF1KE2 ALAKVDENHSEFTLYESRLLDISVSPLMNSLVSQVICDVLFLIGQS
:.:.::::.::: :.:.::::::.: ..::::..::::::: :::
CCDS55 AMARVDENRSEFLLHEGRLLDISISAVLNSLVASVICDVLFQIGQL
2210 2220 2230 2240
2285 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Feb 10 14:58:44 2017 done: Fri Feb 10 14:58:46 2017
Total Scan time: 9.190 Total Display time: 0.920
Function used was FASTA [36.3.4 Apr, 2011]