FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KA0777, 1100 aa
1>>>pF1KA0777 1100 - 1100 aa - 1100 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 10.3044+/-0.000997; mu= 0.0386+/- 0.060
mean_var=289.7462+/-60.805, 0's: 0 Z-trim(113.6): 78 B-trim: 4 in 1/52
Lambda= 0.075347
statistics sampled from 14186 (14250) to 14186 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.742), E-opt: 0.2 (0.438), width: 16
Scan time: 5.710
The best scores are: opt bits E(32554)
CCDS3845.1 SORBS2 gene_id:8470|Hs108|chr4 (1100) 7594 839.9 0
CCDS59482.1 SORBS2 gene_id:8470|Hs108|chr4 (1200) 7594 839.9 0
CCDS47176.1 SORBS2 gene_id:8470|Hs108|chr4 (1004) 6750 748.1 2.3e-215
CCDS54825.1 SORBS2 gene_id:8470|Hs108|chr4 ( 661) 1867 217.2 1e-55
CCDS47174.1 SORBS2 gene_id:8470|Hs108|chr4 ( 731) 1848 215.1 4.5e-55
CCDS43289.2 SORBS2 gene_id:8470|Hs108|chr4 ( 666) 1847 215.0 4.5e-55
CCDS47175.1 SORBS2 gene_id:8470|Hs108|chr4 ( 824) 1848 215.2 5e-55
CCDS47173.1 SORBS2 gene_id:8470|Hs108|chr4 ( 644) 1844 214.7 5.6e-55
CCDS31252.1 SORBS1 gene_id:10580|Hs108|chr10 ( 684) 1006 123.6 1.5e-27
CCDS7442.1 SORBS1 gene_id:10580|Hs108|chr10 ( 816) 1006 123.6 1.8e-27
CCDS31253.1 SORBS1 gene_id:10580|Hs108|chr10 ( 781) 976 120.4 1.6e-26
CCDS76326.1 SORBS1 gene_id:10580|Hs108|chr10 ( 811) 976 120.4 1.7e-26
CCDS76327.1 SORBS1 gene_id:10580|Hs108|chr10 (1004) 976 120.4 2e-26
CCDS31254.1 SORBS1 gene_id:10580|Hs108|chr10 (1151) 690 89.4 5e-17
CCDS31256.1 SORBS1 gene_id:10580|Hs108|chr10 ( 905) 673 87.5 1.5e-16
CCDS73169.1 SORBS1 gene_id:10580|Hs108|chr10 (1266) 663 86.5 4.1e-16
CCDS31255.1 SORBS1 gene_id:10580|Hs108|chr10 (1292) 663 86.5 4.2e-16
>>CCDS3845.1 SORBS2 gene_id:8470|Hs108|chr4 (1100 aa)
initn: 7594 init1: 7594 opt: 7594 Z-score: 4475.3 bits: 839.9 E(32554): 0
Smith-Waterman score: 7594; 100.0% identity (100.0% similar) in 1100 aa overlap (1-1100:1-1100)
10 20 30 40 50 60
pF1KA0 MSYYQRPFSPSAYSLPASLNSSIVMQHGTSLDSTDTYPQHAQSLDGTTSSSIPLYRSSEE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 MSYYQRPFSPSAYSLPASLNSSIVMQHGTSLDSTDTYPQHAQSLDGTTSSSIPLYRSSEE
10 20 30 40 50 60
70 80 90 100 110 120
pF1KA0 EKRVTVIKAPHYPGIGPVDESGIPTAIRTTVDRPKDWYKTMFKQIHMVHKPDDDTDMYNT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 EKRVTVIKAPHYPGIGPVDESGIPTAIRTTVDRPKDWYKTMFKQIHMVHKPDDDTDMYNT
70 80 90 100 110 120
130 140 150 160 170 180
pF1KA0 PYTYNAGLYNPPYSAQSHPAAKTQTYRPLSKSHSDNSPNAFKDASSPVPPPHVPPPVPPL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 PYTYNAGLYNPPYSAQSHPAAKTQTYRPLSKSHSDNSPNAFKDASSPVPPPHVPPPVPPL
130 140 150 160 170 180
190 200 210 220 230 240
pF1KA0 RPRDRSSTEKHDWDPPDRKVDTRKFRSEPRSIFEYEPGKSSILQHERPASLYQSSIDRSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 RPRDRSSTEKHDWDPPDRKVDTRKFRSEPRSIFEYEPGKSSILQHERPASLYQSSIDRSL
190 200 210 220 230 240
250 260 270 280 290 300
pF1KA0 ERPMSSASMASDFRKRRKSEPAVGPPRGLGDQSASRTSPGRVDLPGSSTTLTKSFTSSSP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 ERPMSSASMASDFRKRRKSEPAVGPPRGLGDQSASRTSPGRVDLPGSSTTLTKSFTSSSP
250 260 270 280 290 300
310 320 330 340 350 360
pF1KA0 SSPSRAKGGDDSKICPSLCSYSGLNGNPSSELDYCSTYRQHLDVPRDSPRAISFKNGWQM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 SSPSRAKGGDDSKICPSLCSYSGLNGNPSSELDYCSTYRQHLDVPRDSPRAISFKNGWQM
310 320 330 340 350 360
370 380 390 400 410 420
pF1KA0 ARQNAEIWSSTEETVSPKIKSRSCDDLLNDDCDSFPDPKVKSESMGSLLCEEDSKESCPM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 ARQNAEIWSSTEETVSPKIKSRSCDDLLNDDCDSFPDPKVKSESMGSLLCEEDSKESCPM
370 380 390 400 410 420
430 440 450 460 470 480
pF1KA0 AWGSPYVPEVRSNGRSRIRHRSARNAPGFLKMYKKMHRINRKDLMNSEVICSVKSRILQY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 AWGSPYVPEVRSNGRSRIRHRSARNAPGFLKMYKKMHRINRKDLMNSEVICSVKSRILQY
430 440 450 460 470 480
490 500 510 520 530 540
pF1KA0 ESEQQHKDLLRAWSQCSTEEVPRDMVPTRISEFEKLIQKSKSMPNLGDDMLSPVTLEPPQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 ESEQQHKDLLRAWSQCSTEEVPRDMVPTRISEFEKLIQKSKSMPNLGDDMLSPVTLEPPQ
490 500 510 520 530 540
550 560 570 580 590 600
pF1KA0 NGLCPKRRFSIEYLLEEENQSGPPARGRRGCQSNALVPIHIEVTSDEQPRAHVEFSDSDQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 NGLCPKRRFSIEYLLEEENQSGPPARGRRGCQSNALVPIHIEVTSDEQPRAHVEFSDSDQ
550 560 570 580 590 600
610 620 630 640 650 660
pF1KA0 DGVVSDHSDYIHLEGSSFCSESDFDHFSFTSSESFYGSSHHHHHHHHHHHRHLISSCKGR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 DGVVSDHSDYIHLEGSSFCSESDFDHFSFTSSESFYGSSHHHHHHHHHHHRHLISSCKGR
610 620 630 640 650 660
670 680 690 700 710 720
pF1KA0 CPASYTRFTTMLKHERARHENTEEPRRQEMDPGLSKLAFLVSPVPFRRKKNSAPKKQTEK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 CPASYTRFTTMLKHERARHENTEEPRRQEMDPGLSKLAFLVSPVPFRRKKNSAPKKQTEK
670 680 690 700 710 720
730 740 750 760 770 780
pF1KA0 AKCKASVFEALDSALKDICDQIKAEKKRGSLPDNSILHRLISELLPDVPERNSSLRALRR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 AKCKASVFEALDSALKDICDQIKAEKKRGSLPDNSILHRLISELLPDVPERNSSLRALRR
730 740 750 760 770 780
790 800 810 820 830 840
pF1KA0 SPLHQPLHPLPPDGAIHCPPYQNDCGRMPRSASFQDVDTANSSCHHQDRGGALQDRESPR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 SPLHQPLHPLPPDGAIHCPPYQNDCGRMPRSASFQDVDTANSSCHHQDRGGALQDRESPR
790 800 810 820 830 840
850 860 870 880 890 900
pF1KA0 SYSSTLTDMGRSAPRERRGTPEKEKLPAKAVYDFKAQTSKELSFKKGDTVYILRKIDQNW
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 SYSSTLTDMGRSAPRERRGTPEKEKLPAKAVYDFKAQTSKELSFKKGDTVYILRKIDQNW
850 860 870 880 890 900
910 920 930 940 950 960
pF1KA0 YEGEHHGRVGIFPISYVEKLTPPEKAQPARPPPPAQPGEIGEAIAKYNFNADTNVELSLR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 YEGEHHGRVGIFPISYVEKLTPPEKAQPARPPPPAQPGEIGEAIAKYNFNADTNVELSLR
910 920 930 940 950 960
970 980 990 1000 1010 1020
pF1KA0 KGDRVILLKRVDQNWYEGKIPGTNRQGIFPVSYVEVVKKNTKGAEDYPDPPIPHSYSSDR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 KGDRVILLKRVDQNWYEGKIPGTNRQGIFPVSYVEVVKKNTKGAEDYPDPPIPHSYSSDR
970 980 990 1000 1010 1020
1030 1040 1050 1060 1070 1080
pF1KA0 IHSLSSNKPQRPVFTHENIQGGGEPFQALYNYTPRNEDELELRESDVIDVMEKCDDGWFV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS38 IHSLSSNKPQRPVFTHENIQGGGEPFQALYNYTPRNEDELELRESDVIDVMEKCDDGWFV
1030 1040 1050 1060 1070 1080
1090 1100
pF1KA0 GTSRRTKFFGTFPGNYVKRL
::::::::::::::::::::
CCDS38 GTSRRTKFFGTFPGNYVKRL
1090 1100
>>CCDS59482.1 SORBS2 gene_id:8470|Hs108|chr4 (1200 aa)
initn: 7594 init1: 7594 opt: 7594 Z-score: 4474.8 bits: 839.9 E(32554): 0
Smith-Waterman score: 7594; 100.0% identity (100.0% similar) in 1100 aa overlap (1-1100:101-1200)
10 20 30
pF1KA0 MSYYQRPFSPSAYSLPASLNSSIVMQHGTS
::::::::::::::::::::::::::::::
CCDS59 RSVRPNLQDKRSPTQSQITVNGNSGGAVSPMSYYQRPFSPSAYSLPASLNSSIVMQHGTS
80 90 100 110 120 130
40 50 60 70 80 90
pF1KA0 LDSTDTYPQHAQSLDGTTSSSIPLYRSSEEEKRVTVIKAPHYPGIGPVDESGIPTAIRTT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 LDSTDTYPQHAQSLDGTTSSSIPLYRSSEEEKRVTVIKAPHYPGIGPVDESGIPTAIRTT
140 150 160 170 180 190
100 110 120 130 140 150
pF1KA0 VDRPKDWYKTMFKQIHMVHKPDDDTDMYNTPYTYNAGLYNPPYSAQSHPAAKTQTYRPLS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 VDRPKDWYKTMFKQIHMVHKPDDDTDMYNTPYTYNAGLYNPPYSAQSHPAAKTQTYRPLS
200 210 220 230 240 250
160 170 180 190 200 210
pF1KA0 KSHSDNSPNAFKDASSPVPPPHVPPPVPPLRPRDRSSTEKHDWDPPDRKVDTRKFRSEPR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 KSHSDNSPNAFKDASSPVPPPHVPPPVPPLRPRDRSSTEKHDWDPPDRKVDTRKFRSEPR
260 270 280 290 300 310
220 230 240 250 260 270
pF1KA0 SIFEYEPGKSSILQHERPASLYQSSIDRSLERPMSSASMASDFRKRRKSEPAVGPPRGLG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 SIFEYEPGKSSILQHERPASLYQSSIDRSLERPMSSASMASDFRKRRKSEPAVGPPRGLG
320 330 340 350 360 370
280 290 300 310 320 330
pF1KA0 DQSASRTSPGRVDLPGSSTTLTKSFTSSSPSSPSRAKGGDDSKICPSLCSYSGLNGNPSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 DQSASRTSPGRVDLPGSSTTLTKSFTSSSPSSPSRAKGGDDSKICPSLCSYSGLNGNPSS
380 390 400 410 420 430
340 350 360 370 380 390
pF1KA0 ELDYCSTYRQHLDVPRDSPRAISFKNGWQMARQNAEIWSSTEETVSPKIKSRSCDDLLND
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 ELDYCSTYRQHLDVPRDSPRAISFKNGWQMARQNAEIWSSTEETVSPKIKSRSCDDLLND
440 450 460 470 480 490
400 410 420 430 440 450
pF1KA0 DCDSFPDPKVKSESMGSLLCEEDSKESCPMAWGSPYVPEVRSNGRSRIRHRSARNAPGFL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 DCDSFPDPKVKSESMGSLLCEEDSKESCPMAWGSPYVPEVRSNGRSRIRHRSARNAPGFL
500 510 520 530 540 550
460 470 480 490 500 510
pF1KA0 KMYKKMHRINRKDLMNSEVICSVKSRILQYESEQQHKDLLRAWSQCSTEEVPRDMVPTRI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 KMYKKMHRINRKDLMNSEVICSVKSRILQYESEQQHKDLLRAWSQCSTEEVPRDMVPTRI
560 570 580 590 600 610
520 530 540 550 560 570
pF1KA0 SEFEKLIQKSKSMPNLGDDMLSPVTLEPPQNGLCPKRRFSIEYLLEEENQSGPPARGRRG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 SEFEKLIQKSKSMPNLGDDMLSPVTLEPPQNGLCPKRRFSIEYLLEEENQSGPPARGRRG
620 630 640 650 660 670
580 590 600 610 620 630
pF1KA0 CQSNALVPIHIEVTSDEQPRAHVEFSDSDQDGVVSDHSDYIHLEGSSFCSESDFDHFSFT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 CQSNALVPIHIEVTSDEQPRAHVEFSDSDQDGVVSDHSDYIHLEGSSFCSESDFDHFSFT
680 690 700 710 720 730
640 650 660 670 680 690
pF1KA0 SSESFYGSSHHHHHHHHHHHRHLISSCKGRCPASYTRFTTMLKHERARHENTEEPRRQEM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 SSESFYGSSHHHHHHHHHHHRHLISSCKGRCPASYTRFTTMLKHERARHENTEEPRRQEM
740 750 760 770 780 790
700 710 720 730 740 750
pF1KA0 DPGLSKLAFLVSPVPFRRKKNSAPKKQTEKAKCKASVFEALDSALKDICDQIKAEKKRGS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 DPGLSKLAFLVSPVPFRRKKNSAPKKQTEKAKCKASVFEALDSALKDICDQIKAEKKRGS
800 810 820 830 840 850
760 770 780 790 800 810
pF1KA0 LPDNSILHRLISELLPDVPERNSSLRALRRSPLHQPLHPLPPDGAIHCPPYQNDCGRMPR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 LPDNSILHRLISELLPDVPERNSSLRALRRSPLHQPLHPLPPDGAIHCPPYQNDCGRMPR
860 870 880 890 900 910
820 830 840 850 860 870
pF1KA0 SASFQDVDTANSSCHHQDRGGALQDRESPRSYSSTLTDMGRSAPRERRGTPEKEKLPAKA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 SASFQDVDTANSSCHHQDRGGALQDRESPRSYSSTLTDMGRSAPRERRGTPEKEKLPAKA
920 930 940 950 960 970
880 890 900 910 920 930
pF1KA0 VYDFKAQTSKELSFKKGDTVYILRKIDQNWYEGEHHGRVGIFPISYVEKLTPPEKAQPAR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 VYDFKAQTSKELSFKKGDTVYILRKIDQNWYEGEHHGRVGIFPISYVEKLTPPEKAQPAR
980 990 1000 1010 1020 1030
940 950 960 970 980 990
pF1KA0 PPPPAQPGEIGEAIAKYNFNADTNVELSLRKGDRVILLKRVDQNWYEGKIPGTNRQGIFP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 PPPPAQPGEIGEAIAKYNFNADTNVELSLRKGDRVILLKRVDQNWYEGKIPGTNRQGIFP
1040 1050 1060 1070 1080 1090
1000 1010 1020 1030 1040 1050
pF1KA0 VSYVEVVKKNTKGAEDYPDPPIPHSYSSDRIHSLSSNKPQRPVFTHENIQGGGEPFQALY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 VSYVEVVKKNTKGAEDYPDPPIPHSYSSDRIHSLSSNKPQRPVFTHENIQGGGEPFQALY
1100 1110 1120 1130 1140 1150
1060 1070 1080 1090 1100
pF1KA0 NYTPRNEDELELRESDVIDVMEKCDDGWFVGTSRRTKFFGTFPGNYVKRL
::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 NYTPRNEDELELRESDVIDVMEKCDDGWFVGTSRRTKFFGTFPGNYVKRL
1160 1170 1180 1190 1200
>>CCDS47176.1 SORBS2 gene_id:8470|Hs108|chr4 (1004 aa)
initn: 6743 init1: 6743 opt: 6750 Z-score: 3980.0 bits: 748.1 E(32554): 2.3e-215
Smith-Waterman score: 6866; 98.5% identity (98.5% similar) in 1011 aa overlap (90-1100:9-1004)
60 70 80 90 100 110
pF1KA0 EEKRVTVIKAPHYPGIGPVDESGIPTAIRTTVDRPKDWYKTMFKQIHMVHKPDDDTDMYN
::::::::::::::::::::::
CCDS47 MKATTPLQTVDRPKDWYKTMFKQIHMVHKP--------
10 20 30
120 130 140 150 160 170
pF1KA0 TPYTYNAGLYNPPYSAQSHPAAKTQTYRPLSKSHSDNSPNAFKDASSPVPPPHVPPPVPP
:::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 -------GLYNPPYSAQSHPAAKTQTYRPLSKSHSDNSPNAFKDASSPVPPPHVPPPVPP
40 50 60 70 80
180 190 200 210 220 230
pF1KA0 LRPRDRSSTEKHDWDPPDRKVDTRKFRSEPRSIFEYEPGKSSILQHERPASLYQSSIDRS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LRPRDRSSTEKHDWDPPDRKVDTRKFRSEPRSIFEYEPGKSSILQHERPASLYQSSIDRS
90 100 110 120 130 140
240 250 260 270 280 290
pF1KA0 LERPMSSASMASDFRKRRKSEPAVGPPRGLGDQSASRTSPGRVDLPGSSTTLTKSFTSSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LERPMSSASMASDFRKRRKSEPAVGPPRGLGDQSASRTSPGRVDLPGSSTTLTKSFTSSS
150 160 170 180 190 200
300 310 320 330 340 350
pF1KA0 PSSPSRAKGGDDSKICPSLCSYSGLNGNPSSELDYCSTYRQHLDVPRDSPRAISFKNGWQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 PSSPSRAKGGDDSKICPSLCSYSGLNGNPSSELDYCSTYRQHLDVPRDSPRAISFKNGWQ
210 220 230 240 250 260
360 370 380 390 400 410
pF1KA0 MARQNAEIWSSTEETVSPKIKSRSCDDLLNDDCDSFPDPKVKSESMGSLLCEEDSKESCP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 MARQNAEIWSSTEETVSPKIKSRSCDDLLNDDCDSFPDPKVKSESMGSLLCEEDSKESCP
270 280 290 300 310 320
420 430 440 450 460 470
pF1KA0 MAWGSPYVPEVRSNGRSRIRHRSARNAPGFLKMYKKMHRINRKDLMNSEVICSVKSRILQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 MAWGSPYVPEVRSNGRSRIRHRSARNAPGFLKMYKKMHRINRKDLMNSEVICSVKSRILQ
330 340 350 360 370 380
480 490 500 510 520 530
pF1KA0 YESEQQHKDLLRAWSQCSTEEVPRDMVPTRISEFEKLIQKSKSMPNLGDDMLSPVTLEPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 YESEQQHKDLLRAWSQCSTEEVPRDMVPTRISEFEKLIQKSKSMPNLGDDMLSPVTLEPP
390 400 410 420 430 440
540 550 560 570 580 590
pF1KA0 QNGLCPKRRFSIEYLLEEENQSGPPARGRRGCQSNALVPIHIEVTSDEQPRAHVEFSDSD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 QNGLCPKRRFSIEYLLEEENQSGPPARGRRGCQSNALVPIHIEVTSDEQPRAHVEFSDSD
450 460 470 480 490 500
600 610 620 630 640 650
pF1KA0 QDGVVSDHSDYIHLEGSSFCSESDFDHFSFTSSESFYGSSHHHHHHHHHHHRHLISSCKG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 QDGVVSDHSDYIHLEGSSFCSESDFDHFSFTSSESFYGSSHHHHHHHHHHHRHLISSCKG
510 520 530 540 550 560
660 670 680 690 700 710
pF1KA0 RCPASYTRFTTMLKHERARHENTEEPRRQEMDPGLSKLAFLVSPVPFRRKKNSAPKKQTE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 RCPASYTRFTTMLKHERARHENTEEPRRQEMDPGLSKLAFLVSPVPFRRKKNSAPKKQTE
570 580 590 600 610 620
720 730 740 750 760 770
pF1KA0 KAKCKASVFEALDSALKDICDQIKAEKKRGSLPDNSILHRLISELLPDVPERNSSLRALR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KAKCKASVFEALDSALKDICDQIKAEKKRGSLPDNSILHRLISELLPDVPERNSSLRALR
630 640 650 660 670 680
780 790 800 810 820 830
pF1KA0 RSPLHQPLHPLPPDGAIHCPPYQNDCGRMPRSASFQDVDTANSSCHHQDRGGALQDRESP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 RSPLHQPLHPLPPDGAIHCPPYQNDCGRMPRSASFQDVDTANSSCHHQDRGGALQDRESP
690 700 710 720 730 740
840 850 860 870 880 890
pF1KA0 RSYSSTLTDMGRSAPRERRGTPEKEKLPAKAVYDFKAQTSKELSFKKGDTVYILRKIDQN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 RSYSSTLTDMGRSAPRERRGTPEKEKLPAKAVYDFKAQTSKELSFKKGDTVYILRKIDQN
750 760 770 780 790 800
900 910 920 930 940 950
pF1KA0 WYEGEHHGRVGIFPISYVEKLTPPEKAQPARPPPPAQPGEIGEAIAKYNFNADTNVELSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 WYEGEHHGRVGIFPISYVEKLTPPEKAQPARPPPPAQPGEIGEAIAKYNFNADTNVELSL
810 820 830 840 850 860
960 970 980 990 1000 1010
pF1KA0 RKGDRVILLKRVDQNWYEGKIPGTNRQGIFPVSYVEVVKKNTKGAEDYPDPPIPHSYSSD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 RKGDRVILLKRVDQNWYEGKIPGTNRQGIFPVSYVEVVKKNTKGAEDYPDPPIPHSYSSD
870 880 890 900 910 920
1020 1030 1040 1050 1060 1070
pF1KA0 RIHSLSSNKPQRPVFTHENIQGGGEPFQALYNYTPRNEDELELRESDVIDVMEKCDDGWF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 RIHSLSSNKPQRPVFTHENIQGGGEPFQALYNYTPRNEDELELRESDVIDVMEKCDDGWF
930 940 950 960 970 980
1080 1090 1100
pF1KA0 VGTSRRTKFFGTFPGNYVKRL
:::::::::::::::::::::
CCDS47 VGTSRRTKFFGTFPGNYVKRL
990 1000
>>CCDS54825.1 SORBS2 gene_id:8470|Hs108|chr4 (661 aa)
initn: 3397 init1: 1824 opt: 1867 Z-score: 1113.9 bits: 217.2 E(32554): 1e-55
Smith-Waterman score: 2800; 51.2% identity (51.2% similar) in 1119 aa overlap (1-1100:70-661)
10 20 30
pF1KA0 MSYYQRPFSPSAYSLPASLNSSIVMQHGTS
::::::::::::::::::::::::::::::
CCDS54 RSVRPNLQDKRSPTQSQITVNGNSGGAVSPMSYYQRPFSPSAYSLPASLNSSIVMQHGTS
40 50 60 70 80 90
40 50 60 70 80 90
pF1KA0 LDSTDTYPQHAQSLDGTTSSSIPLYRSSEEEKRVTVIKAPHYPGIGPVDESGIPTAIRTT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 LDSTDTYPQHAQSLDGTTSSSIPLYRSSEEEKRVTVIKAPHYPGIGPVDESGIPTAIRTT
100 110 120 130 140 150
100 110 120 130 140 150
pF1KA0 VDRPKDWYKTMFKQIHMVHKPDDDTDMYNTPYTYNAGLYNPPYSAQSHPAAKTQTYRPLS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 VDRPKDWYKTMFKQIHMVHKPDDDTDMYNTPYTYNAGLYNPPYSAQSHPAAKTQTYRPLS
160 170 180 190 200 210
160 170 180 190 200 210
pF1KA0 KSHSDNSPNAFKDASSPVPPPHVPPPVPPLRPRDRSSTEKHDWDPPDRKVDTRKFRSEPR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 KSHSDNSPNAFKDASSPVPPPHVPPPVPPLRPRDRSSTEKHDWDPPDRKVDTRKFRSEPR
220 230 240 250 260 270
220 230 240 250
pF1KA0 SIFEYEPGKSSILQHERP-------------------ASLYQSSIDRSLERPMSSASMAS
:::::::::::::::::: :::::::::::::::::::::::
CCDS54 SIFEYEPGKSSILQHERPPPKKPLDYVQDHSSGVFNEASLYQSSIDRSLERPMSSASMAS
280 290 300 310 320 330
260 270 280 290 300 310
pF1KA0 DFRKRRKSEPAVGPPRGLGDQSASRTSPGRVDLPGSSTTLTKSFTSSSPSSPSRAKGGDD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DFRKRRKSEPAVGPPRGLGDQSASRTSPGRVDLPGSSTTLTKSFTSSSPSSPSRAK----
340 350 360 370 380 390
320 330 340 350 360 370
pF1KA0 SKICPSLCSYSGLNGNPSSELDYCSTYRQHLDVPRDSPRAISFKNGWQMARQNAEIWSST
CCDS54 ------------------------------------------------------------
380 390 400 410 420 430
pF1KA0 EETVSPKIKSRSCDDLLNDDCDSFPDPKVKSESMGSLLCEEDSKESCPMAWGSPYVPEVR
CCDS54 ------------------------------------------------------------
440 450 460 470 480 490
pF1KA0 SNGRSRIRHRSARNAPGFLKMYKKMHRINRKDLMNSEVICSVKSRILQYESEQQHKDLLR
CCDS54 ------------------------------------------------------------
500 510 520 530 540 550
pF1KA0 AWSQCSTEEVPRDMVPTRISEFEKLIQKSKSMPNLGDDMLSPVTLEPPQNGLCPKRRFSI
CCDS54 ------------------------------------------------------------
560 570 580 590 600 610
pF1KA0 EYLLEEENQSGPPARGRRGCQSNALVPIHIEVTSDEQPRAHVEFSDSDQDGVVSDHSDYI
CCDS54 ------------------------------------------------------------
620 630 640 650 660 670
pF1KA0 HLEGSSFCSESDFDHFSFTSSESFYGSSHHHHHHHHHHHRHLISSCKGRCPASYTRFTTM
CCDS54 ------------------------------------------------------------
680 690 700 710 720 730
pF1KA0 LKHERARHENTEEPRRQEMDPGLSKLAFLVSPVPFRRKKNSAPKKQTEKAKCKASVFEAL
CCDS54 ------------------------------------------------------------
740 750 760 770 780 790
pF1KA0 DSALKDICDQIKAEKKRGSLPDNSILHRLISELLPDVPERNSSLRALRRSPLHQPLHPLP
CCDS54 ------------------------------------------------------------
800 810 820 830 840 850
pF1KA0 PDGAIHCPPYQNDCGRMPRSASFQDVDTANSSCHHQDRGGALQDRESPRSYSSTLTDMGR
:::::::::::::::::
CCDS54 -------------------------------------------DRESPRSYSSTLTDMGR
400 410
860 870 880 890 900 910
pF1KA0 SAPRERRGTPEKEKLPAKAVYDFKAQTSKELSFKKGDTVYILRKIDQNWYEGEHHGRVGI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 SAPRERRGTPEKEKLPAKAVYDFKAQTSKELSFKKGDTVYILRKIDQNWYEGEHHGRVGI
420 430 440 450 460 470
920 930 940 950 960 970
pF1KA0 FPISYVEKLTPPEKAQPARPPPPAQPGEIGEAIAKYNFNADTNVELSLRKGDRVILLKRV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 FPISYVEKLTPPEKAQPARPPPPAQPGEIGEAIAKYNFNADTNVELSLRKGDRVILLKRV
480 490 500 510 520 530
980 990 1000 1010 1020 1030
pF1KA0 DQNWYEGKIPGTNRQGIFPVSYVEVVKKNTKGAEDYPDPPIPHSYSSDRIHSLSSNKPQR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DQNWYEGKIPGTNRQGIFPVSYVEVVKKNTKGAEDYPDPPIPHSYSSDRIHSLSSNKPQR
540 550 560 570 580 590
1040 1050 1060 1070 1080 1090
pF1KA0 PVFTHENIQGGGEPFQALYNYTPRNEDELELRESDVIDVMEKCDDGWFVGTSRRTKFFGT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PVFTHENIQGGGEPFQALYNYTPRNEDELELRESDVIDVMEKCDDGWFVGTSRRTKFFGT
600 610 620 630 640 650
1100
pF1KA0 FPGNYVKRL
:::::::::
CCDS54 FPGNYVKRL
660
>>CCDS47174.1 SORBS2 gene_id:8470|Hs108|chr4 (731 aa)
initn: 3400 init1: 1824 opt: 1848 Z-score: 1102.2 bits: 215.1 E(32554): 4.5e-55
Smith-Waterman score: 2694; 48.9% identity (48.9% similar) in 1172 aa overlap (1-1100:87-731)
10 20 30
pF1KA0 MSYYQRPFSPSAYSLPASLNSSIVMQHGTS
::::::::::::::::::::::::::::::
CCDS47 RSVRPNLQDKRSPTQSQITVNGNSGGAVSPMSYYQRPFSPSAYSLPASLNSSIVMQHGTS
60 70 80 90 100 110
40 50 60 70 80 90
pF1KA0 LDSTDTYPQHAQSLDGTTSSSIPLYRSSEEEKRVTVIKAPHYPGIGPVDESGIPTAIRTT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LDSTDTYPQHAQSLDGTTSSSIPLYRSSEEEKRVTVIKAPHYPGIGPVDESGIPTAIRTT
120 130 140 150 160 170
100 110 120 130 140 150
pF1KA0 VDRPKDWYKTMFKQIHMVHKPDDDTDMYNTPYTYNAGLYNPPYSAQSHPAAKTQTYRPLS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 VDRPKDWYKTMFKQIHMVHKPDDDTDMYNTPYTYNAGLYNPPYSAQSHPAAKTQTYRPLS
180 190 200 210 220 230
160 170 180 190 200 210
pF1KA0 KSHSDNSPNAFKDASSPVPPPHVPPPVPPLRPRDRSSTEKHDWDPPDRKVDTRKFRSEPR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KSHSDNSPNAFKDASSPVPPPHVPPPVPPLRPRDRSSTEKHDWDPPDRKVDTRKFRSEPR
240 250 260 270 280 290
220
pF1KA0 SIFEYEPGKSSILQHERP------------------------------------------
::::::::::::::::::
CCDS47 SIFEYEPGKSSILQHERPPPLPTTPTPVPREPGRKPLSSSRLGEVTGSPSPPPRSGAPTP
300 310 320 330 340 350
230 240 250
pF1KA0 ------------------------------ASLYQSSIDRSLERPMSSASMASDFRKRRK
::::::::::::::::::::::::::::::
CCDS47 SSRAPALSPTRPPKKPLDYVQDHSSGVFNEASLYQSSIDRSLERPMSSASMASDFRKRRK
360 370 380 390 400 410
260 270 280 290 300 310
pF1KA0 SEPAVGPPRGLGDQSASRTSPGRVDLPGSSTTLTKSFTSSSPSSPSRAKGGDDSKICPSL
:::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 SEPAVGPPRGLGDQSASRTSPGRVDLPGSSTTLTKSFTSSSPSSPSRAK-----------
420 430 440 450 460
320 330 340 350 360 370
pF1KA0 CSYSGLNGNPSSELDYCSTYRQHLDVPRDSPRAISFKNGWQMARQNAEIWSSTEETVSPK
CCDS47 ------------------------------------------------------------
380 390 400 410 420 430
pF1KA0 IKSRSCDDLLNDDCDSFPDPKVKSESMGSLLCEEDSKESCPMAWGSPYVPEVRSNGRSRI
CCDS47 ------------------------------------------------------------
440 450 460 470 480 490
pF1KA0 RHRSARNAPGFLKMYKKMHRINRKDLMNSEVICSVKSRILQYESEQQHKDLLRAWSQCST
CCDS47 ------------------------------------------------------------
500 510 520 530 540 550
pF1KA0 EEVPRDMVPTRISEFEKLIQKSKSMPNLGDDMLSPVTLEPPQNGLCPKRRFSIEYLLEEE
CCDS47 ------------------------------------------------------------
560 570 580 590 600 610
pF1KA0 NQSGPPARGRRGCQSNALVPIHIEVTSDEQPRAHVEFSDSDQDGVVSDHSDYIHLEGSSF
CCDS47 ------------------------------------------------------------
620 630 640 650 660 670
pF1KA0 CSESDFDHFSFTSSESFYGSSHHHHHHHHHHHRHLISSCKGRCPASYTRFTTMLKHERAR
CCDS47 ------------------------------------------------------------
680 690 700 710 720 730
pF1KA0 HENTEEPRRQEMDPGLSKLAFLVSPVPFRRKKNSAPKKQTEKAKCKASVFEALDSALKDI
CCDS47 ------------------------------------------------------------
740 750 760 770 780 790
pF1KA0 CDQIKAEKKRGSLPDNSILHRLISELLPDVPERNSSLRALRRSPLHQPLHPLPPDGAIHC
CCDS47 ------------------------------------------------------------
800 810 820 830 840 850
pF1KA0 PPYQNDCGRMPRSASFQDVDTANSSCHHQDRGGALQDRESPRSYSSTLTDMGRSAPRERR
::::::::::::::::::::::::
CCDS47 ------------------------------------DRESPRSYSSTLTDMGRSAPRERR
470 480
860 870 880 890 900 910
pF1KA0 GTPEKEKLPAKAVYDFKAQTSKELSFKKGDTVYILRKIDQNWYEGEHHGRVGIFPISYVE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 GTPEKEKLPAKAVYDFKAQTSKELSFKKGDTVYILRKIDQNWYEGEHHGRVGIFPISYVE
490 500 510 520 530 540
920 930 940 950 960 970
pF1KA0 KLTPPEKAQPARPPPPAQPGEIGEAIAKYNFNADTNVELSLRKGDRVILLKRVDQNWYEG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KLTPPEKAQPARPPPPAQPGEIGEAIAKYNFNADTNVELSLRKGDRVILLKRVDQNWYEG
550 560 570 580 590 600
980 990 1000 1010 1020 1030
pF1KA0 KIPGTNRQGIFPVSYVEVVKKNTKGAEDYPDPPIPHSYSSDRIHSLSSNKPQRPVFTHEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KIPGTNRQGIFPVSYVEVVKKNTKGAEDYPDPPIPHSYSSDRIHSLSSNKPQRPVFTHEN
610 620 630 640 650 660
1040 1050 1060 1070 1080 1090
pF1KA0 IQGGGEPFQALYNYTPRNEDELELRESDVIDVMEKCDDGWFVGTSRRTKFFGTFPGNYVK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 IQGGGEPFQALYNYTPRNEDELELRESDVIDVMEKCDDGWFVGTSRRTKFFGTFPGNYVK
670 680 690 700 710 720
1100
pF1KA0 RL
::
CCDS47 RL
730
>>CCDS43289.2 SORBS2 gene_id:8470|Hs108|chr4 (666 aa)
initn: 3397 init1: 1824 opt: 1847 Z-score: 1102.1 bits: 215.0 E(32554): 4.5e-55
Smith-Waterman score: 2744; 50.0% identity (50.0% similar) in 1147 aa overlap (1-1100:47-666)
10 20 30
pF1KA0 MSYYQRPFSPSAYSLPASLNSSIVMQHGTS
::::::::::::::::::::::::::::::
CCDS43 RSVRPNLQDKRSPTQSQITVNGNSGGAVSPMSYYQRPFSPSAYSLPASLNSSIVMQHGTS
20 30 40 50 60 70
40 50 60 70 80 90
pF1KA0 LDSTDTYPQHAQSLDGTTSSSIPLYRSSEEEKRVTVIKAPHYPGIGPVDESGIPTAIRTT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 LDSTDTYPQHAQSLDGTTSSSIPLYRSSEEEKRVTVIKAPHYPGIGPVDESGIPTAIRTT
80 90 100 110 120 130
100 110 120 130 140 150
pF1KA0 VDRPKDWYKTMFKQIHMVHKPDDDTDMYNTPYTYNAGLYNPPYSAQSHPAAKTQTYRPLS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 VDRPKDWYKTMFKQIHMVHKPDDDTDMYNTPYTYNAGLYNPPYSAQSHPAAKTQTYRPLS
140 150 160 170 180 190
160 170 180 190 200 210
pF1KA0 KSHSDNSPNAFKDASSPVPPPHVPPPVPPLRPRDRSSTEKHDWDPPDRKVDTRKFRSEPR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 KSHSDNSPNAFKDASSPVPPPHVPPPVPPLRPRDRSSTEKHDWDPPDRKVDTRKFRSEPR
200 210 220 230 240 250
220
pF1KA0 SIFEYEPGKSSILQHERP------------------------------------------
::::::::::::::::::
CCDS43 SIFEYEPGKSSILQHERPTDRINPDDIDLENEPWYKFFSELEFGRPPPKKPLDYVQDHSS
260 270 280 290 300 310
230 240 250 260 270 280
pF1KA0 -----ASLYQSSIDRSLERPMSSASMASDFRKRRKSEPAVGPPRGLGDQSASRTSPGRVD
:::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 GVFNEASLYQSSIDRSLERPMSSASMASDFRKRRKSEPAVGPPRGLGDQSASRTSPGRVD
320 330 340 350 360 370
290 300 310 320 330 340
pF1KA0 LPGSSTTLTKSFTSSSPSSPSRAKGGDDSKICPSLCSYSGLNGNPSSELDYCSTYRQHLD
::::::::::::::::::::::::
CCDS43 LPGSSTTLTKSFTSSSPSSPSRAK------------------------------------
380 390 400
350 360 370 380 390 400
pF1KA0 VPRDSPRAISFKNGWQMARQNAEIWSSTEETVSPKIKSRSCDDLLNDDCDSFPDPKVKSE
CCDS43 ------------------------------------------------------------
410 420 430 440 450 460
pF1KA0 SMGSLLCEEDSKESCPMAWGSPYVPEVRSNGRSRIRHRSARNAPGFLKMYKKMHRINRKD
CCDS43 ------------------------------------------------------------
470 480 490 500 510 520
pF1KA0 LMNSEVICSVKSRILQYESEQQHKDLLRAWSQCSTEEVPRDMVPTRISEFEKLIQKSKSM
CCDS43 ------------------------------------------------------------
530 540 550 560 570 580
pF1KA0 PNLGDDMLSPVTLEPPQNGLCPKRRFSIEYLLEEENQSGPPARGRRGCQSNALVPIHIEV
CCDS43 ------------------------------------------------------------
590 600 610 620 630 640
pF1KA0 TSDEQPRAHVEFSDSDQDGVVSDHSDYIHLEGSSFCSESDFDHFSFTSSESFYGSSHHHH
CCDS43 ------------------------------------------------------------
650 660 670 680 690 700
pF1KA0 HHHHHHHRHLISSCKGRCPASYTRFTTMLKHERARHENTEEPRRQEMDPGLSKLAFLVSP
CCDS43 ------------------------------------------------------------
710 720 730 740 750 760
pF1KA0 VPFRRKKNSAPKKQTEKAKCKASVFEALDSALKDICDQIKAEKKRGSLPDNSILHRLISE
CCDS43 ------------------------------------------------------------
770 780 790 800 810 820
pF1KA0 LLPDVPERNSSLRALRRSPLHQPLHPLPPDGAIHCPPYQNDCGRMPRSASFQDVDTANSS
CCDS43 ------------------------------------------------------------
830 840 850 860 870 880
pF1KA0 CHHQDRGGALQDRESPRSYSSTLTDMGRSAPRERRGTPEKEKLPAKAVYDFKAQTSKELS
:::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 -----------DRESPRSYSSTLTDMGRSAPRERRGTPEKEKLPAKAVYDFKAQTSKELS
410 420 430 440
890 900 910 920 930 940
pF1KA0 FKKGDTVYILRKIDQNWYEGEHHGRVGIFPISYVEKLTPPEKAQPARPPPPAQPGEIGEA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 FKKGDTVYILRKIDQNWYEGEHHGRVGIFPISYVEKLTPPEKAQPARPPPPAQPGEIGEA
450 460 470 480 490 500
950 960 970 980 990 1000
pF1KA0 IAKYNFNADTNVELSLRKGDRVILLKRVDQNWYEGKIPGTNRQGIFPVSYVEVVKKNTKG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 IAKYNFNADTNVELSLRKGDRVILLKRVDQNWYEGKIPGTNRQGIFPVSYVEVVKKNTKG
510 520 530 540 550 560
1010 1020 1030 1040 1050 1060
pF1KA0 AEDYPDPPIPHSYSSDRIHSLSSNKPQRPVFTHENIQGGGEPFQALYNYTPRNEDELELR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 AEDYPDPPIPHSYSSDRIHSLSSNKPQRPVFTHENIQGGGEPFQALYNYTPRNEDELELR
570 580 590 600 610 620
1070 1080 1090 1100
pF1KA0 ESDVIDVMEKCDDGWFVGTSRRTKFFGTFPGNYVKRL
:::::::::::::::::::::::::::::::::::::
CCDS43 ESDVIDVMEKCDDGWFVGTSRRTKFFGTFPGNYVKRL
630 640 650 660
>>CCDS47175.1 SORBS2 gene_id:8470|Hs108|chr4 (824 aa)
initn: 3400 init1: 1824 opt: 1848 Z-score: 1101.4 bits: 215.2 E(32554): 5e-55
Smith-Waterman score: 2694; 48.9% identity (48.9% similar) in 1172 aa overlap (1-1100:180-824)
10 20 30
pF1KA0 MSYYQRPFSPSAYSLPASLNSSIVMQHGTS
::::::::::::::::::::::::::::::
CCDS47 KSAVAAASQSSDCRVSQITVNGNSGGAVSPMSYYQRPFSPSAYSLPASLNSSIVMQHGTS
150 160 170 180 190 200
40 50 60 70 80 90
pF1KA0 LDSTDTYPQHAQSLDGTTSSSIPLYRSSEEEKRVTVIKAPHYPGIGPVDESGIPTAIRTT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LDSTDTYPQHAQSLDGTTSSSIPLYRSSEEEKRVTVIKAPHYPGIGPVDESGIPTAIRTT
210 220 230 240 250 260
100 110 120 130 140 150
pF1KA0 VDRPKDWYKTMFKQIHMVHKPDDDTDMYNTPYTYNAGLYNPPYSAQSHPAAKTQTYRPLS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 VDRPKDWYKTMFKQIHMVHKPDDDTDMYNTPYTYNAGLYNPPYSAQSHPAAKTQTYRPLS
270 280 290 300 310 320
160 170 180 190 200 210
pF1KA0 KSHSDNSPNAFKDASSPVPPPHVPPPVPPLRPRDRSSTEKHDWDPPDRKVDTRKFRSEPR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KSHSDNSPNAFKDASSPVPPPHVPPPVPPLRPRDRSSTEKHDWDPPDRKVDTRKFRSEPR
330 340 350 360 370 380
220
pF1KA0 SIFEYEPGKSSILQHERP------------------------------------------
::::::::::::::::::
CCDS47 SIFEYEPGKSSILQHERPPPLPTTPTPVPREPGRKPLSSSRLGEVTGSPSPPPRSGAPTP
390 400 410 420 430 440
230 240 250
pF1KA0 ------------------------------ASLYQSSIDRSLERPMSSASMASDFRKRRK
::::::::::::::::::::::::::::::
CCDS47 SSRAPALSPTRPPKKPLDYVQDHSSGVFNEASLYQSSIDRSLERPMSSASMASDFRKRRK
450 460 470 480 490 500
260 270 280 290 300 310
pF1KA0 SEPAVGPPRGLGDQSASRTSPGRVDLPGSSTTLTKSFTSSSPSSPSRAKGGDDSKICPSL
:::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 SEPAVGPPRGLGDQSASRTSPGRVDLPGSSTTLTKSFTSSSPSSPSRAK-----------
510 520 530 540 550
320 330 340 350 360 370
pF1KA0 CSYSGLNGNPSSELDYCSTYRQHLDVPRDSPRAISFKNGWQMARQNAEIWSSTEETVSPK
CCDS47 ------------------------------------------------------------
380 390 400 410 420 430
pF1KA0 IKSRSCDDLLNDDCDSFPDPKVKSESMGSLLCEEDSKESCPMAWGSPYVPEVRSNGRSRI
CCDS47 ------------------------------------------------------------
440 450 460 470 480 490
pF1KA0 RHRSARNAPGFLKMYKKMHRINRKDLMNSEVICSVKSRILQYESEQQHKDLLRAWSQCST
CCDS47 ------------------------------------------------------------
500 510 520 530 540 550
pF1KA0 EEVPRDMVPTRISEFEKLIQKSKSMPNLGDDMLSPVTLEPPQNGLCPKRRFSIEYLLEEE
CCDS47 ------------------------------------------------------------
560 570 580 590 600 610
pF1KA0 NQSGPPARGRRGCQSNALVPIHIEVTSDEQPRAHVEFSDSDQDGVVSDHSDYIHLEGSSF
CCDS47 ------------------------------------------------------------
620 630 640 650 660 670
pF1KA0 CSESDFDHFSFTSSESFYGSSHHHHHHHHHHHRHLISSCKGRCPASYTRFTTMLKHERAR
CCDS47 ------------------------------------------------------------
680 690 700 710 720 730
pF1KA0 HENTEEPRRQEMDPGLSKLAFLVSPVPFRRKKNSAPKKQTEKAKCKASVFEALDSALKDI
CCDS47 ------------------------------------------------------------
740 750 760 770 780 790
pF1KA0 CDQIKAEKKRGSLPDNSILHRLISELLPDVPERNSSLRALRRSPLHQPLHPLPPDGAIHC
CCDS47 ------------------------------------------------------------
800 810 820 830 840 850
pF1KA0 PPYQNDCGRMPRSASFQDVDTANSSCHHQDRGGALQDRESPRSYSSTLTDMGRSAPRERR
::::::::::::::::::::::::
CCDS47 ------------------------------------DRESPRSYSSTLTDMGRSAPRERR
560 570 580
860 870 880 890 900 910
pF1KA0 GTPEKEKLPAKAVYDFKAQTSKELSFKKGDTVYILRKIDQNWYEGEHHGRVGIFPISYVE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 GTPEKEKLPAKAVYDFKAQTSKELSFKKGDTVYILRKIDQNWYEGEHHGRVGIFPISYVE
590 600 610 620 630 640
920 930 940 950 960 970
pF1KA0 KLTPPEKAQPARPPPPAQPGEIGEAIAKYNFNADTNVELSLRKGDRVILLKRVDQNWYEG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KLTPPEKAQPARPPPPAQPGEIGEAIAKYNFNADTNVELSLRKGDRVILLKRVDQNWYEG
650 660 670 680 690 700
980 990 1000 1010 1020 1030
pF1KA0 KIPGTNRQGIFPVSYVEVVKKNTKGAEDYPDPPIPHSYSSDRIHSLSSNKPQRPVFTHEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KIPGTNRQGIFPVSYVEVVKKNTKGAEDYPDPPIPHSYSSDRIHSLSSNKPQRPVFTHEN
710 720 730 740 750 760
1040 1050 1060 1070 1080 1090
pF1KA0 IQGGGEPFQALYNYTPRNEDELELRESDVIDVMEKCDDGWFVGTSRRTKFFGTFPGNYVK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 IQGGGEPFQALYNYTPRNEDELELRESDVIDVMEKCDDGWFVGTSRRTKFFGTFPGNYVK
770 780 790 800 810 820
1100
pF1KA0 RL
::
CCDS47 RL
>>CCDS47173.1 SORBS2 gene_id:8470|Hs108|chr4 (644 aa)
initn: 2555 init1: 1824 opt: 1844 Z-score: 1100.6 bits: 214.7 E(32554): 5.6e-55
Smith-Waterman score: 2701; 50.7% identity (50.7% similar) in 1100 aa overlap (1-1100:87-644)
10 20 30
pF1KA0 MSYYQRPFSPSAYSLPASLNSSIVMQHGTS
::::::::::::::::::::::::::::::
CCDS47 RSVRPNLQDKRSPTQSQITVNGNSGGAVSPMSYYQRPFSPSAYSLPASLNSSIVMQHGTS
60 70 80 90 100 110
40 50 60 70 80 90
pF1KA0 LDSTDTYPQHAQSLDGTTSSSIPLYRSSEEEKRVTVIKAPHYPGIGPVDESGIPTAIRTT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LDSTDTYPQHAQSLDGTTSSSIPLYRSSEEEKRVTVIKAPHYPGIGPVDESGIPTAIRTT
120 130 140 150 160 170
100 110 120 130 140 150
pF1KA0 VDRPKDWYKTMFKQIHMVHKPDDDTDMYNTPYTYNAGLYNPPYSAQSHPAAKTQTYRPLS
:::::::::::::::::::::               ::::::::::::::::::::::::
CCDS47 VDRPKDWYKTMFKQIHMVHKP---------------GLYNPPYSAQSHPAAKTQTYRPLS
180 190 200 210 220
160 170 180 190 200 210
pF1KA0 KSHSDNSPNAFKDASSPVPPPHVPPPVPPLRPRDRSSTEKHDWDPPDRKVDTRKFRSEPR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 KSHSDNSPNAFKDASSPVPPPHVPPPVPPLRPRDRSSTEKHDWDPPDRKVDTRKFRSEPR
230 240 250 260 270 280
220 230 240 250 260 270
pF1KA0 SIFEYEPGKSSILQHERPASLYQSSIDRSLERPMSSASMASDFRKRRKSEPAVGPPRGLG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 SIFEYEPGKSSILQHERPASLYQSSIDRSLERPMSSASMASDFRKRRKSEPAVGPPRGLG
290 300 310 320 330 340
280 290 300 310 320 330
pF1KA0 DQSASRTSPGRVDLPGSSTTLTKSFTSSSPSSPSRAKGGDDSKICPSLCSYSGLNGNPSS
:::::::::::::::::::::::::::::::::::::
CCDS47 DQSASRTSPGRVDLPGSSTTLTKSFTSSSPSSPSRAK-----------------------
350 360 370
340 350 360 370 380 390
pF1KA0 ELDYCSTYRQHLDVPRDSPRAISFKNGWQMARQNAEIWSSTEETVSPKIKSRSCDDLLND
CCDS47 ------------------------------------------------------------
400 410 420 430 440 450
pF1KA0 DCDSFPDPKVKSESMGSLLCEEDSKESCPMAWGSPYVPEVRSNGRSRIRHRSARNAPGFL
CCDS47 ------------------------------------------------------------
460 470 480 490 500 510
pF1KA0 KMYKKMHRINRKDLMNSEVICSVKSRILQYESEQQHKDLLRAWSQCSTEEVPRDMVPTRI
CCDS47 ------------------------------------------------------------
520 530 540 550 560 570
pF1KA0 SEFEKLIQKSKSMPNLGDDMLSPVTLEPPQNGLCPKRRFSIEYLLEEENQSGPPARGRRG
CCDS47 ------------------------------------------------------------
580 590 600 610 620 630
pF1KA0 CQSNALVPIHIEVTSDEQPRAHVEFSDSDQDGVVSDHSDYIHLEGSSFCSESDFDHFSFT
CCDS47 ------------------------------------------------------------
640 650 660 670 680 690
pF1KA0 SSESFYGSSHHHHHHHHHHHRHLISSCKGRCPASYTRFTTMLKHERARHENTEEPRRQEM
CCDS47 ------------------------------------------------------------
700 710 720 730 740 750
pF1KA0 DPGLSKLAFLVSPVPFRRKKNSAPKKQTEKAKCKASVFEALDSALKDICDQIKAEKKRGS
CCDS47 ------------------------------------------------------------
760 770 780 790 800 810
pF1KA0 LPDNSILHRLISELLPDVPERNSSLRALRRSPLHQPLHPLPPDGAIHCPPYQNDCGRMPR
CCDS47 ------------------------------------------------------------
820 830 840 850 860 870
pF1KA0 SASFQDVDTANSSCHHQDRGGALQDRESPRSYSSTLTDMGRSAPRERRGTPEKEKLPAKA
                        ::::::::::::::::::::::::::::::::::::
CCDS47 ------------------------DRESPRSYSSTLTDMGRSAPRERRGTPEKEKLPAKA
380 390 400 410
880 890 900 910 920 930
pF1KA0 VYDFKAQTSKELSFKKGDTVYILRKIDQNWYEGEHHGRVGIFPISYVEKLTPPEKAQPAR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 VYDFKAQTSKELSFKKGDTVYILRKIDQNWYEGEHHGRVGIFPISYVEKLTPPEKAQPAR
420 430 440 450 460 470
940 950 960 970 980 990
pF1KA0 PPPPAQPGEIGEAIAKYNFNADTNVELSLRKGDRVILLKRVDQNWYEGKIPGTNRQGIFP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 PPPPAQPGEIGEAIAKYNFNADTNVELSLRKGDRVILLKRVDQNWYEGKIPGTNRQGIFP
480 490 500 510 520 530
1000 1010 1020 1030 1040 1050
pF1KA0 VSYVEVVKKNTKGAEDYPDPPIPHSYSSDRIHSLSSNKPQRPVFTHENIQGGGEPFQALY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 VSYVEVVKKNTKGAEDYPDPPIPHSYSSDRIHSLSSNKPQRPVFTHENIQGGGEPFQALY
540 550 560 570 580 590
1060 1070 1080 1090 1100
pF1KA0 NYTPRNEDELELRESDVIDVMEKCDDGWFVGTSRRTKFFGTFPGNYVKRL
::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 NYTPRNEDELELRESDVIDVMEKCDDGWFVGTSRRTKFFGTFPGNYVKRL
600 610 620 630 640
>>CCDS31252.1 SORBS1 gene_id:10580|Hs108|chr10 (684 aa)
initn: 1402 init1: 347 opt: 1006 Z-score: 607.9 bits: 123.6 E(32554): 1.5e-27
Smith-Waterman score: 1006; 56.3% identity (76.9% similar) in 286 aa overlap (822-1100:403-682)
800 810 820 830 840 850
pF1KA0 PDGAIHCPPYQNDCGRMPRSASFQDVDTANSSCHHQDRGGALQDRESPRSYSSTLTDMGR
:. .. . ... .: :::: . .:. .
CCDS31 GAPGDLTSLENERQIYKSVLEGGDIPLQGLSGLKRPSSSASTKDSESPRHFIP--ADYLE
380 390 400 410 420 430
860 870 880 890 900 910
pF1KA0 SAPRE-RRGTPEKEKLPAKAVYDFKAQTSKELSFKKGDTVYILRKIDQNWYEGEHHGRVG
:. . :: .:: ::.: .:::::: ::: ..::: ::: ..:::::::::::::::
CCDS31 STEEFIRRRHDDKEMRPARAKFDFKAQTLKELPLQKGDIVYIYKQIDQNWYEGEHHGRVG
440 450 460 470 480 490
920 930 940 950 960 970
pF1KA0 IFPISYVEKLTPPEKAQPARPPPPAQPGEIGEAIAKYNFNADTNVELSLRKGDRVILLKR
::: .:.: : : ::::: . : .: : ::::::.:::.::.::.:.:::.:. ::..
CCDS31 IFPRTYIELLPPAEKAQPKKLTP-VQVLEYGEAIAKFNFNGDTQVEMSFRKGERITLLRQ
500 510 520 530 540
980 990 1000 1010 1020
pF1KA0 VDQNWYEGKIPGTNRQGIFPVSYVEVVKKN-TKGAEDYPDPPIPHSYSSDRIHSLSSNKP
::.:::::.::::.::::::..::.:.:. .:. :: : .: : : .: . : ..:
CCDS31 VDENWYEGRIPGTSRQGIFPITYVDVIKRPLVKNPVDYMD--LPFSSSPSRSATASPQQP
550 560 570 580 590 600
1030 1040 1050 1060 1070 1080
pF1KA0 Q---RPVFTHENIQGGGEPF--QALYNYTPRNEDELELRESDVIDVMEKCDDGWFVGTSR
: : : : . : . . : ::::.: :.:.::::::..:..::::::::::::::::
CCDS31 QAQQRRV-TPDRSQTSQDLFSYQALYSYIPQNDDELELRDGDIVDVMEKCDDGWFVGTSR
610 620 630 640 650 660
1090 1100
pF1KA0 RTKFFGTFPGNYVKRL
::: :::::::::: :
CCDS31 RTKQFGTFPGNYVKPLYL
670 680
>>CCDS7442.1 SORBS1 gene_id:10580|Hs108|chr10 (816 aa)
initn: 1402 init1: 347 opt: 1006 Z-score: 606.8 bits: 123.6 E(32554): 1.8e-27
Smith-Waterman score: 1006; 56.3% identity (76.9% similar) in 286 aa overlap (822-1100:535-814)
800 810 820 830 840 850
pF1KA0 PDGAIHCPPYQNDCGRMPRSASFQDVDTANSSCHHQDRGGALQDRESPRSYSSTLTDMGR
:. .. . ... .: :::: . .:. .
CCDS74 GAPGDLTSLENERQIYKSVLEGGDIPLQGLSGLKRPSSSASTKDSESPRHFIP--ADYLE
510 520 530 540 550 560
860 870 880 890 900 910
pF1KA0 SAPRE-RRGTPEKEKLPAKAVYDFKAQTSKELSFKKGDTVYILRKIDQNWYEGEHHGRVG
:. . :: .:: ::.: .:::::: ::: ..::: ::: ..:::::::::::::::
CCDS74 STEEFIRRRHDDKEMRPARAKFDFKAQTLKELPLQKGDIVYIYKQIDQNWYEGEHHGRVG
570 580 590 600 610 620
920 930 940 950 960 970
pF1KA0 IFPISYVEKLTPPEKAQPARPPPPAQPGEIGEAIAKYNFNADTNVELSLRKGDRVILLKR
::: .:.: : : ::::: . : .: : ::::::.:::.::.::.:.:::.:. ::..
CCDS74 IFPRTYIELLPPAEKAQPKKLTP-VQVLEYGEAIAKFNFNGDTQVEMSFRKGERITLLRQ
630 640 650 660 670 680
980 990 1000 1010 1020
pF1KA0 VDQNWYEGKIPGTNRQGIFPVSYVEVVKKN-TKGAEDYPDPPIPHSYSSDRIHSLSSNKP
::.:::::.::::.::::::..::.:.:. .:. :: : .: : : .: . : ..:
CCDS74 VDENWYEGRIPGTSRQGIFPITYVDVIKRPLVKNPVDYMD--LPFSSSPSRSATASPQQP
690 700 710 720 730
1030 1040 1050 1060 1070 1080
pF1KA0 Q---RPVFTHENIQGGGEPF--QALYNYTPRNEDELELRESDVIDVMEKCDDGWFVGTSR
: : : : . : . . : ::::.: :.:.::::::..:..::::::::::::::::
CCDS74 QAQQRRV-TPDRSQTSQDLFSYQALYSYIPQNDDELELRDGDIVDVMEKCDDGWFVGTSR
740 750 760 770 780 790
1090 1100
pF1KA0 RTKFFGTFPGNYVKRL
::: :::::::::: :
CCDS74 RTKQFGTFPGNYVKPLYL
800 810
1100 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 10:18:42 2016 done: Thu Nov 3 10:18:43 2016
Total Scan time: 5.710 Total Display time: 0.380
Function used was FASTA [36.3.4 Apr, 2011]