FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE2637, 758 aa
1>>>pF1KE2637 758 - 758 aa - 758 aa
Library: human.CCDS.faa
18921897 residues in 33420 sequences
Statistics: Expectation_n fit: rho(ln(x))= 13.5281+/-0.00105; mu= -15.3487+/- 0.063
mean_var=501.2325+/-101.777, 0's: 0 Z-trim(116.3): 71 B-trim: 0 in 0/55
Lambda= 0.057287
statistics sampled from 17029 (17094) to 17029 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.783), E-opt: 0.2 (0.511), width: 16
Scan time: 2.890
The best scores are: opt bits E(33420)
CCDS11501.1 MAPT gene_id:4137|Hs109|chr17 ( 758) 5100 436.3 8.9e-122
CCDS45715.1 MAPT gene_id:4137|Hs109|chr17 ( 776) 3473 301.8 2.7e-81
CCDS11499.1 MAPT gene_id:4137|Hs109|chr17 ( 441) 2013 181.0 3.7e-45
CCDS45716.1 MAPT gene_id:4137|Hs109|chr17 ( 412) 2008 180.5 4.7e-45
CCDS11500.1 MAPT gene_id:4137|Hs109|chr17 ( 383) 2007 180.4 4.7e-45
CCDS56033.1 MAPT gene_id:4137|Hs109|chr17 ( 410) 1189 112.9 1.1e-24
CCDS11502.1 MAPT gene_id:4137|Hs109|chr17 ( 352) 1183 112.3 1.4e-24
CCDS33369.1 MAP2 gene_id:4133|Hs109|chr2 ( 559) 1136 108.6 2.9e-23
CCDS2385.1 MAP2 gene_id:4133|Hs109|chr2 ( 471) 839 84.0 6.3e-16
CCDS86916.1 MAP2 gene_id:4133|Hs109|chr2 (1823) 847 85.1 1.1e-15
CCDS2384.1 MAP2 gene_id:4133|Hs109|chr2 (1827) 847 85.1 1.1e-15
CCDS46818.1 MAP4 gene_id:4134|Hs109|chr3 (1135) 726 74.9 8e-13
>>CCDS11501.1 MAPT gene_id:4137|Hs109|chr17 (758 aa)
initn: 5100 init1: 5100 opt: 5100 Z-score: 2300.0 bits: 436.3 E(33420): 8.9e-122
Smith-Waterman score: 5100; 100.0% identity (100.0% similar) in 758 aa overlap (1-758:1-758)
10 20 30 40 50 60
pF1KE2 MAEPRQEFEVMEDHAGTYGLGDRKDQGGYTMHQDQEGDTDAGLKESPLQTPTEDGSEEPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MAEPRQEFEVMEDHAGTYGLGDRKDQGGYTMHQDQEGDTDAGLKESPLQTPTEDGSEEPG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE2 SETSDAKSTPTAEDVTAPLVDEGAPGKQAAAQPHTEIPEGTTAEEAGIGDTPSLEDEAAG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 SETSDAKSTPTAEDVTAPLVDEGAPGKQAAAQPHTEIPEGTTAEEAGIGDTPSLEDEAAG
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE2 HVTQEPESGKVVQEGFLREPGPPGLSHQLMSGMPGAPLLPEGPREATRQPSGTGPEDTEG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 HVTQEPESGKVVQEGFLREPGPPGLSHQLMSGMPGAPLLPEGPREATRQPSGTGPEDTEG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE2 GRHAPELLKHQLLGDLHQEGPPLKGAGGKERPGSKEEVDEDRDVDESSPQDSPPSKASPA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 GRHAPELLKHQLLGDLHQEGPPLKGAGGKERPGSKEEVDEDRDVDESSPQDSPPSKASPA
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE2 QDGRPPQTAAREATSIPGFPAEGAIPLPVDFLSKVSTEIPASEPDGPSVGRAKGQDAPLE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 QDGRPPQTAAREATSIPGFPAEGAIPLPVDFLSKVSTEIPASEPDGPSVGRAKGQDAPLE
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE2 FTFHVEITPNVQKEQAHSEEHLGRAAFPGAPGEGPEARGPSLGEDTKEADLPEPSEKQPA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 FTFHVEITPNVQKEQAHSEEHLGRAAFPGAPGEGPEARGPSLGEDTKEADLPEPSEKQPA
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE2 AAPRGKPVSRVPQLKARMVSKSKDGTGSDDKKAKTSTRSSAKTLKNRPCLSPKHPTPGSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 AAPRGKPVSRVPQLKARMVSKSKDGTGSDDKKAKTSTRSSAKTLKNRPCLSPKHPTPGSS
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE2 DPLIQPSSPAVCPEPPSSPKYVSSVTSRTGSSGAKEMKLKGADGKTKIATPRGAAPPGQK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 DPLIQPSSPAVCPEPPSSPKYVSSVTSRTGSSGAKEMKLKGADGKTKIATPRGAAPPGQK
430 440 450 460 470 480
490 500 510 520 530 540
pF1KE2 GQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSPGSPGTPGSRSRTPSLPTPPTREP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 GQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSPGSPGTPGSRSRTPSLPTPPTREP
490 500 510 520 530 540
550 560 570 580 590 600
pF1KE2 KKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKIGSTENLKHQPGGGKVQIINKKLD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 KKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKIGSTENLKHQPGGGKVQIINKKLD
550 560 570 580 590 600
610 620 630 640 650 660
pF1KE2 LSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKVTSKCGSLGNIHHKPGGGQVEVKSEK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 LSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKVTSKCGSLGNIHHKPGGGQVEVKSEK
610 620 630 640 650 660
670 680 690 700 710 720
pF1KE2 LDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRENAKAKTDHGAEIVYKSPVVSGDT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 LDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRENAKAKTDHGAEIVYKSPVVSGDT
670 680 690 700 710 720
730 740 750
pF1KE2 SPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQGL
::::::::::::::::::::::::::::::::::::::
CCDS11 SPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQGL
730 740 750
>>CCDS45715.1 MAPT gene_id:4137|Hs109|chr17 (776 aa)
initn: 3757 init1: 3410 opt: 3473 Z-score: 1573.1 bits: 301.8 E(33420): 2.7e-81
Smith-Waterman score: 5054; 97.7% identity (97.7% similar) in 776 aa overlap (1-758:1-776)
10 20 30 40 50 60
pF1KE2 MAEPRQEFEVMEDHAGTYGLGDRKDQGGYTMHQDQEGDTDAGLKESPLQTPTEDGSEEPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 MAEPRQEFEVMEDHAGTYGLGDRKDQGGYTMHQDQEGDTDAGLKESPLQTPTEDGSEEPG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE2 SETSDAKSTPTAEDVTAPLVDEGAPGKQAAAQPHTEIPEGTTAEEAGIGDTPSLEDEAAG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 SETSDAKSTPTAEDVTAPLVDEGAPGKQAAAQPHTEIPEGTTAEEAGIGDTPSLEDEAAG
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE2 HVTQEPESGKVVQEGFLREPGPPGLSHQLMSGMPGAPLLPEGPREATRQPSGTGPEDTEG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 HVTQEPESGKVVQEGFLREPGPPGLSHQLMSGMPGAPLLPEGPREATRQPSGTGPEDTEG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE2 GRHAPELLKHQLLGDLHQEGPPLKGAGGKERPGSKEEVDEDRDVDESSPQDSPPSKASPA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 GRHAPELLKHQLLGDLHQEGPPLKGAGGKERPGSKEEVDEDRDVDESSPQDSPPSKASPA
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE2 QDGRPPQTAAREATSIPGFPAEGAIPLPVDFLSKVSTEIPASEPDGPSVGRAKGQDAPLE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 QDGRPPQTAAREATSIPGFPAEGAIPLPVDFLSKVSTEIPASEPDGPSVGRAKGQDAPLE
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE2 FTFHVEITPNVQKEQAHSEEHLGRAAFPGAPGEGPEARGPSLGEDTKEADLPEPSEKQPA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 FTFHVEITPNVQKEQAHSEEHLGRAAFPGAPGEGPEARGPSLGEDTKEADLPEPSEKQPA
310 320 330 340 350 360
370 380 390 400 410 420
pF1KE2 AAPRGKPVSRVPQLKARMVSKSKDGTGSDDKKAKTSTRSSAKTLKNRPCLSPKHPTPGSS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 AAPRGKPVSRVPQLKARMVSKSKDGTGSDDKKAKTSTRSSAKTLKNRPCLSPKHPTPGSS
370 380 390 400 410 420
430 440 450 460 470 480
pF1KE2 DPLIQPSSPAVCPEPPSSPKYVSSVTSRTGSSGAKEMKLKGADGKTKIATPRGAAPPGQK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 DPLIQPSSPAVCPEPPSSPKYVSSVTSRTGSSGAKEMKLKGADGKTKIATPRGAAPPGQK
430 440 450 460 470 480
490 500 510 520
pF1KE2 GQANATRIPAKTPPAPKTPPSS------------------GEPPKSGDRSGYSSPGSPGT
::::::::::::::::::::::                  ::::::::::::::::::::
CCDS45 GQANATRIPAKTPPAPKTPPSSATKQVQRRPPPAGPRSERGEPPKSGDRSGYSSPGSPGT
490 500 510 520 530 540
530 540 550 560 570 580
pF1KE2 PGSRSRTPSLPTPPTREPKKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKIGSTEN
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 PGSRSRTPSLPTPPTREPKKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKIGSTEN
550 560 570 580 590 600
590 600 610 620 630 640
pF1KE2 LKHQPGGGKVQIINKKLDLSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKVTSKCGSL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 LKHQPGGGKVQIINKKLDLSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKVTSKCGSL
610 620 630 640 650 660
650 660 670 680 690 700
pF1KE2 GNIHHKPGGGQVEVKSEKLDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRENAKAK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 GNIHHKPGGGQVEVKSEKLDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRENAKAK
670 680 690 700 710 720
710 720 730 740 750
pF1KE2 TDHGAEIVYKSPVVSGDTSPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQGL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 TDHGAEIVYKSPVVSGDTSPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQGL
730 740 750 760 770
>>CCDS11499.1 MAPT gene_id:4137|Hs109|chr17 (441 aa)
initn: 2816 init1: 2002 opt: 2013 Z-score: 924.4 bits: 181.0 E(33420): 3.7e-45
Smith-Waterman score: 2271; 58.2% identity (58.2% similar) in 758 aa overlap (1-758:1-441)
10 20 30 40 50 60
pF1KE2 MAEPRQEFEVMEDHAGTYGLGDRKDQGGYTMHQDQEGDTDAGLKESPLQTPTEDGSEEPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MAEPRQEFEVMEDHAGTYGLGDRKDQGGYTMHQDQEGDTDAGLKESPLQTPTEDGSEEPG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE2 SETSDAKSTPTAEDVTAPLVDEGAPGKQAAAQPHTEIPEGTTAEEAGIGDTPSLEDEAAG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 SETSDAKSTPTAEDVTAPLVDEGAPGKQAAAQPHTEIPEGTTAEEAGIGDTPSLEDEAAG
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE2 HVTQEPESGKVVQEGFLREPGPPGLSHQLMSGMPGAPLLPEGPREATRQPSGTGPEDTEG
::::
CCDS11 HVTQ--------------------------------------------------------
190 200 210 220 230 240
pF1KE2 GRHAPELLKHQLLGDLHQEGPPLKGAGGKERPGSKEEVDEDRDVDESSPQDSPPSKASPA
CCDS11 ------------------------------------------------------------
250 260 270 280 290 300
pF1KE2 QDGRPPQTAAREATSIPGFPAEGAIPLPVDFLSKVSTEIPASEPDGPSVGRAKGQDAPLE
CCDS11 ------------------------------------------------------------
310 320 330 340 350 360
pF1KE2 FTFHVEITPNVQKEQAHSEEHLGRAAFPGAPGEGPEARGPSLGEDTKEADLPEPSEKQPA
CCDS11 ------------------------------------------------------------
370 380 390 400 410 420
pF1KE2 AAPRGKPVSRVPQLKARMVSKSKDGTGSDDKKAKTSTRSSAKTLKNRPCLSPKHPTPGSS
               :::::::::::::::::::
CCDS11 ---------------ARMVSKSKDGTGSDDKKAK--------------------------
130 140
430 440 450 460 470 480
pF1KE2 DPLIQPSSPAVCPEPPSSPKYVSSVTSRTGSSGAKEMKLKGADGKTKIATPRGAAPPGQK
                                        ::::::::::::::::::::
CCDS11 ----------------------------------------GADGKTKIATPRGAAPPGQK
150 160
490 500 510 520 530 540
pF1KE2 GQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSPGSPGTPGSRSRTPSLPTPPTREP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 GQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSPGSPGTPGSRSRTPSLPTPPTREP
170 180 190 200 210 220
550 560 570 580 590 600
pF1KE2 KKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKIGSTENLKHQPGGGKVQIINKKLD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 KKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKIGSTENLKHQPGGGKVQIINKKLD
230 240 250 260 270 280
610 620 630 640 650 660
pF1KE2 LSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKVTSKCGSLGNIHHKPGGGQVEVKSEK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 LSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKVTSKCGSLGNIHHKPGGGQVEVKSEK
290 300 310 320 330 340
670 680 690 700 710 720
pF1KE2 LDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRENAKAKTDHGAEIVYKSPVVSGDT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 LDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRENAKAKTDHGAEIVYKSPVVSGDT
350 360 370 380 390 400
730 740 750
pF1KE2 SPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQGL
::::::::::::::::::::::::::::::::::::::
CCDS11 SPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQGL
410 420 430 440
>>CCDS45716.1 MAPT gene_id:4137|Hs109|chr17 (412 aa)
initn: 2488 init1: 2002 opt: 2008 Z-score: 922.6 bits: 180.5 E(33420): 4.7e-45
Smith-Waterman score: 2012; 84.9% identity (88.9% similar) in 370 aa overlap (404-758:43-412)
380 390 400 410 420
pF1KE2 LKARMVSKSKDGTGSDDKKAKTSTRSSAKTLKNRPCLSPKHP---TPGS--SDPLIQPS-
::. : .: . ::: :: :.
CCDS45 DHAGTYGLGDRKDQGGYTMHQDQEGDTDAGLKESPLQTPTEDGSEEPGSETSDAKSTPTA
20 30 40 50 60 70
430 440 450 460 470
pF1KE2 ---------SPAVCPEPPSSPKYVSSVTSRTGSSGAKEMKLKGADGKTKIATPRGAAPPG
.:.. : . . :.. ..:. . : :::::::::::::::::::
CCDS45 EAEEAGIGDTPSLEDEAAGHVTQARMVSKSKDGTGSDDKKAKGADGKTKIATPRGAAPPG
80 90 100 110 120 130
480 490 500 510 520 530
pF1KE2 QKGQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSPGSPGTPGSRSRTPSLPTPPTR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 QKGQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSPGSPGTPGSRSRTPSLPTPPTR
140 150 160 170 180 190
540 550 560 570 580 590
pF1KE2 EPKKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKIGSTENLKHQPGGGKVQIINKK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 EPKKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKIGSTENLKHQPGGGKVQIINKK
200 210 220 230 240 250
600 610 620 630 640 650
pF1KE2 LDLSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKVTSKCGSLGNIHHKPGGGQVEVKS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 LDLSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKVTSKCGSLGNIHHKPGGGQVEVKS
260 270 280 290 300 310
660 670 680 690 700 710
pF1KE2 EKLDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRENAKAKTDHGAEIVYKSPVVSG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 EKLDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRENAKAKTDHGAEIVYKSPVVSG
320 330 340 350 360 370
720 730 740 750
pF1KE2 DTSPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQGL
::::::::::::::::::::::::::::::::::::::::
CCDS45 DTSPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQGL
380 390 400 410
>>CCDS11500.1 MAPT gene_id:4137|Hs109|chr17 (383 aa)
initn: 2296 init1: 2002 opt: 2007 Z-score: 922.6 bits: 180.4 E(33420): 4.7e-45
Smith-Waterman score: 2007; 91.8% identity (95.2% similar) in 331 aa overlap (428-758:53-383)
400 410 420 430 440 450
pF1KE2 RSSAKTLKNRPCLSPKHPTPGSSDPLIQPSSPAVCPEPPSSPKYVSSVTSRTGSSGAKEM
.:.. : . . :.. ..:. .
CCDS11 RKDQGGYTMHQDQEGDTDAGLKAEEAGIGDTPSLEDEAAGHVTQARMVSKSKDGTGSDDK
30 40 50 60 70 80
460 470 480 490 500 510
pF1KE2 KLKGADGKTKIATPRGAAPPGQKGQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSP
: ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 KAKGADGKTKIATPRGAAPPGQKGQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSP
90 100 110 120 130 140
520 530 540 550 560 570
pF1KE2 GSPGTPGSRSRTPSLPTPPTREPKKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 GSPGTPGSRSRTPSLPTPPTREPKKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKI
150 160 170 180 190 200
580 590 600 610 620 630
pF1KE2 GSTENLKHQPGGGKVQIINKKLDLSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKVTS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 GSTENLKHQPGGGKVQIINKKLDLSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKVTS
210 220 230 240 250 260
640 650 660 670 680 690
pF1KE2 KCGSLGNIHHKPGGGQVEVKSEKLDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 KCGSLGNIHHKPGGGQVEVKSEKLDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRE
270 280 290 300 310 320
700 710 720 730 740 750
pF1KE2 NAKAKTDHGAEIVYKSPVVSGDTSPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 NAKAKTDHGAEIVYKSPVVSGDTSPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQG
330 340 350 360 370 380
pF1KE2 L
:
CCDS11 L
>>CCDS56033.1 MAPT gene_id:4137|Hs109|chr17 (410 aa)
initn: 1990 init1: 1070 opt: 1189 Z-score: 556.8 bits: 112.9 E(33420): 1.1e-24
Smith-Waterman score: 1993; 54.1% identity (54.1% similar) in 758 aa overlap (1-758:1-410)
10 20 30 40 50 60
pF1KE2 MAEPRQEFEVMEDHAGTYGLGDRKDQGGYTMHQDQEGDTDAGLKESPLQTPTEDGSEEPG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 MAEPRQEFEVMEDHAGTYGLGDRKDQGGYTMHQDQEGDTDAGLKESPLQTPTEDGSEEPG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE2 SETSDAKSTPTAEDVTAPLVDEGAPGKQAAAQPHTEIPEGTTAEEAGIGDTPSLEDEAAG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 SETSDAKSTPTAEDVTAPLVDEGAPGKQAAAQPHTEIPEGTTAEEAGIGDTPSLEDEAAG
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE2 HVTQEPESGKVVQEGFLREPGPPGLSHQLMSGMPGAPLLPEGPREATRQPSGTGPEDTEG
::::
CCDS56 HVTQ--------------------------------------------------------
190 200 210 220 230 240
pF1KE2 GRHAPELLKHQLLGDLHQEGPPLKGAGGKERPGSKEEVDEDRDVDESSPQDSPPSKASPA
CCDS56 ------------------------------------------------------------
250 260 270 280 290 300
pF1KE2 QDGRPPQTAAREATSIPGFPAEGAIPLPVDFLSKVSTEIPASEPDGPSVGRAKGQDAPLE
CCDS56 ------------------------------------------------------------
310 320 330 340 350 360
pF1KE2 FTFHVEITPNVQKEQAHSEEHLGRAAFPGAPGEGPEARGPSLGEDTKEADLPEPSEKQPA
CCDS56 ------------------------------------------------------------
370 380 390 400 410 420
pF1KE2 AAPRGKPVSRVPQLKARMVSKSKDGTGSDDKKAKTSTRSSAKTLKNRPCLSPKHPTPGSS
               :::::::::::::::::::
CCDS56 ---------------ARMVSKSKDGTGSDDKKAK--------------------------
130 140
430 440 450 460 470 480
pF1KE2 DPLIQPSSPAVCPEPPSSPKYVSSVTSRTGSSGAKEMKLKGADGKTKIATPRGAAPPGQK
                                        ::::::::::::::::::::
CCDS56 ----------------------------------------GADGKTKIATPRGAAPPGQK
150 160
490 500 510 520 530 540
pF1KE2 GQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSPGSPGTPGSRSRTPSLPTPPTREP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 GQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSPGSPGTPGSRSRTPSLPTPPTREP
170 180 190 200 210 220
550 560 570 580 590 600
pF1KE2 KKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKIGSTENLKHQPGGGKVQIINKKLD
::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 KKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKIGSTENLKHQPGGGKVQI------
230 240 250 260 270
610 620 630 640 650 660
pF1KE2 LSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKVTSKCGSLGNIHHKPGGGQVEVKSEK
                         :::::::::::::::::::::::::::::::::::
CCDS56 -------------------------VYKPVDLSKVTSKCGSLGNIHHKPGGGQVEVKSEK
280 290 300 310
670 680 690 700 710 720
pF1KE2 LDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRENAKAKTDHGAEIVYKSPVVSGDT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 LDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRENAKAKTDHGAEIVYKSPVVSGDT
320 330 340 350 360 370
730 740 750
pF1KE2 SPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQGL
::::::::::::::::::::::::::::::::::::::
CCDS56 SPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQGL
380 390 400 410
>>CCDS11502.1 MAPT gene_id:4137|Hs109|chr17 (352 aa)
initn: 1470 init1: 1070 opt: 1183 Z-score: 555.1 bits: 112.3 E(33420): 1.4e-24
Smith-Waterman score: 1729; 82.5% identity (85.8% similar) in 331 aa overlap (428-758:53-352)
400 410 420 430 440 450
pF1KE2 RSSAKTLKNRPCLSPKHPTPGSSDPLIQPSSPAVCPEPPSSPKYVSSVTSRTGSSGAKEM
.:.. : . . :.. ..:. .
CCDS11 RKDQGGYTMHQDQEGDTDAGLKAEEAGIGDTPSLEDEAAGHVTQARMVSKSKDGTGSDDK
30 40 50 60 70 80
460 470 480 490 500 510
pF1KE2 KLKGADGKTKIATPRGAAPPGQKGQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSP
: ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 KAKGADGKTKIATPRGAAPPGQKGQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSP
90 100 110 120 130 140
520 530 540 550 560 570
pF1KE2 GSPGTPGSRSRTPSLPTPPTREPKKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 GSPGTPGSRSRTPSLPTPPTREPKKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKI
150 160 170 180 190 200
580 590 600 610 620 630
pF1KE2 GSTENLKHQPGGGKVQIINKKLDLSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKVTS
:::::::::::::::::                               ::::::::::::
CCDS11 GSTENLKHQPGGGKVQI-------------------------------VYKPVDLSKVTS
210 220 230
640 650 660 670 680 690
pF1KE2 KCGSLGNIHHKPGGGQVEVKSEKLDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 KCGSLGNIHHKPGGGQVEVKSEKLDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRE
240 250 260 270 280 290
700 710 720 730 740 750
pF1KE2 NAKAKTDHGAEIVYKSPVVSGDTSPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 NAKAKTDHGAEIVYKSPVVSGDTSPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQG
300 310 320 330 340 350
pF1KE2 L
:
CCDS11 L
>>CCDS33369.1 MAP2 gene_id:4133|Hs109|chr2 (559 aa)
initn: 995 init1: 918 opt: 1136 Z-score: 531.3 bits: 108.6 E(33420): 2.9e-23
Smith-Waterman score: 1146; 40.8% identity (64.7% similar) in 573 aa overlap (219-758:3-559)
190 200 210 220 230 240
pF1KE2 KHQLLGDLHQEGPPLKGAGGKERPGSKEEVDEDRDVDESSPQDSPPSKASPAQDGRPP--
:: .: .. : : . :.. .::
CCDS33 MADERKDEAKAPHWTSAPLTEASAHS-HPPEI
10 20 30
250 260 270 280 290
pF1KE2 --QTAAREAT--SIPGFP----AEGAIPLPVDFLSKVSTEIPASEPDGPSVGRAKGQDAP
: .: :. : ::: :::. . . .:. . . . :. : ....
CCDS33 KDQGGAGEGLVRSANGFPYREDEEGAFGEHGSQGTYSNTKENGINGELTSADRETAEEVS
40 50 60 70 80 90
300 310 320 330 340 350
pF1KE2 LEFTFHV--EITPNVQKEQAHSEEHLGR-AAFPGAPGEG---PEARGPSLGEDTKEADLP
... : : . .. :: . .: . ::.: : : : . :: . . . . .
CCDS33 ARIVQVVTAEAVAVLKGEQEKEAQHKDQTAALPLAAEETANLPPSPPPSPASE-QTVTVE
100 110 120 130 140 150
360 370 380 390 400 410
pF1KE2 EPSEKQPAAAPRGKPVSRVPQLKARMVSKSKDGTGSDDKKAKTSTRSSA-KTLKNRPCLS
: . . : :: : : : .. ... . . . ..:. ::: . .: .:
CCDS33 EAAGGESALAP-----SVFKQAKDKVSNSTLSKIPALQGSTKSPRYSSACPSTTKRATFS
160 170 180 190 200
420 430 440 450 460
pF1KE2 PK---HPTP-GSSDPLIQPSSP-----AVCPEPPSS-PKYVSSVTSRTGSSGAKEMKLKG
. .:: ::.: : .: . :: :: :. : . : : :: .. . .
CCDS33 DSLLIQPTSAGSTDRLPYSKSGNKDGVTKSPEKRSSLPRPSSILPPRRGVSGDRDENSFS
210 220 230 240 250 260
470 480 490 500 510 520
pF1KE2 ADGKTKIATPRGA-APPGQKGQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSPGSP
... . .. : . . : ... ..: :. :: . : : ::. ..:. ::.:
CCDS33 LNSSISSSARRTTRSEPIRRAGKSGTSTPT-TPGSTAITP--GTPPSYSSRT----PGTP
270 280 290 300 310
530 540 550 560 570
pF1KE2 GTPGSRSRTPSLPTPPTRE---P--KKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKS
::: : ::: : : : ::::..:::::::.. : .:. :.::::::::
CCDS33 GTP-SYPRTPHTPGTPKSAILVPSEKKVAIIRTPPKSPATPK-QLRLINQPLPDLKNVKS
320 330 340 350 360 370
580 590 600 610 620 630
pF1KE2 KIGSTENLKHQPGGGKVQIINKKLDLSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKV
:::::.:.:.:: ::.:.:.:::.:.:.:::.::::::::: :::.:::: : .:::.:
CCDS33 KIGSTDNIKYQPKGGQVRILNKKIDFSKVQSRCGSKDNIKHSAGGGNVQIVTKKIDLSHV
380 390 400 410 420 430
640 650 660 670 680 690
pF1KE2 TSKCGSLGNIHHKPGGGQVEVKSEKLDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTF
::::::: ::.:.::::.:...: :::::...:.:.::::: ::::::: ::...::.:
CCDS33 TSKCGSLKNIRHRPGGGRVKIESVKLDFKEKAQAKVGSLDNAHHVPGGGNVKIDSQKLNF
440 450 460 470 480 490
700 710 720 730 740 750
pF1KE2 RENAKAKTDHGAEIVYKSPVVSGDTSPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAK
::.:::..::::::. .:: :. .:::.::::::.:::....::::::::..:.:.:::
CCDS33 REHAKARVDHGAEIITQSPGRSSVASPRRLSNVSSSGSINLLESPQLATLAEDVTAALAK
500 510 520 530 540 550
pF1KE2 QGL
:::
CCDS33 QGL
>>CCDS2385.1 MAP2 gene_id:4133|Hs109|chr2 (471 aa)
initn: 821 init1: 711 opt: 839 Z-score: 399.6 bits: 84.0 E(33420): 6.3e-16
Smith-Waterman score: 899; 40.2% identity (62.0% similar) in 502 aa overlap (297-758:17-471)
270 280 290 300 310 320
pF1KE2 LPVDFLSKVSTEIPASEPDGPSVGRAKGQDAPL-EFTFHVEITPNVQKEQAHSEEHLGRA
::: : . : . :.. :.:. . : : :.
CCDS23 MADERKDEAKAPHWTSAPLTEASAHSH-PPEI-KDQGGAGEGLVRS
10 20 30 40
330 340 350 360 370
pF1KE2 A--FPGAPGE-GPEARGPSLGE--DTKEADLPEPSEKQPAAAPRGKPVS-RVPQL-KARM
: :: : : .. : : .::: . .: : .. :: :. :. :.
CCDS23 ANGFPYREDEEGAFGEHGSQGTYSNTKENGI--NGELTSADRETAEEVSARIVQVVTAEA
50 60 70 80 90 100
380 390 400 410 420
pF1KE2 VSKSKDGTGSDDKKAKTSTRSSAKTL--KNRPCLSPKHP-TPGSSDPL-----------I
:. : : ..:.:. . ...: : .. : :. : .:.: . . .
CCDS23 VAVLK---GEQEKEAQHKDQTAALPLAAEETANLPPSPPPSPASEQTVTVEEAAGGESAL
110 120 130 140 150
430 440 450 460 470
pF1KE2 QPS---------SPAVC--PEPPSS-PKYVSSVTSRTGSSGAKEMKLKGADGKTKIATPR
:: : .: :: :: :. : . : : :: .. . . ... . .. :
CCDS23 APSVFKQAKDKVSDGVTKSPEKRSSLPRPSSILPPRRGVSGDRDENSFSLNSSISSSARR
160 170 180 190 200 210
480 490 500 510 520 530
pF1KE2 GA-APPGQKGQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSPGSPGTPGSRSRTPS
. . : ... ..: :. :: . : : ::. ..:. ::.:::: : :::
CCDS23 TTRSEPIRRAGKSGTSTPT-TPGSTAITP--GTPPSYSSRT----PGTPGTP-SYPRTPH
220 230 240 250 260 270
540 550 560 570 580
pF1KE2 LPTPPTRE---P--KKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKIGSTENLKHQ
: : : ::::..:::::::.. : .:. :.:::::::::::::.:.:.:
CCDS23 TPGTPKSAILVPSEKKVAIIRTPPKSPATPK-QLRLINQPLPDLKNVKSKIGSTDNIKYQ
280 290 300 310 320 330
590 600 610 620 630 640
pF1KE2 PGGGKVQIINKKLDLSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKVTSKCGSLGNIH
: ::.:::..::                               .:::.:::::::: ::.
CCDS23 PKGGQVQIVTKK-------------------------------IDLSHVTSKCGSLKNIR
340 350
650 660 670 680 690 700
pF1KE2 HKPGGGQVEVKSEKLDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRENAKAKTDHG
:.::::.:...: :::::...:.:.::::: ::::::: ::...::.:::.:::..:::
CCDS23 HRPGGGRVKIESVKLDFKEKAQAKVGSLDNAHHVPGGGNVKIDSQKLNFREHAKARVDHG
360 370 380 390 400 410
710 720 730 740 750
pF1KE2 AEIVYKSPVVSGDTSPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQGL
:::. .:: :. .:::.::::::.:::....::::::::..:.:.::::::
CCDS23 AEIITQSPGRSSVASPRRLSNVSSSGSINLLESPQLATLAEDVTAALAKQGL
420 430 440 450 460 470
>>CCDS86916.1 MAP2 gene_id:4133|Hs109|chr2 (1823 aa)
initn: 821 init1: 711 opt: 847 Z-score: 395.0 bits: 85.1 E(33420): 1.1e-15
Smith-Waterman score: 945; 33.4% identity (56.7% similar) in 746 aa overlap (34-758:1179-1823)
10 20 30 40 50 60
pF1KE2 PRQEFEVMEDHAGTYGLGDRKDQGGYTMHQDQEGDTDAGLKESPLQTPTEDGSEEPGSET
: : : .. .: : :...: :
CCDS86 ETSPESSLIQDEIAVKLSVEIPCPPAVSEADLATDERADVQMEFIQGPKEESKETP----
1150 1160 1170 1180 1190 1200
70 80 90 100 110 120
pF1KE2 SDAKSTPTAEDVTAPLVDEGAPGKQAAAQPHTEIPEGTTAEEAGIGDTPSLEDEAAGHVT
: . ::. ::. :: : .. : : . : :. . . . .:. .:
CCDS86 -DISITPS--DVAEPL-HETIVSEPAEIQSEEEEIEAQGEYDKLLFRSDTLQ------IT
1210 1220 1230 1240 1250
130 140 150 160 170 180
pF1KE2 QEPESGKVVQEGFLREPGPPGLSHQLMSGMPGAPLLPEGPREATRQPSGTGPEDTEGGRH
. :: ..: :. : : :. :. . . : .. : : .. :.: :
CCDS86 DLGVSG--AREEFV-ETCPS--EHK---GVIESVVTIEDDFITVVQ---TTTDEGESGSH
1260 1270 1280 1290 1300
190 200 210 220 230 240
pF1KE2 APELLKHQLLGDLHQEGPPLKGAGGKERPGSKEEVDEDRDVDESSPQDSPPSKASPAQDG
. .. : : : . ..::. ..: :. .:.:.. .. :. .::
CCDS86 S---VRFAAL-----EQPEV-----ERRPSPHDE--EEFEVEEAAEAQAEPKDGSP----
1310 1320 1330 1340
250 260 270 280 290 300
pF1KE2 RPPQTAAREATSIPGFPAEGAIPLPVDFLSKVSTEIPASEPDGPSVGRAKGQDAPLEFTF
. : . :: ... . .: :. : : : : :. :. .: .:
CCDS86 EAPASPEREEVALSEYKTETYD----DY--KDETTIDDSIMDADSLWVDTQDDDRSIMTE
1350 1360 1370 1380 1390
310 320 330 340 350 360
pF1KE2 HVEITPNVQKEQAHSEEHLGRAAFPGAPGEGPEARGPS-LGEDTKEADLPEPSEKQPAAA
..: : ..:.:..: . :... : : : . .. ... ::: . .
CCDS86 QLETIP--KEEKAEKEAR--RSSLEKHRKEKPFKTGRGRISTPERKVAKKEPSTVSRDEV
1400 1410 1420 1430 1440 1450
370 380 390 400 410 420
pF1KE2 PRGKPVSRVPQLKARMVSKSKDGTGSDDKK--AKTSTRSSAKTLKNRPCLSPKHPTPGSS
: : : . ::....:.. . : ..: : . . . : . :.. : . :.
CCDS86 RRKKAVYK----KAELAKKTEVQAHSPSRKFILKPAIKYTRPT--HLSCVKRKTTAAGGE
1460 1470 1480 1490 1500
430 440 450 460
pF1KE2 DPLIQPS---------SPAVC--PEPPSS-PKYVSSVTSRTGSSGAKEMKLKGADGKTKI
. : :: : .: :: :: :. : . : : :: .. . . ... .
CCDS86 SALA-PSVFKQAKDKVSDGVTKSPEKRSSLPRPSSILPPRRGVSGDRDENSFSLNSSISS
1510 1520 1530 1540 1550 1560
470 480 490 500 510 520
pF1KE2 ATPRGA-APPGQKGQANATRIPAKTPPAPKTPPSSGEPPKSGDRSGYSSPGSPGTPGSRS
.. : . . : ... ..: :. :: . : : ::. ..:. ::.:::: :
CCDS86 SARRTTRSEPIRRAGKSGTSTPT-TPGSTAITP--GTPPSYSSRT----PGTPGTP-SYP
1570 1580 1590 1600 1610
530 540 550 560 570 580
pF1KE2 RTPSLPTPPTRE---P--KKVAVVRTPPKSPSSAKSRLQTAPVPMPDLKNVKSKIGSTEN
::: : : : ::::..:::::::.. : .:. :.:::::::::::::.:
CCDS86 RTPHTPGTPKSAILVPSEKKVAIIRTPPKSPATPK-QLRLINQPLPDLKNVKSKIGSTDN
1620 1630 1640 1650 1660 1670
590 600 610 620 630 640
pF1KE2 LKHQPGGGKVQIINKKLDLSNVQSKCGSKDNIKHVPGGGSVQIVYKPVDLSKVTSKCGSL
.:.:: ::.:::..::                               .:::.::::::::
CCDS86 IKYQPKGGQVQIVTKK-------------------------------IDLSHVTSKCGSL
1680 1690 1700
650 660 670 680 690 700
pF1KE2 GNIHHKPGGGQVEVKSEKLDFKDRVQSKIGSLDNITHVPGGGNKKIETHKLTFRENAKAK
::.:.::::.:...: :::::...:.:.::::: ::::::: ::...::.:::.:::.
CCDS86 KNIRHRPGGGRVKIESVKLDFKEKAQAKVGSLDNAHHVPGGGNVKIDSQKLNFREHAKAR
1710 1720 1730 1740 1750 1760
710 720 730 740 750
pF1KE2 TDHGAEIVYKSPVVSGDTSPRHLSNVSSTGSIDMVDSPQLATLADEVSASLAKQGL
.::::::. .:: :. .:::.::::::.:::....::::::::..:.:.::::::
CCDS86 VDHGAEIITQSPGRSSVASPRRLSNVSSSGSINLLESPQLATLAEDVTAALAKQGL
1770 1780 1790 1800 1810 1820
758 residues in 1 query sequences
18921897 residues in 33420 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Feb 19 20:46:04 2021 done: Fri Feb 19 20:46:05 2021
Total Scan time: 2.890 Total Display time: 0.140
Function used was FASTA [36.3.4 Apr, 2011]