FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KSDA1994, 1073 aa
  1>>>pF1KSDA1994 1073 - 1073 aa - 1073 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 12.4050+/-0.00111; mu= -8.3442+/- 0.067
 mean_var=431.0705+/-89.727, 0's: 0 Z-trim(114.9): 76 B-trim: 3 in 1/53
 Lambda= 0.061773
 statistics sampled from 15406 (15473) to 15406 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.749), E-opt: 0.2 (0.475), width: 16
 Scan time: 6.020

The best scores are:                                      opt  bits E(32554)
CCDS31966.1 TSC22D1 gene_id:8848|Hs108|chr13       (1073) 6910 630.9 4.5e-180
CCDS58291.1 TSC22D1 gene_id:8848|Hs108|chr13       ( 570) 3592 335.0 2.9e-91
CCDS9392.1 TSC22D1 gene_id:8848|Hs108|chr13        ( 144)  653  72.6 7.1e-13
CCDS3149.1 TSC22D2 gene_id:9819|Hs108|chr3         ( 780)  613  69.6 3.1e-11

>>CCDS31966.1 TSC22D1 gene_id:8848|Hs108|chr13 (1073 aa)
 initn: 6910 init1: 6910 opt: 6910 Z-score: 3346.3 bits: 630.9 E(32554): 4.5e-180
Smith-Waterman score: 6910; 99.9% identity (100.0% similar) in 1073 aa overlap (1-1073:1-1073)

               10        20        30        40        50        60
pF1KSD MHQPPESTAAAAAAADISARKMAHPAMFPRRGSGSGSASALNAAGTGVGSNATSSEDFPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 MHQPPESTAAAAAAADISARKMAHPAMFPRRGSGSGSASALNAAGTGVGSNATSSEDFPP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KSD PSLLQPPPPAASSTSGPQPPPPQSLNLLSQAQLQAQPLAPGGTQMKKKSGFQITSVTPAQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 PSLLQPPPPAASSTSGPQPPPPQSLNLLSQAQLQAQPLAPGGTQMKKKSGFQITSVTPAQ
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KSD ISASISSNNSIAEDTESYDDLDESHTEDLSSSEILDVSLSRATDLGEPERSSSEETLNNF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 ISASISSNNSIAEDTESYDDLDESHTEDLSSSEILDVSLSRATDLGEPERSSSEETLNNF
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KSD QEAETPGAVSPNQPHLPQPHLPHLPQQNVVINGNAHPHHLHHHHQIHHGHHLQHGHHHPS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 QEAETPGAVSPNQPHLPQPHLPHLPQQNVVINGNAHPHHLHHHHQIHHGHHLQHGHHHPS
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KSD HVAVASASITGGPPSSPVSRKLSTTGSSDSITPVAPTSAVSSSGSPASVMTNMRAPSTTG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 HVAVASASITGGPPSSPVSRKLSTTGSSDSITPVAPTSAVSSSGSPASVMTNMRAPSTTG
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KSD GIGINSVTGTSTVNNVNITAVGSFNPNVTSSILGNVNISTSNIPSAAGVSVGPGVTSGVN
       :::::::::::::::::::::::::::::::.::::::::::::::::::::::::::::
CCDS31 GIGINSVTGTSTVNNVNITAVGSFNPNVTSSMLGNVNISTSNIPSAAGVSVGPGVTSGVN
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KSD VNILSGMGNGTISSSAAVSSVPNAAAGMTGGSVSSQQQQPTVNTSRFRVVKLDSSSEPFK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 VNILSGMGNGTISSSAAVSSVPNAAAGMTGGSVSSQQQQPTVNTSRFRVVKLDSSSEPFK
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KSD KGRWTCTEFYEKENAVPATEGVLINKVVETVKQNPIEVTSERESTSGSSVSSSVSTLSHY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 KGRWTCTEFYEKENAVPATEGVLINKVVETVKQNPIEVTSERESTSGSSVSSSVSTLSHY
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KSD TESVGSGEMGAPTVVVQQQQQQQQQQQQQPALQGVTLQQMDFGSTGPQSIPAVSIPQSIS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 TESVGSGEMGAPTVVVQQQQQQQQQQQQQPALQGVTLQQMDFGSTGPQSIPAVSIPQSIS
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KSD QSQISQVQLQSQELSYQQKQGLQPVPLQATMSAATGIQPSPVNVVGVTSALGQQPSISSL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 QSQISQVQLQSQELSYQQKQGLQPVPLQATMSAATGIQPSPVNVVGVTSALGQQPSISSL
              550       560       570       580       590       600

              610       620       630       640       650       660
pF1KSD AQPQLPYSQAAPPVQTPLPGAPPPQQLQYGQQQPMVSTQMAPGHVKSVTQNPASEYVQQQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 AQPQLPYSQAAPPVQTPLPGAPPPQQLQYGQQQPMVSTQMAPGHVKSVTQNPASEYVQQQ
              610       620       630       640       650       660

              670       680       690       700       710       720
pF1KSD PILQTAMSSGQPSSAGVGAGTTVIPVAQPQGIQLPVQPTAVPAQPAGASVQPVGQAPAAV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 PILQTAMSSGQPSSAGVGAGTTVIPVAQPQGIQLPVQPTAVPAQPAGASVQPVGQAPAAV
              670       680       690       700       710       720

              730       740       750       760       770       780
pF1KSD SAVPTGSQIANIGQQANIPTAVQQPSTQVPPSVIQQGAPPSSQVVPPAQTGIIHQGVQTS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 SAVPTGSQIANIGQQANIPTAVQQPSTQVPPSVIQQGAPPSSQVVPPAQTGIIHQGVQTS
              730       740       750       760       770       780

              790       800       810       820       830       840
pF1KSD APSLPQQLVIASQSSLLTVPPQPQGVEPVAQGIVSQQLPAVSSLPSASSISVTSQVSSTG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 APSLPQQLVIASQSSLLTVPPQPQGVEPVAQGIVSQQLPAVSSLPSASSISVTSQVSSTG
              790       800       810       820       830       840

              850       860       870       880       890       900
pF1KSD PSGMPSAPTNLVPPQNIAQTPATQNGNLVQSVSQPPLIATNTNLPLAQQIPLSSTQFSAQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 PSGMPSAPTNLVPPQNIAQTPATQNGNLVQSVSQPPLIATNTNLPLAQQIPLSSTQFSAQ
              850       860       870       880       890       900

              910       920       930       940       950       960
pF1KSD SLAQAIGSQIEDARRAAEPSLVGLPQTISGDSGGMSAVSDGSSSSLAASASLFPLKVLPL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 SLAQAIGSQIEDARRAAEPSLVGLPQTISGDSGGMSAVSDGSSSSLAASASLFPLKVLPL
              910       920       930       940       950       960

              970       980       990      1000      1010      1020
pF1KSD TTPLVDGEDESSSGASVVAIDNKIEQAMDLVKSHLMYAVREEVEVLKEQIKELIEKNSQL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 TTPLVDGEDESSSGASVVAIDNKIEQAMDLVKSHLMYAVREEVEVLKEQIKELIEKNSQL
              970       980       990      1000      1010      1020

             1030      1040      1050      1060      1070
pF1KSD EQENNLLKTLASPEQLAQFQAQLQTGSPPATTQPQGTTQPPAQPASQGSGPTA
       :::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS31 EQENNLLKTLASPEQLAQFQAQLQTGSPPATTQPQGTTQPPAQPASQGSGPTA
             1030      1040      1050      1060      1070

>>CCDS58291.1 TSC22D1 gene_id:8848|Hs108|chr13 (570 aa)
 initn: 3592 init1: 3592 opt: 3592 Z-score: 1752.0 bits: 335.0 E(32554): 2.9e-91
Smith-Waterman score: 3592; 99.8% identity (100.0% similar) in 556 aa overlap (1-556:1-556)

               10        20        30        40        50        60
pF1KSD MHQPPESTAAAAAAADISARKMAHPAMFPRRGSGSGSASALNAAGTGVGSNATSSEDFPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 MHQPPESTAAAAAAADISARKMAHPAMFPRRGSGSGSASALNAAGTGVGSNATSSEDFPP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KSD PSLLQPPPPAASSTSGPQPPPPQSLNLLSQAQLQAQPLAPGGTQMKKKSGFQITSVTPAQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 PSLLQPPPPAASSTSGPQPPPPQSLNLLSQAQLQAQPLAPGGTQMKKKSGFQITSVTPAQ
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KSD ISASISSNNSIAEDTESYDDLDESHTEDLSSSEILDVSLSRATDLGEPERSSSEETLNNF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 ISASISSNNSIAEDTESYDDLDESHTEDLSSSEILDVSLSRATDLGEPERSSSEETLNNF
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KSD QEAETPGAVSPNQPHLPQPHLPHLPQQNVVINGNAHPHHLHHHHQIHHGHHLQHGHHHPS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 QEAETPGAVSPNQPHLPQPHLPHLPQQNVVINGNAHPHHLHHHHQIHHGHHLQHGHHHPS
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KSD HVAVASASITGGPPSSPVSRKLSTTGSSDSITPVAPTSAVSSSGSPASVMTNMRAPSTTG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 HVAVASASITGGPPSSPVSRKLSTTGSSDSITPVAPTSAVSSSGSPASVMTNMRAPSTTG
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KSD GIGINSVTGTSTVNNVNITAVGSFNPNVTSSILGNVNISTSNIPSAAGVSVGPGVTSGVN
       :::::::::::::::::::::::::::::::.::::::::::::::::::::::::::::
CCDS58 GIGINSVTGTSTVNNVNITAVGSFNPNVTSSMLGNVNISTSNIPSAAGVSVGPGVTSGVN
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KSD VNILSGMGNGTISSSAAVSSVPNAAAGMTGGSVSSQQQQPTVNTSRFRVVKLDSSSEPFK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 VNILSGMGNGTISSSAAVSSVPNAAAGMTGGSVSSQQQQPTVNTSRFRVVKLDSSSEPFK
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KSD KGRWTCTEFYEKENAVPATEGVLINKVVETVKQNPIEVTSERESTSGSSVSSSVSTLSHY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 KGRWTCTEFYEKENAVPATEGVLINKVVETVKQNPIEVTSERESTSGSSVSSSVSTLSHY
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KSD TESVGSGEMGAPTVVVQQQQQQQQQQQQQPALQGVTLQQMDFGSTGPQSIPAVSIPQSIS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS58 TESVGSGEMGAPTVVVQQQQQQQQQQQQQPALQGVTLQQMDFGSTGPQSIPAVSIPQSIS
              490       500       510       520       530       540

              550       560       570       580       590       600
pF1KSD QSQISQVQLQSQELSYQQKQGLQPVPLQATMSAATGIQPSPVNVVGVTSALGQQPSISSL
       ::::::::::::::::
CCDS58 QSQISQVQLQSQELSYLTMKVVLLIVYLCM
              550       560       570

>>CCDS9392.1 TSC22D1 gene_id:8848|Hs108|chr13 (144 aa)
 initn: 653 init1: 653 opt: 653 Z-score: 344.6 bits: 72.6 E(32554): 7.1e-13
Smith-Waterman score: 653; 99.0% identity (100.0% similar) in 105 aa overlap (969-1073:40-144)

      940       950       960       970       980       990
pF1KSD SDGSSSSLAASASLFPLKVLPLTTPLVDGEDESSSGASVVAIDNKIEQAMDLVKSHLMYA
                                     :.::::::::::::::::::::::::::::
CCDS93 AMDLGVYQLRHFSISFLSSLLGTENASVRLDNSSSGASVVAIDNKIEQAMDLVKSHLMYA
      10        20        30        40        50        60

     1000      1010      1020      1030      1040      1050
pF1KSD VREEVEVLKEQIKELIEKNSQLEQENNLLKTLASPEQLAQFQAQLQTGSPPATTQPQGTT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS93 VREEVEVLKEQIKELIEKNSQLEQENNLLKTLASPEQLAQFQAQLQTGSPPATTQPQGTT
      70        80        90       100       110       120

     1060      1070
pF1KSD QPPAQPASQGSGPTA
       :::::::::::::::
CCDS93 QPPAQPASQGSGPTA
     130       140

>>CCDS3149.1 TSC22D2 gene_id:9819|Hs108|chr3 (780 aa)
 initn: 794 init1: 337 opt: 613 Z-score: 315.3 bits: 69.6 E(32554): 3.1e-11
Smith-Waterman score: 1027; 32.2% identity (53.7% similar) in 1007 aa overlap (106-1071:7-775)

       80 90 100 110 120 130
pF1KSD GPQPPPPQSLNLLSQAQLQAQPLAPGGTQMKKKSGFQITSVTPAQISASISSNNSIAEDT
       :::: ::::::: ::...::. :::
CCDS31 MSKMPAKKKSCFQITSVTTAQVATSIT------EDT
       10 20 30

       140 150 160 170 180 190
pF1KSD ESYDDLDESHTEDLSSSEILDVSLSRATDLGEPE---RSSSEETLNNFQEAETPGAVSPN
       :: :: :::.:::.:: ::.:: ::::: : : :::::::::: .:::::.::::
CCDS31 ESLDDPDESRTEDVSS-EIFDV--SRATDYGPEEVCERSSSEETLNNVGDAETPGTVSPN
       40 50 60 70 80

       200 210 220 230 240 250
pF1KSD QPHLPQPHLPHLPQQNVVINGNAHPHHLHHHHQIHHGHHLQHGHHHPSHVAVASASITGG
       ....: :. .:.:.: .::
CCDS31 ----------------LLLDG-----------QL---------------AAAAAAPANGG
       90 100

       260 270 280 290 300 310
pF1KSD PPSSPVSRKLSTTGSSDSITPVAPTSAVSSSGSPASVMTNMRAPSTTGGIGINSVTGTST
       . :: . :..:. .: : :......:: :.. ::
CCDS31 ---GVVSAR-SVSGA------LASTLAAAATSAPA--------PGAPGG-----------
       110 120 130

       320 330 340 350 360 370
pF1KSD VNNVNITAVGSFNPNVTSSILGNVNISTSNIPSAAGVSVGPGVTSGVNVNILSGMGNGTI
       :. :: :.::
CCDS31 -------------------------------PQLAGSSAGP-------------------
       140

       380 390 400 410 420 430
pF1KSD SSSAAVSSVPNAAAGMTGGSVSSQQQQPTVNTSRFRVVKLD-SSSEPFKKGRWTCTEFYE
       :...:. : ::. .:::::.::: .:.::...::::: :.::
CCDS31 -----VTAAPS--------------QPPTTCSSRFRVIKLDHGSGEPYRRGRWTCMEYYE
       150 160 170 180

       440 450 460 470 480 490
pF1KSD KENAVPATEGVLINKVVETVKQNP-IEVTSERESTSGSSVSSSVSTLSHYTESVGSGEMG
       .. ... .... . .... .. :.::.: :.. .: : ... . . : : :
CCDS31 RD-----SDSSVLTRSGDCIRHSSTFDQTAERDSGLGATGGSVVVVVASMQGAHGP-ESG
       190 200 210 220 230 240

       500 510 520 530 540 550
pF1KSD APTVVVQQQQQQQQQQQQQPALQGVTLQQMDFGSTGPQSIPAVSIPQSISQSQISQVQLQ
       . . .. .: .....::. : ..:. :: : . ...::
CCDS31 TDSSLTAVSQLPPSEKMSQPT----PAQPQSFSVGQPQP-PPPPVGGAVAQS--------
       250 260 270 280

       560 570 580 590 600 610
pF1KSD SQELSYQQKQGLQPVPLQATMSAATGIQPSPVNVVGVTSALGQQPSISSLAQPQLPYSQA
       : : . : :. :.:: ::. .. : : :. .:::: . .
CCDS31 SAPLPPFPGAATGPQPM---MAAAQPSQPQGAGPGGQT----LPPTNVTLAQPAM----S
       290 300 310 320 330

       620 630 640 650 660
pF1KSD APPVQTPLPGAPPPQQ-LQYGQQQPMVSTQMAPGHVKSVTQNPASEYVQQQPILQTAMSS
       :: : ::: :: :.. :: :. :::. : . :::.::. :
CCDS31 LPPQPGPAVGAPAAQQPQQFAYPQP----QIPPGHLLPVQPSGQSEYLQQHVAGLQPPSP
       340 350 360 370 380 390

       670 680 690 700 710 720
pF1KSD GQPSSAGVGAG---TTVIPVAQPQGIQLPVQPTAVPAQPAGASVQPVGQAPAAVSAVPTG
       .::::.:..:. ....::. :. ..: :: ::: :: ..: : .. :
CCDS31 AQPSSTGAAASPATAATLPVGTGQNA------SSVGAQLMGASSQP-SEAMAPRTGPAQG
       400 410 420 430 440

       730 740 750 760 770 780
pF1KSD SQIANIGQQANIPTAVQQPSTQVPPSVIQQGAPPSSQVVPPAQTGIIHQGVQTSAPSLPQ
       .:.: : :: : :::... :. :: : : : :.: :
CCDS31 GQVA--------PC---QP-TGVPPATV--GG-----VVQPC-LGPAGAGQPQSVP--PP
       450 460 470 480

       790 800 810 820 830 840
pF1KSD QLVIASQSSLLTVPPQPQGVEPVAQGIVSQQLPAVSSLPSASSISVTSQVSSTGPSGMPS
       :. .... : .:: :..: : :. ..::. ::. :.:.:: . ::.
CCDS31 QM--GGSGPLSAVPGGPHAVVP---GV--PNVPAAVPAPSVPSVSTTSVT-------MPN
       490 500 510 520 530

       850 860 870 880 890 900
pF1KSD APTNLVPPQNIA-QTPATQNGNLVQSVSQPPLIATNTNLPLAQQIPLSS-TQFSAQSLAQ
       .:. :. :... .::........: :. : : . . : . .: :. .::..:. :
CCDS31 VPAPLAQSQQLSSHTPVSRSSSIIQHVGLP-LAPGTHSAPTS--LPQSDLSQFQTQT--Q
       540 550 560 570 580

       910 920 930 940 950
pF1KSD AIGSQIEDARRAAEPSLVGLPQT-ISGDSGGMSAVSDGSSSSLAASASLFPLK-----VL
       . .:..:.:: .:: ::: .: . . .:. ..::: .: :.. :.
CCDS31 PLVGQVDDTRRKSEP----LPQPPLSLIAENKPVVKPPVADSLANPLQLTPMNSLATSVF
       590 600 610 620 630 640

       960 970 980 990
pF1KSD PLTTPLVDGEDE------------------------SSSGASVVAIDNKIEQAMDLVKSH
       .. : :::... :.::..::::::::::::::::::
CCDS31 SIAIP-VDGDEDRNPSTAFYQAFHLNTLKESKSLWDSASGGGVVAIDNKIEQAMDLVKSH
       650 660 670 680 690 700

       1000 1010 1020 1030 1040 1050
pF1KSD LMYAVREEVEVLKEQIKELIEKNSQLEQENNLLKTLASPEQLAQFQAQLQTGSPPATTQP
       :::::::::::::::::::.:.:: ::.:: :::.:.: .::.:. . : ..: .:.:
CCDS31 LMYAVREEVEVLKEQIKELVERNSLLERENALLKSLSSNDQLSQLPT--QQANPGSTSQQ
       710 720 730 740 750

       1060 1070
pF1KSD QGTTQPPAQPASQGSGPTA
       :.. : ::.. . :
CCDS31 QAVIAQPPQPTQPPQQPNVSSA
       760 770 780

1073 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 07:57:13 2016 done: Thu Nov 3 07:57:14 2016
 Total Scan time: 6.020 Total Display time: 0.090

Function used was FASTA [36.3.4 Apr, 2011]