FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KSDA0669, 780 aa
  1>>>pF1KSDA0669 780 - 780 aa - 780 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 17.1311+/-0.000517; mu= -34.1922+/- 0.032
 mean_var=814.9057+/-174.061, 0's: 0 Z-trim(124.6): 42 B-trim: 878 in 1/60
 Lambda= 0.044928
 statistics sampled from 46619 (46678) to 46619 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.799), E-opt: 0.2 (0.547), width: 16
 Scan time: 12.110

The best scores are:                                       opt bits E(85289)
NP_904358 (OMIM: 607715) TSC22 domain family prote (1073)  613 55.6 1.3e-06
NP_006013 (OMIM: 607715) TSC22 domain family prote ( 144)  402 41.2 0.0037

>>NP_904358 (OMIM: 607715) TSC22 domain family protein 1 (1073 aa)
 initn: 794 init1: 337 opt: 613 Z-score: 239.5 bits: 55.6 E(85289): 1.3e-06
Smith-Waterman score: 967; 32.5% identity (58.5% similar) in 805 aa overlap (71-775:312-1071)

       50 60 70 80 90
pF1KSD TEDVSSEIFDVSRATDYGPEEVCERSSSEETLNNVGDAETPGTVSPNLL--LDGQLAAA-
       :.::: . . :. .::. . :.. .
NP_904 SSGSPASVMTNMRAPSTTGGIGINSVTGTSTVNNV-NITAVGSFNPNVTSSMLGNVNIST
       290 300 310 320 330 340

       100 110 120 130 140
pF1KSD -----AAAPANGGGVVSARSV---SGALASTLAAAATSAPAPGAPGGPQLAGSSAGPVTA
       ::. . : ::.:. .: :: .:....:. . .:.: :: ..: :..
NP_904 SNIPSAAGVSVGPGVTSGVNVNILSGMGNGTISSSAAVSSVPNAA-----AGMTGGSVSS
       350 360 370 380 390

       150 160 170 180 190 200
pF1KSD APSQPPTTCSSRFRVIKLDHGSGEPYRRGRWTCMEYYERD-----SDSSVLTRSGDCIRH
       .: ::. .:::::.::: .:.::...::::: :.::.. ... .... . ...
NP_904 Q-QQQPTVNTSRFRVVKLD-SSSEPFKKGRWTCTEFYEKENAVPATEGVLINKVVETVKQ
       400 410 420 430 440 450

       210 220 230 240 250 260
pF1KSD SSTFDQTAERDSGLGATGGSVVVVVASMQGAHGP-ESGTDSSLTAVSQLPPSEKMSQPT-
       . .. :.::.: :.. .: : ... . . : : :. . .. .: .....::.
NP_904 NP-IEVTSERESTSGSSVSSSVSTLSHYTESVGSGEMGAPTVVVQQQQQQQQQQQQQPAL
       460 470 480 490 500 510

       270 280 290 300
pF1KSD ---PAQPQSFSVGQPQPPPP-PVGGAVAQS--------SAPLPPFPGAATGPQPM---MA
       : ..:. :: : . ...:: : : . : :. :.
NP_904 QGVTLQQMDFGSTGPQSIPAVSIPQSISQSQISQVQLQSQELSYQQKQGLQPVPLQATMS
       520 530 540 550 560 570

       310 320 330 340 350
pF1KSD AAQPSQPQGAGPGGQT----LPPTNVTLAQPAM----SLPPQPGPAVGAPAAQQPQQFAY
       :: ::. .. : : :. .:::: . . :: : ::: :: :..
NP_904 AATGIQPSPVNVVGVTSALGQQPSISSLAQPQLPYSQAAPPVQTPLPGAPPPQQ-LQYGQ
       580 590 600 610 620 630

       360 370 380 390 400 410
pF1KSD PQP----QIPPGHLLPVQPSGQSEYLQQHVAGLQPPSPAQPSSTGAAASPATAATLPVGT
       :: :. :::. : . :::.::. : .::::.:..:. ....::.
NP_904 QQPMVSTQMAPGHVKSVTQNPASEYVQQQPILQTAMSSGQPSSAGVGAG---TTVIPVAQ
       640 650 660 670 680

       420 430 440 450
pF1KSD GQNA------SSVGAQLMGASSQP-SEAMAPRTGPAQGGQVA--------PC---QP-TG
       :. ..: :: ::: :: ..: : .. :.:.: : :: :
NP_904 PQGIQLPVQPTAVPAQPAGASVQPVGQAPAAVSAVPTGSQIANIGQQANIPTAVQQPSTQ
       690 700 710 720 730 740

       460 470 480 490 500
pF1KSD VPPATV--GG-----VVQPC-LGPAGAGQPQSVP--PPQM--GGSGPLSAVPGGPHAVVP
       :::... :. :: : : : :.: : :. .... : .:: :..: :
NP_904 VPPSVIQQGAPPSSQVVPPAQTGIIHQGVQTSAPSLPQQLVIASQSSLLTVPPQPQGVEP
       750 760 770 780 790 800

       510 520 530 540 550
pF1KSD ---GV--PNVPAAVPAPSVPSVSTTSVT-------MPNVPAPLAQSQQLSSHTPVSRSSS
       :. ..::. ::. :.:.:: . ::..:. :. :... .::......
NP_904 VAQGIVSQQLPAVSSLPSASSISVTSQVSSTGPSGMPSAPTNLVPPQNIA-QTPATQNGN
       810 820 830 840 850 860

       560 570 580 590 600
pF1KSD IIQHVGLPLAPGTHSA-PTS--LPQSDLSQFQTQT--QPLVGQVDDTRRKSEP----LPQ
       ..: :. : .:.. : . .: :. .::..:. : . .:..:.:: .:: :::
NP_904 LVQSVSQPPLIATNTNLPLAQQIPLSS-TQFSAQSLAQAIGSQIEDARRAAEPSLVGLPQ
       870 880 890 900 910 920

       610 620 630 640 650 660
pF1KSD PPLSLIAENKPVVKPPVADSLANPLQLTPMNSLATSVFSIAIP-VDGDEDRNPSTAFYQA
       .: . . .:. ..::: .: :.. :. .. : :::...
NP_904 T-ISGDSGGMSAVSDGSSSSLAASASLFPLK-----VLPLTTPLVDGEDE----------
       930 940 950 960 970

       670 680 690 700 710 720
pF1KSD FHLNTLKESKSLWDSASGGGVVAIDNKIEQAMDLVKSHLMYAVREEVEVLKEQIKELVER
       :.::..:::::::::::::::::::::::::::::::::::::.:.
NP_904 --------------SSSGASVVAIDNKIEQAMDLVKSHLMYAVREEVEVLKEQIKELIEK
       980 990 1000 1010

       730 740 750 760 770 780
pF1KSD NSLLERENALLKSLSSNDQLSQLPTQ--QANPGSTSQQQAVIAQPPQPTQPPQQPNVSSA
       :: ::.:: :::.:.: .::.:. .: ..: .:.: :.. : ::.. . :
NP_904 NSQLEQENNLLKTLASPEQLAQFQAQLQTGSPPATTQPQGTTQPPAQPASQGSGPTA
       1020 1030 1040 1050 1060 1070

>>NP_006013 (OMIM: 607715) TSC22 domain family protein 1 (144 aa)
 initn: 384 init1: 337 opt: 402 Z-score: 177.7 bits: 41.2 E(85289): 0.0037
Smith-Waterman score: 402; 53.3% identity (77.8% similar) in 135 aa overlap (643-775:9-142)

       620 630 640 650 660 670
pF1KSD KPVVKPPVADSLANPLQLTPMNSLATSVFSIAIPVDGDEDRNPSTAFYQAFHLNTLKESK
       .:. . . :. : .: ... :.: . :
NP_006                       MKSQWCRPVAMDLGVYQLRHFSISFLSSL-LGTENASV
       10 20 30

       680 690 700 710 720 730
pF1KSD SLWDSASGGGVVAIDNKIEQAMDLVKSHLMYAVREEVEVLKEQIKELVERNSLLERENAL
       : .:.::..:::::::::::::::::::::::::::::::::::::.:.:: ::.:: :
NP_006 RLDNSSSGASVVAIDNKIEQAMDLVKSHLMYAVREEVEVLKEQIKELIEKNSQLEQENNL
       40 50 60 70 80 90

       740 750 760 770 780
pF1KSD LKSLSSNDQLSQLPTQ--QANPGSTSQQQAVIAQPPQPTQPPQQPNVSSA
       ::.:.: .::.:. .: ..: .:.: :.. : ::.. . :
NP_006 LKTLASPEQLAQFQAQLQTGSPPATTQPQGTTQPPAQPASQGSGPTA
       100 110 120 130 140

780 residues in 1 query sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 02:38:38 2016 done: Thu Nov 3 02:38:40 2016
 Total Scan time: 12.110 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]