FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7374, 226 aa
  1>>>pF1KB7374 226 - 226 aa - 226 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.3268+/-0.000791; mu= 9.2850+/- 0.048
 mean_var=86.3189+/-17.388, 0's: 0 Z-trim(109.8): 16 B-trim: 429 in 1/50
 Lambda= 0.138045
 statistics sampled from 11149 (11164) to 11149 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.728), E-opt: 0.2 (0.343), width: 16
 Scan time: 2.360

The best scores are:                                 opt  bits E(32554)
CCDS73111.1 PTPN20 gene_id:26095|Hs108|chr10 ( 226) 1529 313.8 5.5e-86
CCDS73106.1 PTPN20 gene_id:26095|Hs108|chr10 ( 217) 1451 298.3 2.5e-81
CCDS73110.1 PTPN20 gene_id:26095|Hs108|chr10 ( 420) 1299 268.2 5.8e-72
CCDS73105.1 PTPN20 gene_id:26095|Hs108|chr10 ( 411) 1221 252.6 2.7e-67
CCDS81454.1 PTPN20 gene_id:26095|Hs108|chr10 ( 206) 1092 226.8 8.1e-60
CCDS73107.1 PTPN20 gene_id:26095|Hs108|chr10 ( 269) 1014 211.3 4.8e-55
CCDS73115.1 PTPN20 gene_id:26095|Hs108|chr10 ( 145)  990 206.4 7.7e-54
CCDS73114.1 PTPN20 gene_id:26095|Hs108|chr10 ( 339)  760 160.8   1e-39
CCDS73108.1 PTPN20 gene_id:26095|Hs108|chr10 ( 136)  686 145.9 1.2e-35
CCDS81456.1 PTPN20 gene_id:26095|Hs108|chr10 ( 197)  553 119.4 1.6e-27

>>CCDS73111.1 PTPN20 gene_id:26095|Hs108|chr10 (226 aa)
 initn: 1529 init1: 1529 opt: 1529 Z-score: 1657.2 bits: 313.8 E(32554): 5.5e-86
Smith-Waterman score: 1529; 100.0% identity (100.0% similar) in 226 aa overlap (1-226:1-226)

               10        20        30        40        50        60
pF1KB7 MSSPRDFRAEPVNDYEGNDSEAEDLNFRETLPSSSQENTPRSKVFENKVNSEKVKLSLRN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 MSSPRDFRAEPVNDYEGNDSEAEDLNFRETLPSSSQENTPRSKVFENKVNSEKVKLSLRN
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 FPHNDYEDVFEEPSESGSDPSMWTARGPFRRDRWSSEDEEAAGPSQALSPLLSDTRKIVS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 FPHNDYEDVFEEPSESGSDPSMWTARGPFRRDRWSSEDEEAAGPSQALSPLLSDTRKIVS
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 EGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMALELKNLPGEFNSGNQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 EGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMALELKNLPGEFNSGNQ
              130       140       150       160       170       180

              190       200       210       220
pF1KB7 PSNREKNRYRDILPFQHHGYSGPNERTTFWHGSNEGAVSLLLRYCA
       ::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 PSNREKNRYRDILPFQHHGYSGPNERTTFWHGSNEGAVSLLLRYCA
              190       200       210       220

>>CCDS73106.1 PTPN20 gene_id:26095|Hs108|chr10 (217 aa)
 initn: 1451 init1: 1451 opt: 1451 Z-score: 1573.5 bits: 298.3 E(32554): 2.5e-81
Smith-Waterman score: 1451; 100.0% identity (100.0% similar) in 215 aa overlap (12-226:3-217)

               10        20        30        40        50        60
pF1KB7 MSSPRDFRAEPVNDYEGNDSEAEDLNFRETLPSSSQENTPRSKVFENKVNSEKVKLSLRN
                  :::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73          MIVNDYEGNDSEAEDLNFRETLPSSSQENTPRSKVFENKVNSEKVKLSLRN
                        10        20        30        40        50

               70        80        90       100       110       120
pF1KB7 FPHNDYEDVFEEPSESGSDPSMWTARGPFRRDRWSSEDEEAAGPSQALSPLLSDTRKIVS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 FPHNDYEDVFEEPSESGSDPSMWTARGPFRRDRWSSEDEEAAGPSQALSPLLSDTRKIVS
              60        70        80        90       100       110

              130       140       150       160       170       180
pF1KB7 EGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMALELKNLPGEFNSGNQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 EGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMALELKNLPGEFNSGNQ
             120       130       140       150       160       170

              190       200       210       220
pF1KB7 PSNREKNRYRDILPFQHHGYSGPNERTTFWHGSNEGAVSLLLRYCA
       ::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 PSNREKNRYRDILPFQHHGYSGPNERTTFWHGSNEGAVSLLLRYCA
             180       190       200       210

>>CCDS73110.1 PTPN20 gene_id:26095|Hs108|chr10 (420 aa)
 initn: 1324 init1: 1299 opt: 1299 Z-score: 1405.4 bits: 268.2 E(32554): 5.8e-72
Smith-Waterman score: 1299; 99.5% identity (100.0% similar) in 195 aa overlap (1-195:1-195)

               10        20        30        40        50        60
pF1KB7 MSSPRDFRAEPVNDYEGNDSEAEDLNFRETLPSSSQENTPRSKVFENKVNSEKVKLSLRN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 MSSPRDFRAEPVNDYEGNDSEAEDLNFRETLPSSSQENTPRSKVFENKVNSEKVKLSLRN
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 FPHNDYEDVFEEPSESGSDPSMWTARGPFRRDRWSSEDEEAAGPSQALSPLLSDTRKIVS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 FPHNDYEDVFEEPSESGSDPSMWTARGPFRRDRWSSEDEEAAGPSQALSPLLSDTRKIVS
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 EGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMALELKNLPGEFNSGNQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 EGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMALELKNLPGEFNSGNQ
              130       140       150       160       170       180

              190       200       210       220
pF1KB7 PSNREKNRYRDILPFQHHGYSGPNERTTFWHGSNEGAVSLLLRYCA
       ::::::::::::::.
CCDS73 PSNREKNRYRDILPYDSTRVPLGKSKDYINASYIRIVNCGEEYFYIATQGPLLSTIDDFW
              190       200       210       220       230       240

>>CCDS73105.1 PTPN20 gene_id:26095|Hs108|chr10 (411 aa)
 initn: 1246 init1: 1221 opt: 1221 Z-score: 1321.6 bits: 252.6 E(32554): 2.7e-67
Smith-Waterman score: 1221; 99.5% identity (100.0% similar) in 184 aa overlap (12-195:3-186)

               10        20        30        40        50        60
pF1KB7 MSSPRDFRAEPVNDYEGNDSEAEDLNFRETLPSSSQENTPRSKVFENKVNSEKVKLSLRN
                  :::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73          MIVNDYEGNDSEAEDLNFRETLPSSSQENTPRSKVFENKVNSEKVKLSLRN
                        10        20        30        40        50

               70        80        90       100       110       120
pF1KB7 FPHNDYEDVFEEPSESGSDPSMWTARGPFRRDRWSSEDEEAAGPSQALSPLLSDTRKIVS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 FPHNDYEDVFEEPSESGSDPSMWTARGPFRRDRWSSEDEEAAGPSQALSPLLSDTRKIVS
              60        70        80        90       100       110

              130       140       150       160       170       180
pF1KB7 EGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMALELKNLPGEFNSGNQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 EGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMALELKNLPGEFNSGNQ
             120       130       140       150       160       170

              190       200       210       220
pF1KB7 PSNREKNRYRDILPFQHHGYSGPNERTTFWHGSNEGAVSLLLRYCA
       ::::::::::::::.
CCDS73 PSNREKNRYRDILPYDSTRVPLGKSKDYINASYIRIVNCGEEYFYIATQGPLLSTIDDFW
             180       190       200       210       220       230

>>CCDS81454.1 PTPN20 gene_id:26095|Hs108|chr10 (206 aa)
 initn: 1092 init1: 1092 opt: 1092 Z-score: 1187.4 bits: 226.8 E(32554): 8.1e-60
Smith-Waterman score: 1092; 100.0% identity (100.0% similar) in 164 aa overlap (1-164:1-164)

               10        20        30        40        50        60
pF1KB7 MSSPRDFRAEPVNDYEGNDSEAEDLNFRETLPSSSQENTPRSKVFENKVNSEKVKLSLRN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 MSSPRDFRAEPVNDYEGNDSEAEDLNFRETLPSSSQENTPRSKVFENKVNSEKVKLSLRN
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 FPHNDYEDVFEEPSESGSDPSMWTARGPFRRDRWSSEDEEAAGPSQALSPLLSDTRKIVS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 FPHNDYEDVFEEPSESGSDPSMWTARGPFRRDRWSSEDEEAAGPSQALSPLLSDTRKIVS
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB7 EGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMALELKNLPGEFNSGNQ
       ::::::::::::::::::::::::::::::::::::::::::::
CCDS81 EGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMFNIMDIVAQMREQRSG
              130       140       150       160       170       180

              190       200       210       220
pF1KB7 PSNREKNRYRDILPFQHHGYSGPNERTTFWHGSNEGAVSLLLRYCA

CCDS81 MVQTKEQYHFCYDIVLEVLRKLLTLD
              190       200

>>CCDS73107.1 PTPN20 gene_id:26095|Hs108|chr10 (269 aa)
 initn: 1014 init1: 1014 opt: 1014 Z-score: 1101.7 bits: 211.3 E(32554): 4.8e-55
Smith-Waterman score: 1014; 100.0% identity (100.0% similar) in 153 aa overlap (12-164:3-155)

               10        20        30        40        50        60
pF1KB7 MSSPRDFRAEPVNDYEGNDSEAEDLNFRETLPSSSQENTPRSKVFENKVNSEKVKLSLRN
                  :::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73          MIVNDYEGNDSEAEDLNFRETLPSSSQENTPRSKVFENKVNSEKVKLSLRN
                        10        20        30        40        50

               70        80        90       100       110       120
pF1KB7 FPHNDYEDVFEEPSESGSDPSMWTARGPFRRDRWSSEDEEAAGPSQALSPLLSDTRKIVS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 FPHNDYEDVFEEPSESGSDPSMWTARGPFRRDRWSSEDEEAAGPSQALSPLLSDTRKIVS
              60        70        80        90       100       110

              130       140       150       160       170       180
pF1KB7 EGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMALELKNLPGEFNSGNQ
       ::::::::::::::::::::::::::::::::::::::::::::
CCDS73 EGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMTGTSHSVKQLQFTKWP
             120       130       140       150       160       170

              190       200       210       220
pF1KB7 PSNREKNRYRDILPFQHHGYSGPNERTTFWHGSNEGAVSLLLRYCA

CCDS73 DHGTPASADSFIKYIRYARKSHLTGPMVVHCSAGIGRTGVFLCVDVVFCAIVKNCSFNIM
             180       190       200       210       220       230

>>CCDS73115.1 PTPN20 gene_id:26095|Hs108|chr10 (145 aa)
 initn: 990 init1: 990 opt: 990 Z-score: 1080.0 bits: 206.4 E(32554): 7.7e-54
Smith-Waterman score: 990; 100.0% identity (100.0% similar) in 145 aa overlap (82-226:1-145)

              60        70        80        90       100       110
pF1KB7 EKVKLSLRNFPHNDYEDVFEEPSESGSDPSMWTARGPFRRDRWSSEDEEAAGPSQALSPL
                                     ::::::::::::::::::::::::::::::
CCDS73                               MWTARGPFRRDRWSSEDEEAAGPSQALSPL
                                             10        20        30

             120       130       140       150       160       170
pF1KB7 LSDTRKIVSEGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMALELKNL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 LSDTRKIVSEGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMALELKNL
               40        50        60        70        80        90

             180       190       200       210       220
pF1KB7 PGEFNSGNQPSNREKNRYRDILPFQHHGYSGPNERTTFWHGSNEGAVSLLLRYCA
       :::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 PGEFNSGNQPSNREKNRYRDILPFQHHGYSGPNERTTFWHGSNEGAVSLLLRYCA
              100       110       120       130       140

>>CCDS73114.1 PTPN20 gene_id:26095|Hs108|chr10 (339 aa)
 initn: 760 init1: 760 opt: 760 Z-score: 826.7 bits: 160.8 E(32554): 1e-39
Smith-Waterman score: 760; 99.1% identity (100.0% similar) in 114 aa overlap (82-195:1-114)

              60        70        80        90       100       110
pF1KB7 EKVKLSLRNFPHNDYEDVFEEPSESGSDPSMWTARGPFRRDRWSSEDEEAAGPSQALSPL
                                     ::::::::::::::::::::::::::::::
CCDS73                               MWTARGPFRRDRWSSEDEEAAGPSQALSPL
                                             10        20        30

             120       130       140       150       160       170
pF1KB7 LSDTRKIVSEGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMALELKNL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 LSDTRKIVSEGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMALELKNL
               40        50        60        70        80        90

             180       190       200       210       220
pF1KB7 PGEFNSGNQPSNREKNRYRDILPFQHHGYSGPNERTTFWHGSNEGAVSLLLRYCA
       :::::::::::::::::::::::.
CCDS73 PGEFNSGNQPSNREKNRYRDILPYDSTRVPLGKSKDYINASYIRIVNCGEEYFYIATQGP
              100       110       120       130       140       150

>>CCDS73108.1 PTPN20 gene_id:26095|Hs108|chr10 (136 aa)
 initn: 686 init1: 686 opt: 686 Z-score: 753.3 bits: 145.9 E(32554): 1.2e-35
Smith-Waterman score: 739; 61.9% identity (61.9% similar) in 215 aa overlap (12-226:3-136)

               10        20        30        40        50        60
pF1KB7 MSSPRDFRAEPVNDYEGNDSEAEDLNFRETLPSSSQENTPRSKVFENKVNSEKVKLSLRN
                  :::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73          MIVNDYEGNDSEAEDLNFRETLPSSSQENTPRSKVFENKVNSEKVKLSLRN
                        10        20        30        40        50

               70        80        90       100       110       120
pF1KB7 FPHNDYEDVFEEPSESGSDPSMWTARGPFRRDRWSSEDEEAAGPSQALSPLLSDTRKIVS
       :::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 FPHNDYEDVFEEPSESGSDPSMWTARGPFRRDRWSSEDEEAAGPSQALSPLLS-------
              60        70        80        90       100

              130       140       150       160       170       180
pF1KB7 EGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMALELKNLPGEFNSGNQ

CCDS73 ------------------------------------------------------------

              190       200       210       220
pF1KB7 PSNREKNRYRDILPFQHHGYSGPNERTTFWHGSNEGAVSLLLRYCA
                      :::::::::::::::::::::::::::::::
CCDS73 --------------VQHHGYSGPNERTTFWHGSNEGAVSLLLRYCA
                        110       120       130

>>CCDS81456.1 PTPN20 gene_id:26095|Hs108|chr10 (197 aa)
 initn: 553 init1: 553 opt: 553 Z-score: 607.6 bits: 119.4 E(32554): 1.6e-27
Smith-Waterman score: 553; 100.0% identity (100.0% similar) in 83 aa overlap (82-164:1-83)

              60        70        80        90       100       110
pF1KB7 EKVKLSLRNFPHNDYEDVFEEPSESGSDPSMWTARGPFRRDRWSSEDEEAAGPSQALSPL
                                     ::::::::::::::::::::::::::::::
CCDS81                               MWTARGPFRRDRWSSEDEEAAGPSQALSPL
                                             10        20        30

             120       130       140       150       160       170
pF1KB7 LSDTRKIVSEGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMALELKNL
       :::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS81 LSDTRKIVSEGELDQLAQIRPLIFNFHEQTAIKDCLKILEEKTAAYDIMQEFMTGTSHSV
               40        50        60        70        80        90

             180       190       200       210       220
pF1KB7 PGEFNSGNQPSNREKNRYRDILPFQHHGYSGPNERTTFWHGSNEGAVSLLLRYCA

CCDS81 KQLQFTKWPDHGTPASADSFIKYIRYARKSHLTGPMVVHCSAGIGRTGVFLCVDVVFCAI
              100       110       120       130       140       150


226 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 06:58:51 2016 done: Fri Nov 4 06:58:52 2016
 Total Scan time: 2.360 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]