FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0682, 389 aa
  1>>>pF1KE0682 389 - 389 aa - 389 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 11.2790+/-0.000931; mu= -10.1683+/- 0.056
 mean_var=323.5037+/-66.564, 0's: 0 Z-trim(115.2): 13  B-trim: 0 in 0/55
 Lambda= 0.071307
 statistics sampled from 15745 (15753) to 15745 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.796), E-opt: 0.2 (0.484), width:  16
 Scan time:  3.440

The best scores are:                                      opt bits E(32554)
CCDS1159.1 SH2D2A gene_id:9047|Hs108|chr1          ( 389) 2739 295.0 7.8e-80
CCDS53380.1 SH2D2A gene_id:9047|Hs108|chr1         ( 371) 2584 279.0 4.7e-75
CCDS53381.1 SH2D2A gene_id:9047|Hs108|chr1         ( 399) 2044 223.5 2.7e-58


>>CCDS1159.1 SH2D2A gene_id:9047|Hs108|chr1              (389 aa)
 initn: 2739 init1: 2739 opt: 2739  Z-score: 1546.7  bits: 295.0 E(32554): 7.8e-80
Smith-Waterman score: 2739; 100.0% identity (100.0% similar) in 389 aa overlap (1-389:1-389)

               10        20        30        40        50        60
pF1KE0 MEFPLAQICPQGSHEAPIPTFSTFQITDMTRRSCQNLGYTAASPQAPEAASNTGNAERAE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 MEFPLAQICPQGSHEAPIPTFSTFQITDMTRRSCQNLGYTAASPQAPEAASNTGNAERAE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 EVPGEGSLFLQAETRAWFQKTQAHWLLQHGAAPAWFHGFITRREAERLLEPKPQGCYLVR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 EVPGEGSLFLQAETRAWFQKTQAHWLLQHGAAPAWFHGFITRREAERLLEPKPQGCYLVR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 FSESAVTFVLTYRSRTCCRHFLLAQLRDGRHVVLGEDSAHARLQDLLLHYTAHPLSPYGE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 FSESAVTFVLTYRSRTCCRHFLLAQLRDGRHVVLGEDSAHARLQDLLLHYTAHPLSPYGE
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 TLTEPLARQTPEPAGLSLRTEESNFGSKSQDPNPQYSPIIKQGQAPVPMQKEGAGEKEPS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 TLTEPLARQTPEPAGLSLRTEESNFGSKSQDPNPQYSPIIKQGQAPVPMQKEGAGEKEPS
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE0 QLLRPKPPIPAKPQLPPEVYTIPVPRHRPAPRPKPSNPIYNEPDEPIAFYAMGRGSPGEA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 QLLRPKPPIPAKPQLPPEVYTIPVPRHRPAPRPKPSNPIYNEPDEPIAFYAMGRGSPGEA
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE0 PSNIYVEVEDEGLPATLGHPVLRKSWSRPVPGGQNTGGSQLHSENSVIGQGPPLPHQPPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS11 PSNIYVEVEDEGLPATLGHPVLRKSWSRPVPGGQNTGGSQLHSENSVIGQGPPLPHQPPP
              310       320       330       340       350       360

              370       380
pF1KE0 AWRHTLPHNLSRQVLQDRGQAWLPLGPPQ
       :::::::::::::::::::::::::::::
CCDS11 AWRHTLPHNLSRQVLQDRGQAWLPLGPPQ
              370       380

>>CCDS53380.1 SH2D2A gene_id:9047|Hs108|chr1             (371 aa)
 initn: 2584 init1: 2584 opt: 2584  Z-score: 1460.9  bits: 279.0 E(32554): 4.7e-75
Smith-Waterman score: 2584; 100.0% identity (100.0% similar) in 368 aa overlap (22-389:4-371)

               10        20        30        40        50        60
pF1KE0 MEFPLAQICPQGSHEAPIPTFSTFQITDMTRRSCQNLGYTAASPQAPEAASNTGNAERAE
                            :::::::::::::::::::::::::::::::::::::::
CCDS53                   MSPSTFQITDMTRRSCQNLGYTAASPQAPEAASNTGNAERAE
                                 10        20        30        40

               70        80        90       100       110       120
pF1KE0 EVPGEGSLFLQAETRAWFQKTQAHWLLQHGAAPAWFHGFITRREAERLLEPKPQGCYLVR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 EVPGEGSLFLQAETRAWFQKTQAHWLLQHGAAPAWFHGFITRREAERLLEPKPQGCYLVR
             50        60        70        80        90       100

              130       140       150       160       170       180
pF1KE0 FSESAVTFVLTYRSRTCCRHFLLAQLRDGRHVVLGEDSAHARLQDLLLHYTAHPLSPYGE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 FSESAVTFVLTYRSRTCCRHFLLAQLRDGRHVVLGEDSAHARLQDLLLHYTAHPLSPYGE
            110       120       130       140       150       160

              190       200       210       220       230       240
pF1KE0 TLTEPLARQTPEPAGLSLRTEESNFGSKSQDPNPQYSPIIKQGQAPVPMQKEGAGEKEPS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 TLTEPLARQTPEPAGLSLRTEESNFGSKSQDPNPQYSPIIKQGQAPVPMQKEGAGEKEPS
            170       180       190       200       210       220

              250       260       270       280       290       300
pF1KE0 QLLRPKPPIPAKPQLPPEVYTIPVPRHRPAPRPKPSNPIYNEPDEPIAFYAMGRGSPGEA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 QLLRPKPPIPAKPQLPPEVYTIPVPRHRPAPRPKPSNPIYNEPDEPIAFYAMGRGSPGEA
            230       240       250       260       270       280

              310       320       330       340       350       360
pF1KE0 PSNIYVEVEDEGLPATLGHPVLRKSWSRPVPGGQNTGGSQLHSENSVIGQGPPLPHQPPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 PSNIYVEVEDEGLPATLGHPVLRKSWSRPVPGGQNTGGSQLHSENSVIGQGPPLPHQPPP
            290       300       310       320       330       340

              370       380
pF1KE0 AWRHTLPHNLSRQVLQDRGQAWLPLGPPQ
       :::::::::::::::::::::::::::::
CCDS53 AWRHTLPHNLSRQVLQDRGQAWLPLGPPQ
            350       360       370

>>CCDS53381.1 SH2D2A gene_id:9047|Hs108|chr1             (399 aa)
 initn: 2037 init1: 2037 opt: 2044  Z-score: 1160.2  bits: 223.5 E(32554): 2.7e-58
Smith-Waterman score: 2709; 97.5% identity (97.5% similar) in 399 aa overlap (1-389:1-399)

               10        20        30        40        50        60
pF1KE0 MEFPLAQICPQGSHEAPIPTFSTFQITDMTRRSCQNLGYTAASPQAPEAASNTGNAERAE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 MEFPLAQICPQGSHEAPIPTFSTFQITDMTRRSCQNLGYTAASPQAPEAASNTGNAERAE
               10        20        30        40        50        60

               70        80        90       100                 110
pF1KE0 EVPGEGSLFLQAETRAWFQKTQAHWLLQHGAAPAWFHGFITRR----------EAERLLE
       :::::::::::::::::::::::::::::::::::::::::::          :::::::
CCDS53 EVPGEGSLFLQAETRAWFQKTQAHWLLQHGAAPAWFHGFITRRVRPPLSVTHREAERLLE
               70        80        90       100       110       120

              120       130       140       150       160       170
pF1KE0 PKPQGCYLVRFSESAVTFVLTYRSRTCCRHFLLAQLRDGRHVVLGEDSAHARLQDLLLHY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 PKPQGCYLVRFSESAVTFVLTYRSRTCCRHFLLAQLRDGRHVVLGEDSAHARLQDLLLHY
              130       140       150       160       170       180

              180       190       200       210       220       230
pF1KE0 TAHPLSPYGETLTEPLARQTPEPAGLSLRTEESNFGSKSQDPNPQYSPIIKQGQAPVPMQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 TAHPLSPYGETLTEPLARQTPEPAGLSLRTEESNFGSKSQDPNPQYSPIIKQGQAPVPMQ
              190       200       210       220       230       240

              240       250       260       270       280       290
pF1KE0 KEGAGEKEPSQLLRPKPPIPAKPQLPPEVYTIPVPRHRPAPRPKPSNPIYNEPDEPIAFY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 KEGAGEKEPSQLLRPKPPIPAKPQLPPEVYTIPVPRHRPAPRPKPSNPIYNEPDEPIAFY
              250       260       270       280       290       300

              300       310       320       330       340       350
pF1KE0 AMGRGSPGEAPSNIYVEVEDEGLPATLGHPVLRKSWSRPVPGGQNTGGSQLHSENSVIGQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 AMGRGSPGEAPSNIYVEVEDEGLPATLGHPVLRKSWSRPVPGGQNTGGSQLHSENSVIGQ
              310       320       330       340       350       360

              360       370       380
pF1KE0 GPPLPHQPPPAWRHTLPHNLSRQVLQDRGQAWLPLGPPQ
       :::::::::::::::::::::::::::::::::::::::
CCDS53 GPPLPHQPPPAWRHTLPHNLSRQVLQDRGQAWLPLGPPQ
              370       380       390



389 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov  5 02:45:29 2016 done: Sat Nov  5 02:45:29 2016
 Total Scan time:  3.440 Total Display time:  0.020

Function used was FASTA [36.3.4 Apr, 2011]