FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0783, 352 aa
  1>>>pF1KE0783 352 - 352 aa - 352 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.4111+/-0.000724; mu= 7.3906+/- 0.043
 mean_var=140.2648+/-28.646, 0's: 0 Z-trim(114.4): 59 B-trim: 0 in 0/53
 Lambda= 0.108293
 statistics sampled from 14866 (14927) to 14866 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.79), E-opt: 0.2 (0.459), width: 16
 Scan time: 2.910

The best scores are:                                 opt  bits  E(32554)
CCDS74304.1 HSH2D gene_id:84941|Hs108|chr19  ( 352) 2423 389.5 2.3e-108
CCDS45315.1 SH2D7 gene_id:646892|Hs108|chr15 ( 451)  365  68.0  1.7e-11

>>CCDS74304.1 HSH2D gene_id:84941|Hs108|chr19 (352 aa)
 initn: 2423 init1: 2423 opt: 2423 Z-score: 2059.0 bits: 389.5 E(32554): 2.3e-108
Smith-Waterman score: 2423; 100.0% identity (100.0% similar) in 352 aa overlap (1-352:1-352)

               10        20        30        40        50        60
pF1KE0 MTEAGKLPLPLPPRLDWFVHTQMGQLAQDGVPEWFHGAISREDAENLLESQPLGSFLIRV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 MTEAGKLPLPLPPRLDWFVHTQMGQLAQDGVPEWFHGAISREDAENLLESQPLGSFLIRV
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 SHSHVGYTLSYKAQSSCCHFMVKLLDDGTFMIPGEKVAHTSLDALVTFHQQKPIEPRREL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 SHSHVGYTLSYKAQSSCCHFMVKLLDDGTFMIPGEKVAHTSLDALVTFHQQKPIEPRREL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 LTQPCRQKDPANVDYEDLFLYSNAVAEEAACPVSAPEEASPKPVLCHQSKERKPSAEMNR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 LTQPCRQKDPANVDYEDLFLYSNAVAEEAACPVSAPEEASPKPVLCHQSKERKPSAEMNR
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 ITTKEATSSCPPKSPLGETRQKLWRSLKMLPERGQRVRQQLKSHLATVNLSSLLDVRRST
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 ITTKEATSSCPPKSPLGETRQKLWRSLKMLPERGQRVRQQLKSHLATVNLSSLLDVRRST
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE0 VISGPGTGKGSQDHSGDPTSGDRGYTDPCVATSLKSPSQPQAPKDRKVPTRKAERSVSCI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 VISGPGTGKGSQDHSGDPTSGDRGYTDPCVATSLKSPSQPQAPKDRKVPTRKAERSVSCI
              250       260       270       280       290       300

              310       320       330       340       350
pF1KE0 EVTPGDRSWHQMVVRALSSQESKPEHQGLAEPENDQLPEEYQQPPPFAPGYC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 EVTPGDRSWHQMVVRALSSQESKPEHQGLAEPENDQLPEEYQQPPPFAPGYC
              310       320       330       340       350

>>CCDS45315.1 SH2D7 gene_id:646892|Hs108|chr15 (451 aa)
 initn: 347 init1: 291 opt: 365 Z-score: 319.8 bits: 68.0 E(32554): 1.7e-11
Smith-Waterman score: 375; 28.8% identity (57.6% similar) in 330 aa overlap (15-328:31-345)

                               10        20        30         40
pF1KE0                 MTEAGKLPLPLPPRLDWFVHTQMGQLAQDG-VPEWFHGAISRED
                                     : ::..::   . :.: .: :::: :.:..
CCDS45 MEDSLKQLSLGRDPEGAGDSQALAELQELALKWFMETQAPFILQNGALPPWFHGFITRKQ
               10        20        30        40        50        60

            50        60        70        80        90       100
pF1KE0 AENLLESQPLGSFLIRVSHSHVGYTLSYKAQSSCCHFMVKLLDDGTFMIPGEKVAHTSLD
       .:.::... :::::::.:   .:: :::.... : ::... : .  ..: :.  .:..:
CCDS45 TEQLLRDKALGSFLIRLSDRATGYILSYRGSDRCRHFVINQLRNRRYIISGDTQSHSTLA
               70        80        90       100       110       120

           110       120        130       140       150
pF1KE0 ALVTFHQQKPIEPRRELLTQPC-RQKDPANVDYEDLFLYSNAVAEEAACPVSA-----PE
        ::  .:.  .:: .:.::  : : .:    :     :... :  :   :..:     :.
CCDS45 ELVHHYQEAQLEPFKEMLTAACPRPEDNDLYDAITRGLHQTIVDPENP-PATAFLTVVPD
              130       140       150       160        170

            160       170        180       190        200       210
pF1KE0 EA-----SPKPVLCHQSKERKPSAEMN-RITTKEATSSCPPK-SPLGETRQKLWRSLKML
       .:     :::: .     ..  : ... :  ..: .   : . ::: :  ..: .
CCDS45 KAASPRSSPKPQVSFLHAQK--SLDVSPRNLSQEESMEAPIRVSPLPEKSSSLLEESFGG
     180       190         200       210       220       230

              220        230       240       250       260
pF1KE0 PERGQRVRQQLKS-HLATVNLSSLLDVRRSTVISGPGTGKGSQDHSGDPTSGDRGYTDPC
       :  .. .  .:.  . : ..:..  . :.. : .:  . . ... .   ..:...  :
CCDS45 P--SDIIYADLRRMNQARLGLGTEGSGRHGPVPAGSQAYSPGREAQRRLSDGEQNRPDG-
         240       250       260       270       280       290

     270       280       290       300        310       320
pF1KE0 VATSLKSPSQPQAPKDRKVPTRKAERSVSCIEVTPG-DRSWHQMVVRALSSQESKPEHQG
       ..  :.. :  :.: .   ::     : .: ..  .   .:.:   .   :::..:  ::
CCDS45 LGPVLSGVSPDQGPTES--PT-----SWGCSDAMGSLGATWRQEFPKL--SQEAQPCSQG
          300       310              320       330         340

      330       340       350
pF1KE0 LAEPENDQLPEEYQQPPPFAPGYC

CCDS45 SSADIYEFIGTEGLLQEARDTPDQEGSTYEQIPACWGGPARAPHPGASPTYSPWVHGYKR
         350       360       370       380       390       400

352 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov 5 18:40:06 2016 done: Sat Nov 5 18:40:06 2016
 Total Scan time: 2.910 Total Display time: -0.020

Function used was FASTA [36.3.4 Apr, 2011]