FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KE0455, 226 aa 1>>>pF1KE0455 226 - 226 aa - 226 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 4.5795+/-0.000626; mu= 17.7739+/- 0.038 mean_var=54.3262+/-10.809, 0's: 0 Z-trim(110.1): 17 B-trim: 46 in 1/49 Lambda= 0.174008 statistics sampled from 11382 (11392) to 11382 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.711), E-opt: 0.2 (0.35), width: 16 Scan time: 1.630 The best scores are: opt bits E(32554) CCDS6275.1 LAPTM4B gene_id:55353|Hs108|chr8 ( 317) 1537 393.2 9.8e-110 CCDS1696.1 LAPTM4A gene_id:9741|Hs108|chr2 ( 233) 720 188.1 4.2e-48 >>CCDS6275.1 LAPTM4B gene_id:55353|Hs108|chr8 (317 aa) initn: 1537 init1: 1537 opt: 1537 Z-score: 2083.6 bits: 393.2 E(32554): 9.8e-110 Smith-Waterman score: 1537; 100.0% identity (100.0% similar) in 226 aa overlap (1-226:92-317) 10 20 30 pF1KE0 MKMVAPWTRFYSNSCCLCCHVRTGTILLGV :::::::::::::::::::::::::::::: CCDS62 SRQQRRGGLQARRSTLLKTCARARATAPGAMKMVAPWTRFYSNSCCLCCHVRTGTILLGV 70 80 90 100 110 120 40 50 60 70 80 90 pF1KE0 WYLIINAVVLLILLSALADPDQYNFSSSELGGDFEFMDDANMCIAIAISLLMILICAMAT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS62 WYLIINAVVLLILLSALADPDQYNFSSSELGGDFEFMDDANMCIAIAISLLMILICAMAT 130 140 150 160 170 180 100 110 120 130 140 150 pF1KE0 YGAYKQRAAWIIPFFCYQIFDFALNMLVAITVLIYPNSIQEYIRQLPPNFPYRDDVMSVN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS62 YGAYKQRAAWIIPFFCYQIFDFALNMLVAITVLIYPNSIQEYIRQLPPNFPYRDDVMSVN 190 200 210 220 230 240 160 170 180 190 200 210 pF1KE0 PTCLVLIILLFISIILTFKGYLISCVWNCYRYINGRNSSDVLVYVTSNDTTVLLPPYDDA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS62 PTCLVLIILLFISIILTFKGYLISCVWNCYRYINGRNSSDVLVYVTSNDTTVLLPPYDDA 250 260 270 280 290 300 220 pF1KE0 TVNGAAKEPPPPYVSA :::::::::::::::: CCDS62 TVNGAAKEPPPPYVSA 310 >>CCDS1696.1 LAPTM4A gene_id:9741|Hs108|chr2 (233 aa) initn: 738 init1: 300 opt: 720 Z-score: 977.1 bits: 188.1 E(32554): 4.2e-48 Smith-Waterman score: 720; 45.5% identity (75.9% similar) in 224 aa overlap (9-226:13-233) 10 20 30 40 50 pF1KE0 MKMVAPWTRFYSNSCCLCCHVRTGTILLGVWYLIINAVVLLILLSALADPDQY--- ::::. :: :::::::::.::.::...: .. ..: .. :... CCDS16 MVSMSFKRNRSDRFYSTRCCGCCHVRTGTIILGTWYMVVNLLMAILLTVEVTHPNSMPAV 10 20 30 40 50 60 60 70 80 90 100 110 pF1KE0 NFSSSELGGDF--EFMDDANMCIAIAISLLMILICAMATYGAYKQRAAWIIPFFCYQIFD :.. .:. . : : : : :. .:.:.::..: .: .::: . ...:.::::::..:: CCDS16 NIQYEVIGNYYSSERMAD-NACVLFAVSVLMFIISSMLVYGAISYQVGWLIPFFCYRLFD 70 80 90 100 110 120 130 140 150 160 170 pF1KE0 FALNMLVAITVLIYPNSIQEYIRQLPPNFPYRDDVMSVNPTCLVLIILLFISIILTFKGY :.:. ::::. : : :.::. ::: .:::.::..... .::..:.:.:..... ::.: CCDS16 FVLSCLVAISSLTYLPRIKEYLDQLP-DFPYKDDLLALDSSCLLFIVLVFFALFIIFKAY 120 130 140 150 160 170 180 190 200 210 220 pF1KE0 LISCVWNCYRYINGRNSSDVLVY-VTSNDTTVLLPPYDDATVNGAAKEPPPPYVSA ::.::::::.:::.:: .. :: . .:: :. : :. :::::::. : CCDS16 LINCVWNCYKYINNRNVPEIAVYPAFEAPPQYVLPTYEMA-VKMPEKEPPPPYLPA 180 190 200 210 220 230 226 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Nov 3 07:41:49 2016 done: Thu Nov 3 07:41:49 2016 Total Scan time: 1.630 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]