FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0405, 298 aa
  1>>>pF1KE0405 298 - 298 aa - 298 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.3985+/-0.000848; mu= 15.3112+/- 0.051
 mean_var=94.3635+/-19.290, 0's: 0 Z-trim(108.9): 109 B-trim: 378 in 1/50
 Lambda= 0.132030
 statistics sampled from 10417 (10553) to 10417 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.699), E-opt: 0.2 (0.324), width: 16
 Scan time: 2.660

The best scores are:                              opt  bits E(32554)
CCDS6695.1 OGN gene_id:4969|Hs108|chr9    ( 298) 1946 380.8 6.8e-106
CCDS31870.1 EPYC gene_id:1833|Hs108|chr12 ( 322)  764 155.7 4.3e-38
CCDS1439.1 OPTC gene_id:26254|Hs108|chr1  ( 332)  711 145.6 4.8e-35

>>CCDS6695.1 OGN gene_id:4969|Hs108|chr9 (298 aa)
 initn: 1946 init1: 1946 opt: 1946 Z-score: 2014.6 bits: 380.8 E(32554): 6.8e-106
Smith-Waterman score: 1946; 100.0% identity (100.0% similar) in 298 aa overlap (1-298:1-298)

       10 20 30 40 50 60
pF1KE0 MKTLQSTLLLLLLVPLIKPAPPTQQDSRIIYDYGTDNFEESIFSQDYEDKYLDGKNIKEK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 MKTLQSTLLLLLLVPLIKPAPPTQQDSRIIYDYGTDNFEESIFSQDYEDKYLDGKNIKEK
       10 20 30 40 50 60

       70 80 90 100 110 120
pF1KE0 ETVIIPNEKSLQLQKDEAITPLPPKKENDEMPTCLLCVCLSGSVYCEEVDIDAVPPLPKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 ETVIIPNEKSLQLQKDEAITPLPPKKENDEMPTCLLCVCLSGSVYCEEVDIDAVPPLPKE
       70 80 90 100 110 120

       130 140 150 160 170 180
pF1KE0 SAYLYARFNKIKKLTAKDFADIPNLRRLDFTGNLIEDIEDGTFSKLSLLEELSLAENQLL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 SAYLYARFNKIKKLTAKDFADIPNLRRLDFTGNLIEDIEDGTFSKLSLLEELSLAENQLL
       130 140 150 160 170 180

       190 200 210 220 230 240
pF1KE0 KLPVLPPKLTLFNAKYNKIKSRGIKANAFKKLNNLTFLYLDHNALESVPLNLPESLRVIH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 KLPVLPPKLTLFNAKYNKIKSRGIKANAFKKLNNLTFLYLDHNALESVPLNLPESLRVIH
       190 200 210 220 230 240

       250 260 270 280 290
pF1KE0 LQFNNIASITDDTFCKANDTSYIRDRIEEIRLEGNPIVLGKHPNSFICLKRLPIGSYF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS66 LQFNNIASITDDTFCKANDTSYIRDRIEEIRLEGNPIVLGKHPNSFICLKRLPIGSYF
       250 260 270 280 290

>>CCDS31870.1 EPYC gene_id:1833|Hs108|chr12 (322 aa)
 initn: 783 init1: 760 opt: 764 Z-score: 797.4 bits: 155.7 E(32554): 4.3e-38
Smith-Waterman score: 770; 39.7% identity (68.3% similar) in 325 aa overlap (1-296:1-320)

       10 20 30 40 50
pF1KE0 MKTLQSTLLLLLLVPLIKPAPPTQQDSRIIYDYGTDNFEESIFSQD----YEDKYLDGKN
       :::: . .: :.. :: .. : :: ..... .. . : ::. .: .
CCDS31 MKTLAGLVLGLVIFDAAVTAPTLES---INYD--SETYDATLEDLDNLYNYENIPVDKVE
       10 20 30 40 50

       60 70 80 90
pF1KE0 IK--------EKETVIIPN--EKSLQLQKDEAITPL------PPKKE---------NDEM
       :. ..: . : ::. . ...: :: : . : :...
CCDS31 IEIATVMPSGNRELLTPPPQPEKAQEEEEEEESTPRLIDGSSPQEPEFTGVLGPHTNEDF
       60 70 80 90 100 110

       100 110 120 130 140 150
pF1KE0 PTCLLCVCLSGSVYCEEVDIDAVPPLPKESAYLYARFNKIKKLTAKDFADIPNLRRLDFT
       ::::::.:.: .:::.. ..::.:::::..::.:.:::.:::.. .:::.. .:.:.:.:
CCDS31 PTCLLCTCISTTVYCDDHELDAIPPLPKNTAYFYSRFNRIKKINKNDFASLSDLKRIDLT
       120 130 140 150 160 170

       160 170 180 190 200 210
pF1KE0 GNLIEDIEDGTFSKLSLLEELSLAENQLLKLPVLPPKLTLFNAKYNKIKSRGIKANAFKK
       .::: .:.. .: :: :.:: : .:.. .:: :: ::... . :.. .::: .:::
CCDS31 SNLISEIDEDAFRKLPQLRELVLRDNKIRQLPELPTTLTFIDISNNRLGRKGIKQEAFKD
       180 190 200 210 220 230

       220 230 240 250 260 270
pF1KE0 LNNLTFLYLDHNALESVPLNLPESLRVIHLQFNNIASITDDTFCKANDTSYIRDRIEEIR
       . .: ::: : :. .:: :::.::..::: ::: . .::::.... .::: .:.::
CCDS31 MYDLHHLYLTDNNLDHIPLPLPENLRALHLQNNNILEMHEDTFCNVKNLTYIRKALEDIR
       240 250 260 270 280 290

       280 290
pF1KE0 LEGNPIVLGKHPNSFICLKRLPIGSYF
       :.:::: :.: :....:: :::.::
CCDS31 LDGNPINLSKTPQAYMCLPRLPVGSLV
       300 310 320

>>CCDS1439.1 OPTC gene_id:26254|Hs108|chr1 (332 aa)
 initn: 753 init1: 704 opt: 711 Z-score: 742.7 bits: 145.6 E(32554): 4.8e-35
Smith-Waterman score: 711; 45.8% identity (77.4% similar) in 212 aa overlap (86-297:120-331)

       60 70 80 90 100 110
pF1KE0 NIKEKETVIIPNEKSLQLQKDEAITPLPPKKENDEMPTCLLCVCLSGSVYCEEVDIDAVP
       . : .::::.::::..::::...:.. .:
CCDS14 SPAKSTTAPGTPSSNPTMTRPTTAGLLLSSQPNHGLPTCLVCVCLGSSVYCDDIDLEDIP
       90 100 110 120 130 140

       120 130 140 150 160 170
pF1KE0 PLPKESAYLYARFNKIKKLTAKDFADIPNLRRLDFTGNLIEDIEDGTFSKLSLLEELSLA
       :::...::::::::.:... :.:: . .:.:.:...::: .:.. .: : :..: :
CCDS14 PLPRRTAYLYARFNRISRIRAEDFKGLTKLKRIDLSNNLISSIDNDAFRLLHALQDLILP
       150 160 170 180 190 200

       180 190 200 210 220 230
pF1KE0 ENQLLKLPVLPPKLTLFNAKYNKIKSRGIKANAFKKLNNLTFLYLDHNALESVPLNLPES
       :::: ::::: . ..... :...: ::. ::. ...: ::::. : :.:.: :: :
CCDS14 ENQLEALPVLPSGIEFLDVRLNRLQSSGIQPAAFRAMEKLQFLYLSDNLLDSIPGPLPLS
       210 220 230 240 250 260

       240 250 260 270 280 290
pF1KE0 LRVIHLQFNNIASITDDTFCKANDTSYIRDRIEEIRLEGNPIVLGKHPNSFICLKRLPIG
       :: .::: : : .. :.:: .. .. : ..:.:::.:::: :. :....:: :::::
CCDS14 LRSVHLQNNLIETMQRDVFCDPEEHKHTRRQLEDIRLDGNPINLSLFPSAYFCLPRLPIG
       270 280 290 300 310 320

pF1KE0 SYF
       .
CCDS14 RFT
       330

298 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov 3 12:11:19 2016 done: Thu Nov 3 12:11:19 2016
 Total Scan time: 2.660 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]