FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9805, 225 aa 1>>>pF1KB9805 225 - 225 aa - 225 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 6.4397+/-0.000702; mu= 13.1053+/- 0.043 mean_var=159.8311+/-31.815, 0's: 0 Z-trim(116.1): 35 B-trim: 0 in 0/54 Lambda= 0.101448 statistics sampled from 16686 (16721) to 16686 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.816), E-opt: 0.2 (0.514), width: 16 Scan time: 2.670 The best scores are: opt bits E(32554) CCDS33507.2 BHLHE23 gene_id:128408|Hs108|chr20 ( 241) 1512 231.7 3.1e-61 CCDS6179.1 BHLHE22 gene_id:27319|Hs108|chr8 ( 381) 487 81.9 6e-16 >>CCDS33507.2 BHLHE23 gene_id:128408|Hs108|chr20 (241 aa) initn: 1512 init1: 1512 opt: 1512 Z-score: 1212.9 bits: 231.7 E(32554): 3.1e-61 Smith-Waterman score: 1512; 100.0% identity (100.0% similar) in 225 aa overlap (1-225:17-241) 10 20 30 40 pF1KB9 MAELKSLSGDAYLALSHGYAAAAAGLAYGAAREPEAARGYGTPG :::::::::::::::::::::::::::::::::::::::::::: CCDS33 MSIRPPGEPPSPGGAAMAELKSLSGDAYLALSHGYAAAAAGLAYGAAREPEAARGYGTPG 10 20 30 40 50 60 50 60 70 80 90 100 pF1KB9 PGGDLPAAPAPRAPAQAAESSGEQSGDEDDAFEQRRRRRGPGSAADGRRRPREQRSLRLS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 PGGDLPAAPAPRAPAQAAESSGEQSGDEDDAFEQRRRRRGPGSAADGRRRPREQRSLRLS 70 80 90 100 110 120 110 120 130 140 150 160 pF1KB9 INARERRRMHDLNDALDGLRAVIPYAHSPSVRKLSKIATLLLAKNYILMQAQALDEMRRL :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 INARERRRMHDLNDALDGLRAVIPYAHSPSVRKLSKIATLLLAKNYILMQAQALDEMRRL 130 140 150 160 170 180 170 180 190 200 210 220 pF1KB9 VAFLNQGQGLAAPVNAAPLTPFGQATVCPFSAGAALGPCPDKCAAFSGTPSALCKHCHEK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 VAFLNQGQGLAAPVNAAPLTPFGQATVCPFSAGAALGPCPDKCAAFSGTPSALCKHCHEK 190 200 210 220 230 240 pF1KB9 P : CCDS33 P >>CCDS6179.1 BHLHE22 gene_id:27319|Hs108|chr8 (381 aa) initn: 618 init1: 466 opt: 487 Z-score: 399.8 bits: 81.9 E(32554): 6e-16 Smith-Waterman score: 599; 50.2% identity (68.5% similar) in 235 aa overlap (9-225:154-381) 10 20 30 pF1KB9 MAELKSLSGDAYLALSHGYAA--AAAGLAYGAAREPEA : :.: : : :. : . :.:. :. CCDS61 ALCLKYGESASRGSVAESSGGEQSPDDDSDGRCELVLRAGVADPRASPGAGGGGAKAAEG 130 140 150 160 170 180 40 50 60 70 80 90 pF1KB9 ARGYGTPGPGGDLPAAPAPRAPAQAAESSGEQSGDEDDAFEQRRRRRGPGSAADGRRRPR . : :...: :. . . .. ::. .:: . . .:.... .. . CCDS61 CSNAHLHG-GASVP--PGGLGGGGGGGSSSGSSGGGGGS--GSGSGGSSSSSSSSSKKSK 190 200 210 220 230 100 110 120 130 140 150 pF1KB9 EQRSLRLSINARERRRMHDLNDALDGLRAVIPYAHSPSVRKLSKIATLLLAKNYILMQAQ ::..:::.::::::::::::::::: :::::::::::::::::::::::::::::::::: CCDS61 EQKALRLNINARERRRMHDLNDALDELRAVIPYAHSPSVRKLSKIATLLLAKNYILMQAQ 240 250 260 270 280 290 160 170 180 190 200 pF1KB9 ALDEMRRLVAFLNQGQGL---------AAPVNAAPLTP----FGQATVCPFSAGAALGP- ::.:::::::.:::::.. :: . :: : : . ::. ::::: : : CCDS61 ALEEMRRLVAYLNQGQAISAASLPSSAAAAAAAAALHPALGAYEQAAGYPFSAG--LPPA 300 310 320 330 340 350 210 220 pF1KB9 --CPDKCAAFSGTPSALCKHCHEKP ::.::: :... :.:::.: ::: CCDS61 ASCPEKCALFNSVSSSLCKQCTEKP 360 370 380 225 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 19:07:04 2016 done: Fri Nov 4 19:07:04 2016 Total Scan time: 2.670 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]