Result of FASTA (ccds) for pF1KB9805
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9805, 225 aa
  1>>>pF1KB9805 225 - 225 aa - 225 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 6.4397+/-0.000702; mu= 13.1053+/- 0.043
 mean_var=159.8311+/-31.815, 0's: 0 Z-trim(116.1): 35  B-trim: 0 in 0/54
 Lambda= 0.101448
 statistics sampled from 16686 (16721) to 16686 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.816), E-opt: 0.2 (0.514), width:  16
 Scan time:  2.670

The best scores are:                                      opt bits E(32554)
CCDS33507.2 BHLHE23 gene_id:128408|Hs108|chr20     ( 241) 1512 231.7 3.1e-61
CCDS6179.1 BHLHE22 gene_id:27319|Hs108|chr8        ( 381)  487 81.9   6e-16


>>CCDS33507.2 BHLHE23 gene_id:128408|Hs108|chr20          (241 aa)
 initn: 1512 init1: 1512 opt: 1512  Z-score: 1212.9  bits: 231.7 E(32554): 3.1e-61
Smith-Waterman score: 1512; 100.0% identity (100.0% similar) in 225 aa overlap (1-225:17-241)

                               10        20        30        40    
pF1KB9                 MAELKSLSGDAYLALSHGYAAAAAGLAYGAAREPEAARGYGTPG
                       ::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MSIRPPGEPPSPGGAAMAELKSLSGDAYLALSHGYAAAAAGLAYGAAREPEAARGYGTPG
               10        20        30        40        50        60

           50        60        70        80        90       100    
pF1KB9 PGGDLPAAPAPRAPAQAAESSGEQSGDEDDAFEQRRRRRGPGSAADGRRRPREQRSLRLS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 PGGDLPAAPAPRAPAQAAESSGEQSGDEDDAFEQRRRRRGPGSAADGRRRPREQRSLRLS
               70        80        90       100       110       120

          110       120       130       140       150       160    
pF1KB9 INARERRRMHDLNDALDGLRAVIPYAHSPSVRKLSKIATLLLAKNYILMQAQALDEMRRL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 INARERRRMHDLNDALDGLRAVIPYAHSPSVRKLSKIATLLLAKNYILMQAQALDEMRRL
              130       140       150       160       170       180

          170       180       190       200       210       220    
pF1KB9 VAFLNQGQGLAAPVNAAPLTPFGQATVCPFSAGAALGPCPDKCAAFSGTPSALCKHCHEK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 VAFLNQGQGLAAPVNAAPLTPFGQATVCPFSAGAALGPCPDKCAAFSGTPSALCKHCHEK
              190       200       210       220       230       240

        
pF1KB9 P
       :
CCDS33 P
        

>>CCDS6179.1 BHLHE22 gene_id:27319|Hs108|chr8             (381 aa)
 initn: 618 init1: 466 opt: 487  Z-score: 399.8  bits: 81.9 E(32554): 6e-16
Smith-Waterman score: 599; 50.2% identity (68.5% similar) in 235 aa overlap (9-225:154-381)

                                     10        20          30      
pF1KB9                       MAELKSLSGDAYLALSHGYAA--AAAGLAYGAAREPEA
                                     :   :.:  : :   :. : . :.:.  :.
CCDS61 ALCLKYGESASRGSVAESSGGEQSPDDDSDGRCELVLRAGVADPRASPGAGGGGAKAAEG
           130       140       150       160       170       180   

         40        50        60        70        80        90      
pF1KB9 ARGYGTPGPGGDLPAAPAPRAPAQAAESSGEQSGDEDDAFEQRRRRRGPGSAADGRRRPR
         .    : :...:  :.  . . .. ::. .::    .        . .:.... .. .
CCDS61 CSNAHLHG-GASVP--PGGLGGGGGGGSSSGSSGGGGGS--GSGSGGSSSSSSSSSKKSK
           190          200       210         220       230        

        100       110       120       130       140       150      
pF1KB9 EQRSLRLSINARERRRMHDLNDALDGLRAVIPYAHSPSVRKLSKIATLLLAKNYILMQAQ
       ::..:::.::::::::::::::::: ::::::::::::::::::::::::::::::::::
CCDS61 EQKALRLNINARERRRMHDLNDALDELRAVIPYAHSPSVRKLSKIATLLLAKNYILMQAQ
      240       250       260       270       280       290        

        160       170                180           190       200   
pF1KB9 ALDEMRRLVAFLNQGQGL---------AAPVNAAPLTP----FGQATVCPFSAGAALGP-
       ::.:::::::.:::::..         :: . :: : :    . ::.  :::::  : : 
CCDS61 ALEEMRRLVAYLNQGQAISAASLPSSAAAAAAAAALHPALGAYEQAAGYPFSAG--LPPA
      300       310       320       330       340       350        

              210       220     
pF1KB9 --CPDKCAAFSGTPSALCKHCHEKP
         ::.::: :... :.:::.: :::
CCDS61 ASCPEKCALFNSVSSSLCKQCTEKP
        360       370       380 




225 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov  4 19:07:04 2016 done: Fri Nov  4 19:07:04 2016
 Total Scan time:  2.670 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or Suggestions ?
Send a message to flexiclone AT kazusagt.com