Result of FASTA (ccds) for pF1KB9631
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB9631, 272 aa
  1>>>pF1KB9631 272 - 272 aa - 272 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 7.4093+/-0.000656; mu= 8.5281+/- 0.040
 mean_var=162.3655+/-32.518, 0's: 0 Z-trim(117.2): 43  B-trim: 0 in 0/54
 Lambda= 0.100653
 statistics sampled from 17888 (17932) to 17888 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.83), E-opt: 0.2 (0.551), width:  16
 Scan time:  3.120

The best scores are:                                      opt bits E(32554)
CCDS3698.1 NEUROG2 gene_id:63973|Hs108|chr4        ( 272) 1811 273.6 1.1e-73
CCDS4187.1 NEUROG1 gene_id:4762|Hs108|chr5         ( 237)  437 74.0 1.1e-13
CCDS31212.1 NEUROG3 gene_id:50674|Hs108|chr10      ( 214)  414 70.6   1e-12


>>CCDS3698.1 NEUROG2 gene_id:63973|Hs108|chr4             (272 aa)
 initn: 1811 init1: 1811 opt: 1811  Z-score: 1436.6  bits: 273.6 E(32554): 1.1e-73
Smith-Waterman score: 1811; 100.0% identity (100.0% similar) in 272 aa overlap (1-272:1-272)

               10        20        30        40        50        60
pF1KB9 MFVKSETLELKEEEDVLVLLGSASPALAALTPLSSSADEEEEEEPGASGGARRQRGAEAG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS36 MFVKSETLELKEEEDVLVLLGSASPALAALTPLSSSADEEEEEEPGASGGARRQRGAEAG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 QGARGGVAAGAEGCRPARLLGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS36 QGARGGVAAGAEGCRPARLLGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 ERNRMHNLNAALDALREVLPTFPEDAKLTKIETLRFAHNYIWALTETLRLADHCGGGGGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS36 ERNRMHNLNAALDALREVLPTFPEDAKLTKIETLRFAHNYIWALTETLRLADHCGGGGGG
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 LPGALFSEAVLLSPGGASAALSSSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS36 LPGALFSEAVLLSPGGASAALSSSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSP
              190       200       210       220       230       240

              250       260       270  
pF1KB9 ASPAGSDMDYWQPPPPDKHRYAPHLPIARDCI
       ::::::::::::::::::::::::::::::::
CCDS36 ASPAGSDMDYWQPPPPDKHRYAPHLPIARDCI
              250       260       270  

>>CCDS4187.1 NEUROG1 gene_id:4762|Hs108|chr5              (237 aa)
 initn: 476 init1: 373 opt: 437  Z-score: 359.1  bits: 74.0 E(32554): 1.1e-13
Smith-Waterman score: 437; 46.2% identity (67.7% similar) in 186 aa overlap (51-233:33-215)

               30        40        50        60        70          
pF1KB9 GSASPALAALTPLSSSADEEEEEEPGASGGARRQRGAEAGQGARGGVAAGAEG-CRPARL
                                     :: :..: :. :  . .  :: .  : ...
CCDS41 ARLETCISDLDCASSSGSDLSGFLTDEEDCARLQQAASAS-GPPAPARRGAPNISRASEV
             10        20        30        40         50        60 

      80        90       100       110       120       130         
pF1KB9 LGLVHDCKRRPSRARAVSRGAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVL
        :   : ..:  : :. .: ...   .. ....::.:::.::::::::::::::::: ::
CCDS41 PGAQDDEQER-RRRRGRTR-VRSEALLHSLRRSRRVKANDRERNRMHNLNAALDALRSVL
              70          80        90       100       110         

     140       150       160       170       180       190         
pF1KB9 PTFPEDAKLTKIETLRFAHNYIWALTETLRLADHCGGGGGGLPGALFSEAVLLSPGGASA
       :.::.:.:::::::::::.::::::.:::::::.   :::.    :  . :   ::  : 
CCDS41 PSFPDDTKLTKIETLRFAYNYIWALAETLRLADQGLPGGGARERLLPPQCVPCLPGPPSP
     120       130       140       150       160       170         

     200         210       220       230       240       250       
pF1KB9 ALS--SSGDSPSPASTWSCTNSPAPSSSVSSNSTSPYSCTLSPASPAGSDMDYWQPPPPD
       : .  : :.. . ::  :  .::: : . .    .:                        
CCDS41 ASDAESWGSGAAAASPLSDPSSPAASEDFTYRPGDPVFSFPSLPKDLLHTTPCFIPYH  
     180       190       200       210       220       230         

       260       270  
pF1KB9 KHRYAPHLPIARDCI

>>CCDS31212.1 NEUROG3 gene_id:50674|Hs108|chr10           (214 aa)
 initn: 423 init1: 372 opt: 414  Z-score: 341.7  bits: 70.6 E(32554): 1e-12
Smith-Waterman score: 427; 48.4% identity (64.7% similar) in 190 aa overlap (64-247:43-212)

            40        50        60        70           80        90
pF1KB9 SSSADEEEEEEPGASGGARRQRGAEAGQGARGGVAAGAEG-CR--PARLLGLVHDCKRRP
                                     ::. : . :: ::  : .: .      :: 
CCDS31 VTRETERSFPRASEDEVTCPTSAPPSPTRTRGNCAEAEEGGCRGAPRKLRA------RRG
             20        30        40        50        60            

              100       110       120       130       140       150
pF1KB9 SRARAVSRGAKTAETVQRIKKTRRLKANNRERNRMHNLNAALDALREVLPTFPEDAKLTK
       .:.:      :.  .... ...:: :::.::::::::::.:::::: ::::::.::::::
CCDS31 GRSRP-----KSELALSKQRRSRRKKANDRERNRMHNLNSALDALRGVLPTFPDDAKLTK
         70             80        90       100       110       120 

              160       170       180       190       200       210
pF1KB9 IETLRFAHNYIWALTETLRLADHCGGGGGGLPGALFSEAVLLSPGGASAALSSSGDSPSP
       :::::::::::::::.:::.:::   .    :.   .:  : ::::      : ::  : 
CCDS31 IETLRFAHNYIWALTQTLRIADHSLYALEP-PAPHCGE--LGSPGG------SPGDWGSL
             130       140       150          160             170  

              220       230          240       250       260       
pF1KB9 ASTWSCTNSPAPSSSVSSNST---SPYSCTLSPASPAGSDMDYWQPPPPDKHRYAPHLPI
        :  : ..: .:..:.        . .:  :::.: : ::                    
CCDS31 YSPVSQAGSLSPAASLEERPGLLGATFSACLSPGSLAFSDFL                  
            180       190       200       210                      

       270  
pF1KB9 ARDCI




272 residues in 1 query   sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov  4 17:50:59 2016 done: Fri Nov  4 17:50:59 2016
 Total Scan time:  3.120 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]