Result of FASTA (omim) for pF1KE0486
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0486, 180 aa
  1>>>pF1KE0486 180 - 180 aa - 180 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 4.9005+/-0.000329; mu= 14.2091+/- 0.020
 mean_var=56.4251+/-11.577, 0's: 0 Z-trim(114.4): 12  B-trim: 604 in 1/52
 Lambda= 0.170741
 statistics sampled from 24203 (24213) to 24203 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.679), E-opt: 0.2 (0.284), width:  16
 Scan time:  5.700

The best scores are:                                      opt bits E(85289)
NP_076958 (OMIM: 610152) centromere protein M isof ( 180) 1160 293.6 1.1e-79
NP_001291299 (OMIM: 610152) centromere protein M i ( 146)  953 242.6 2.1e-64
XP_011528670 (OMIM: 610152) PREDICTED: centromere  ( 236)  870 222.2 4.6e-58
NP_001002876 (OMIM: 610152) centromere protein M i ( 107)  668 172.3 2.2e-43
NP_001291300 (OMIM: 610152) centromere protein M i ( 132)  653 168.6 3.5e-42
NP_001291302 (OMIM: 610152) centromere protein M i (  73)  461 121.2 3.6e-28
NP_001103685 (OMIM: 610152) centromere protein M i (  58)  296 80.5 5.2e-16
NP_001291301 (OMIM: 610152) centromere protein M i ( 125)  291 79.5 2.3e-15


>>NP_076958 (OMIM: 610152) centromere protein M isoform   (180 aa)
 initn: 1160 init1: 1160 opt: 1160  Z-score: 1551.3  bits: 293.6 E(85289): 1.1e-79
Smith-Waterman score: 1160; 100.0% identity (100.0% similar) in 180 aa overlap (1-180:1-180)

               10        20        30        40        50        60
pF1KE0 MSVLRPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_076 MSVLRPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 PRIDLIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_076 PRIDLIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 AHTYQSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_076 AHTYQSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL
              130       140       150       160       170       180

>>NP_001291299 (OMIM: 610152) centromere protein M isofo  (146 aa)
 initn: 953 init1: 953 opt: 953  Z-score: 1277.1  bits: 242.6 E(85289): 2.1e-64
Smith-Waterman score: 953; 100.0% identity (100.0% similar) in 146 aa overlap (35-180:1-146)

           10        20        30        40        50        60    
pF1KE0 RPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNRPRID
                                     ::::::::::::::::::::::::::::::
NP_001                               MLKEDCASELKVHLAKSLPLPSSVNRPRID
                                             10        20        30

           70        80        90       100       110       120    
pF1KE0 LIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKLAHTY
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 LIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKLAHTY
               40        50        60        70        80        90

          130       140       150       160       170       180
pF1KE0 QSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 QSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL
              100       110       120       130       140      

>>XP_011528670 (OMIM: 610152) PREDICTED: centromere prot  (236 aa)
 initn: 870 init1: 870 opt: 870  Z-score: 1163.5  bits: 222.2 E(85289): 4.6e-58
Smith-Waterman score: 870; 100.0% identity (100.0% similar) in 134 aa overlap (1-134:1-134)

               10        20        30        40        50        60
pF1KE0 MSVLRPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 MSVLRPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 PRIDLIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
XP_011 PRIDLIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKL
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 AHTYQSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL
       ::::::::::::::                                              
XP_011 AHTYQSPLLYCDLEGLPSWKPGGRPCSPHSMASRCIIPCPGPPALDVGHSARERDPCRPQ
              130       140       150       160       170       180

>>NP_001002876 (OMIM: 610152) centromere protein M isofo  (107 aa)
 initn: 668 init1: 668 opt: 668  Z-score: 899.7  bits: 172.3 E(85289): 2.2e-43
Smith-Waterman score: 668; 99.1% identity (100.0% similar) in 106 aa overlap (1-106:1-106)

               10        20        30        40        50        60
pF1KE0 MSVLRPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MSVLRPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 PRIDLIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKL
       :::::::::::::::::::::::::::::::::::::::::::.::              
NP_001 PRIDLIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGGGRL             
               70        80        90       100                    

              130       140       150       160       170       180
pF1KE0 AHTYQSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL

>>NP_001291300 (OMIM: 610152) centromere protein M isofo  (132 aa)
 initn: 653 init1: 653 opt: 653  Z-score: 878.4  bits: 168.6 E(85289): 3.5e-42
Smith-Waterman score: 653; 100.0% identity (100.0% similar) in 103 aa overlap (1-103:1-103)

               10        20        30        40        50        60
pF1KE0 MSVLRPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MSVLRPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNR
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 PRIDLIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKL
       :::::::::::::::::::::::::::::::::::::::::::                 
NP_001 PRIDLIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGGKYVPRLLLPTPSQGKA
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 AHTYQSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL
                                                                   
NP_001 GAAVGFLLRHPG                                                
              130                                                  

>>NP_001291302 (OMIM: 610152) centromere protein M isofo  (73 aa)
 initn: 461 init1: 461 opt: 461  Z-score: 626.6  bits: 121.2 E(85289): 3.6e-28
Smith-Waterman score: 461; 98.6% identity (100.0% similar) in 72 aa overlap (35-106:1-72)

           10        20        30        40        50        60    
pF1KE0 RPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNRPRID
                                     ::::::::::::::::::::::::::::::
NP_001                               MLKEDCASELKVHLAKSLPLPSSVNRPRID
                                             10        20        30

           70        80        90       100       110       120    
pF1KE0 LIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKLAHTY
       :::::::::::::::::::::::::::::::::::::::.::                  
NP_001 LIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGGGRL                 
               40        50        60        70                    

          130       140       150       160       170       180
pF1KE0 QSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL

>>NP_001103685 (OMIM: 610152) centromere protein M isofo  (58 aa)
 initn: 296 init1: 296 opt: 296  Z-score: 408.5  bits: 80.5 E(85289): 5.2e-16
Smith-Waterman score: 296; 97.9% identity (100.0% similar) in 48 aa overlap (133-180:11-58)

            110       120       130       140       150       160  
pF1KE0 GAGRESHCSIHRHTVVKLAHTYQSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSA
                                     :.::::::::::::::::::::::::::::
NP_001                     MGRVWDLPGVLKVEGFRATMAQRLVRVLQICAGHVPGVSA
                                   10        20        30        40

            170       180
pF1KE0 LNLLSLLRSSEGPSLEDL
       ::::::::::::::::::
NP_001 LNLLSLLRSSEGPSLEDL
               50        

>>NP_001291301 (OMIM: 610152) centromere protein M isofo  (125 aa)
 initn: 294 init1: 277 opt: 291  Z-score: 396.8  bits: 79.5 E(85289): 2.3e-15
Smith-Waterman score: 291; 74.2% identity (83.3% similar) in 66 aa overlap (35-100:1-64)

           10        20        30        40        50        60    
pF1KE0 RPLDKLPGLNTATILLVGTEDALLQQLADSMLKEDCASELKVHLAKSLPLPSSVNRPRID
                                     ::::::::::::::::::::::::::::::
NP_001                               MLKEDCASELKVHLAKSLPLPSSVNRPRID
                                             10        20        30

           70        80        90       100       110       120    
pF1KE0 LIVFVVNLHSKYSLQNTEESLRHVDASFFLGKVCFLATGAGRESHCSIHRHTVVKLAHTY
       :::::::::::: ..... :   :    : . ::::                        
NP_001 LIVFVVNLHSKYRIREARTSAFSVVK--FCSLVCFLTLAWPPQSPEHRGVPAPCGCQLLL
               40        50          60        70        80        

          130       140       150       160       170       180
pF1KE0 QSPLLYCDLEVEGFRATMAQRLVRVLQICAGHVPGVSALNLLSLLRSSEGPSLEDL
                                                               
NP_001 GEGVFPRHRCWAGEPLQHSPAHRGEAGPHLSKPPALL                   
       90       100       110       120                        




180 residues in 1 query   sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Thu Nov  3 04:54:27 2016 done: Thu Nov  3 04:54:28 2016
 Total Scan time:  5.700 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or Suggestions ?
Send a message to flexiclone AT kazusagt.com