Result of FASTA (omim) for pF1KB5535
FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB5535, 463 aa
  1>>>pF1KB5535 463 - 463 aa - 463 aa
Library: /omim/omim.rfq.tfa
  60827320 residues in 85289 sequences

Statistics:  Expectation_n fit: rho(ln(x))= 5.2378+/-0.000379; mu= 17.8542+/- 0.023
 mean_var=63.7592+/-13.045, 0's: 0 Z-trim(111.8): 54  B-trim: 103 in 1/50
 Lambda= 0.160621
 statistics sampled from 20463 (20517) to 20463 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.6), E-opt: 0.2 (0.241), width:  16
 Scan time:  8.750

The best scores are:                                      opt bits E(85289)
NP_001805 (OMIM: 170650,245000,245010,602365) dipe ( 463) 3212 753.3 3.1e-217
NP_680475 (OMIM: 170650,245000,245010,602365) dipe ( 137)  719 175.3 8.9e-44
NP_001107645 (OMIM: 170650,245000,245010,602365) d ( 141)  719 175.4 9.1e-44
NP_001188504 (OMIM: 603308) cathepsin L2 prepropro ( 334)  452 113.7 7.9e-25
NP_001324 (OMIM: 603308) cathepsin L2 preproprotei ( 334)  452 113.7 7.9e-25
NP_001327 (OMIM: 603169) cathepsin Z preproprotein ( 303)  418 105.8 1.7e-22
NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333)  386 98.4 3.2e-20
NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333)  386 98.4 3.2e-20
NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 ( 333)  386 98.4 3.2e-20
XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L ( 333)  386 98.4 3.2e-20
NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 ( 333)  386 98.4 3.2e-20
XP_005254238 (OMIM: 116820) PREDICTED: pro-catheps ( 297)  383 97.7 4.7e-20
XP_016877441 (OMIM: 116820) PREDICTED: pro-catheps ( 297)  383 97.7 4.7e-20
NP_004381 (OMIM: 116820) pro-cathepsin H isoform a ( 335)  383 97.7 5.2e-20
XP_016877440 (OMIM: 116820) PREDICTED: pro-catheps ( 317)  382 97.4 5.8e-20
NP_001304166 (OMIM: 116810) cathepsin B isoform 2  ( 215)  323 83.7 5.5e-16
XP_016868590 (OMIM: 116810) PREDICTED: cathepsin B ( 245)  323 83.7 6.1e-16
XP_006716308 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  323 83.8   8e-16
XP_016868586 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  323 83.8   8e-16
XP_011542114 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  323 83.8   8e-16
XP_016868589 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  323 83.8   8e-16
NP_680092 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  323 83.8   8e-16
NP_680093 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  323 83.8   8e-16
XP_006716307 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  323 83.8   8e-16
XP_016868588 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  323 83.8   8e-16
NP_001899 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  323 83.8   8e-16
NP_680090 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  323 83.8   8e-16
XP_016868587 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  323 83.8   8e-16
NP_680091 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  323 83.8   8e-16
NP_001306066 (OMIM: 116820) pro-cathepsin H isofor ( 201)  295 77.2 4.6e-14
XP_016866237 (OMIM: 606749) PREDICTED: tubulointer ( 309)  297 77.7 4.8e-14
XP_016866236 (OMIM: 606749) PREDICTED: tubulointer ( 351)  297 77.8 5.4e-14
XP_016866235 (OMIM: 606749) PREDICTED: tubulointer ( 401)  297 77.8   6e-14
XP_011512799 (OMIM: 606749) PREDICTED: tubulointer ( 426)  297 77.8 6.3e-14
XP_006715125 (OMIM: 606749) PREDICTED: tubulointer ( 458)  297 77.8 6.7e-14
NP_055279 (OMIM: 606749) tubulointerstitial nephri ( 476)  297 77.8 6.9e-14
NP_000387 (OMIM: 265800,601105) cathepsin K prepro ( 329)  286 75.2   3e-13
NP_001191344 (OMIM: 616064) tubulointerstitial nep ( 362)  277 73.1 1.4e-12
XP_005271164 (OMIM: 616064) PREDICTED: tubulointer ( 362)  277 73.1 1.4e-12
XP_011540248 (OMIM: 616064) PREDICTED: tubulointer ( 408)  277 73.2 1.5e-12
NP_001191343 (OMIM: 616064) tubulointerstitial nep ( 436)  277 73.2 1.6e-12
NP_071447 (OMIM: 616064) tubulointerstitial nephri ( 467)  277 73.2 1.7e-12
XP_016869782 (OMIM: 116880) PREDICTED: cathepsin L ( 272)  266 70.5 6.3e-12
XP_011516565 (OMIM: 116880) PREDICTED: cathepsin L ( 272)  266 70.5 6.3e-12
NP_001244902 (OMIM: 116880) cathepsin L1 isoform 2 ( 151)  261 69.2 8.6e-12
XP_011543630 (OMIM: 603539,615362) PREDICTED: cath ( 424)  257 68.5 3.9e-11
NP_003784 (OMIM: 603539,615362) cathepsin F precur ( 484)  257 68.6 4.3e-11
NP_004070 (OMIM: 116845) cathepsin S isoform 1 pre ( 331)  235 63.4 1.1e-09
NP_001186668 (OMIM: 116845) cathepsin S isoform 2  ( 281)  229 62.0 2.5e-09
XP_005271163 (OMIM: 616064) PREDICTED: tubulointer ( 440)  227 61.6   5e-09


>>NP_001805 (OMIM: 170650,245000,245010,602365) dipeptid  (463 aa)
 initn: 3212 init1: 3212 opt: 3212  Z-score: 4021.1  bits: 753.3 E(85289): 3.1e-217
Smith-Waterman score: 3212; 100.0% identity (100.0% similar) in 463 aa overlap (1-463:1-463)

               10        20        30        40        50        60
pF1KB5 MGAGPSLLLAALLLLLSGDGAVRCDTPANCTYLDLLGTWVFQVGSSGSQRDVNCSVMGPQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MGAGPSLLLAALLLLLSGDGAVRCDTPANCTYLDLLGTWVFQVGSSGSQRDVNCSVMGPQ
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 EKKVVVYLQKLDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWFAFFKYKEEGSKVTTYCNE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 EKKVVVYLQKLDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWFAFFKYKEEGSKVTTYCNE
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 TMTGWVHDVLGRNWACFTGKKVGTASENVYVNTAHLKNSQEKYSNRLYKYDHNFVKAINA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 TMTGWVHDVLGRNWACFTGKKVGTASENVYVNTAHLKNSQEKYSNRLYKYDHNFVKAINA
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB5 IQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILHLPTSWDWRNV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 IQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILHLPTSWDWRNV
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB5 HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEG
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB5 GFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 GFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALM
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB5 KLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 KLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGM
              370       380       390       400       410       420

              430       440       450       460   
pF1KB5 DYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPKL
       :::::::::::::::::::::::::::::::::::::::::::
NP_001 DYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPKL
              430       440       450       460   

>>NP_680475 (OMIM: 170650,245000,245010,602365) dipeptid  (137 aa)
 initn: 719 init1: 719 opt: 719  Z-score: 906.9  bits: 175.3 E(85289): 8.9e-44
Smith-Waterman score: 719; 100.0% identity (100.0% similar) in 106 aa overlap (1-106:1-106)

               10        20        30        40        50        60
pF1KB5 MGAGPSLLLAALLLLLSGDGAVRCDTPANCTYLDLLGTWVFQVGSSGSQRDVNCSVMGPQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_680 MGAGPSLLLAALLLLLSGDGAVRCDTPANCTYLDLLGTWVFQVGSSGSQRDVNCSVMGPQ
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 EKKVVVYLQKLDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWFAFFKYKEEGSKVTTYCNE
       ::::::::::::::::::::::::::::::::::::::::::::::              
NP_680 EKKVVVYLQKLDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWFAFFKDVTDFISHLFMQLG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 TMTGWVHDVLGRNWACFTGKKVGTASENVYVNTAHLKNSQEKYSNRLYKYDHNFVKAINA
                                                                   
NP_680 TVGIYDLPHLRNKLVIK                                           
              130                                                  

>>NP_001107645 (OMIM: 170650,245000,245010,602365) dipep  (141 aa)
 initn: 719 init1: 719 opt: 719  Z-score: 906.7  bits: 175.4 E(85289): 9.1e-44
Smith-Waterman score: 719; 100.0% identity (100.0% similar) in 106 aa overlap (1-106:1-106)

               10        20        30        40        50        60
pF1KB5 MGAGPSLLLAALLLLLSGDGAVRCDTPANCTYLDLLGTWVFQVGSSGSQRDVNCSVMGPQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MGAGPSLLLAALLLLLSGDGAVRCDTPANCTYLDLLGTWVFQVGSSGSQRDVNCSVMGPQ
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB5 EKKVVVYLQKLDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWFAFFKYKEEGSKVTTYCNE
       ::::::::::::::::::::::::::::::::::::::::::::::              
NP_001 EKKVVVYLQKLDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWFAFFKDVTDFISHLFMQLG
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB5 TMTGWVHDVLGRNWACFTGKKVGTASENVYVNTAHLKNSQEKYSNRLYKYDHNFVKAINA
                                                                   
NP_001 TVGIYDLPHLRNKLAMNRRWG                                       
              130       140                                        

>>NP_001188504 (OMIM: 603308) cathepsin L2 preproprotein  (334 aa)
 initn: 394 init1: 170 opt: 452  Z-score: 566.7  bits: 113.7 E(85289): 7.9e-25
Smith-Waterman score: 454; 31.3% identity (60.0% similar) in 335 aa overlap (128-454:26-329)

       100       110       120       130       140       150       
pF1KB5 DYKWFAFFKYKEEGSKVTTYCNETMTGWVHDVLGRNWACFTGKKVGTASENVYVNTAHLK
                                     :.   .:   : ...  :.:. .  ..  :
NP_001      MNLSLVLAAFCLGIASAVPKFDQNLDTKWYQWKA-THRRLYGANEEGWRRAVWEK
                    10        20        30         40        50    

       160         170       180       190       200       210     
pF1KB5 NSQ--EKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRP
       : .  : ....  .  :.:. :.::.       :  :.. . .:  .: .  .. :. : 
NP_001 NMKMIELHNGEYSQGKHGFTMAMNAFGD----MTNEEFRQM-MG-CFRNQKFRKGKVFRE
           60        70        80            90         100        

         220       230       240       250       260       270     
pF1KB5 KPAPLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILT
          ::       .: :: : :::. .:  .:.::.:: .::::..:.. : ::...  . 
NP_001 ---PL-------FLDLPKSVDWRK-KG--YVTPVKNQKQCGSCWAFSATGALEGQM--FR
         110              120          130       140       150     

         280       290         300       310       320       330   
pF1KB5 NNSQTPILSPQEVVSCS--QYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKM
       ....   :: :..:.::  :  :::.:::         .. ::  :  .::...:  ::.
NP_001 KTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKY
           160       170       180       190       200       210   

           340       350       360       370        380       390  
pF1KB5 KEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEV-YDDFLHYKKGIYHHTG
       . .     .. .  :.    : ..::::  ..  ::..::... ...:  ::.:::    
NP_001 RPENSVANDTGFTVVAP---GKEKALMK-AVATVGPISVAMDAGHSSFQFYKSGIYF---
           220       230           240       250       260         

            400       410       420         430       440          
pF1KB5 LRDPFNPFELTNHAVLLVGYGTDSASGMD--YWIVKNSWGTGWGENGYFRIRRG-TDECA
         .:    .  .:.::.:::: ..:.. .  ::.::::::  :: ::: .: .  ...:.
NP_001 --EPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCG
          270       280       290       300       310       320    

     450       460   
pF1KB5 IESIAVAATPIPKL
       : . :         
NP_001 IATAASYPNV    
          330        

>>NP_001324 (OMIM: 603308) cathepsin L2 preproprotein [H  (334 aa)
 initn: 394 init1: 170 opt: 452  Z-score: 566.7  bits: 113.7 E(85289): 7.9e-25
Smith-Waterman score: 454; 31.3% identity (60.0% similar) in 335 aa overlap (128-454:26-329)

       100       110       120       130       140       150       
pF1KB5 DYKWFAFFKYKEEGSKVTTYCNETMTGWVHDVLGRNWACFTGKKVGTASENVYVNTAHLK
                                     :.   .:   : ...  :.:. .  ..  :
NP_001      MNLSLVLAAFCLGIASAVPKFDQNLDTKWYQWKA-THRRLYGANEEGWRRAVWEK
                    10        20        30         40        50    

       160         170       180       190       200       210     
pF1KB5 NSQ--EKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRP
       : .  : ....  .  :.:. :.::.       :  :.. . .:  .: .  .. :. : 
NP_001 NMKMIELHNGEYSQGKHGFTMAMNAFGD----MTNEEFRQM-MG-CFRNQKFRKGKVFRE
           60        70        80            90         100        

         220       230       240       250       260       270     
pF1KB5 KPAPLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILT
          ::       .: :: : :::. .:  .:.::.:: .::::..:.. : ::...  . 
NP_001 ---PL-------FLDLPKSVDWRK-KG--YVTPVKNQKQCGSCWAFSATGALEGQM--FR
         110              120          130       140       150     

         280       290         300       310       320       330   
pF1KB5 NNSQTPILSPQEVVSCS--QYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKM
       ....   :: :..:.::  :  :::.:::         .. ::  :  .::...:  ::.
NP_001 KTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKY
           160       170       180       190       200       210   

           340       350       360       370        380       390  
pF1KB5 KEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEV-YDDFLHYKKGIYHHTG
       . .     .. .  :.    : ..::::  ..  ::..::... ...:  ::.:::    
NP_001 RPENSVANDTGFTVVAP---GKEKALMK-AVATVGPISVAMDAGHSSFQFYKSGIYF---
           220       230           240       250       260         

            400       410       420         430       440          
pF1KB5 LRDPFNPFELTNHAVLLVGYGTDSASGMD--YWIVKNSWGTGWGENGYFRIRRG-TDECA
         .:    .  .:.::.:::: ..:.. .  ::.::::::  :: ::: .: .  ...:.
NP_001 --EPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCG
          270       280       290       300       310       320    

     450       460   
pF1KB5 IESIAVAATPIPKL
       : . :         
NP_001 IATAASYPNV    
          330        

>>NP_001327 (OMIM: 603169) cathepsin Z preproprotein [Ho  (303 aa)
 initn: 300 init1: 131 opt: 418  Z-score: 524.8  bits: 105.8 E(85289): 1.7e-22
Smith-Waterman score: 418; 33.8% identity (60.1% similar) in 228 aa overlap (231-445:62-279)

              210       220       230       240       250          
pF1KB5 MIRRSGGHSRKIPRPKPAPLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQ---ASCGS
                                     :: ::::::: :.:..: .:::     :::
NP_001 TCYRPLRGDGLAPLGRSTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGS
              40        50        60        70        80        90 

       260       270        280       290       300       310      
pF1KB5 CYSFASMGMLEARIRILTNNSQ-TPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGL
       :.. :: . .  :: :  ...  . .:: :.:..:.. : .::::   : .  ::.. :.
NP_001 CWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCGN-AGSCEGGND-LSVWDYAHQHGI
             100       110       120        130        140         

        320       330            340           350       360       
pF1KB5 VEEACFPYTGTDSPCKMKEDC-----FRYYSSEYHY----VGGFYGGCNEALMKLELVHH
        .:.:  : . :. :   ..:     :.   .  .:    :: . .  ..  :  :.  .
NP_001 PDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREKMMAEIYAN
     150       160       170       180       190       200         

       370       380       390       400       410       420       
pF1KB5 GPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKN
       ::.. .. . . . .:  ::: .      ..     ::.: ..:.:   ..: .::::.:
NP_001 GPISCGIMATERLANYTGGIYAE------YQDTTYINHVVSVAGWGI--SDGTEYWIVRN
     210       220       230             240       250         260 

       430       440       450       460         
pF1KB5 SWGTGWGENGYFRIRRGTDECAIESIAVAATPIPKL      
       :::  ::: :..::  .:                        
NP_001 SWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGDPIV
             270       280       290       300   

>>NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 prepro  (333 aa)
 initn: 341 init1: 165 opt: 386  Z-score: 484.1  bits: 98.4 E(85289): 3.2e-20
Smith-Waterman score: 432; 36.1% identity (63.5% similar) in 249 aa overlap (219-454:100-328)

      190       200       210       220       230         240      
pF1KB5 TYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILHL--PTSWDWRNVHGINFV
                                     :  ... :. :    : : :::. .:  .:
NP_001 KHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWRE-KG--YV
      70        80        90       100       110       120         

        250       260       270       280       290         300    
pF1KB5 SPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCS--QYAQGCEGGFPY
       .::.::..::::..:.. : ::..  .. ....   :: :..:.::  :  .::.::. .
NP_001 TPVKNQGQCGSCWAFSATGALEGQ--MFRKTGRLISLSEQNLVDCSGPQGNEGCNGGL-M
        130       140       150         160       170       180    

          310        320       330       340         350           
pF1KB5 LIAGKYAQDFG-LVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVG--GFYG--GCNEAL
         : .:.:: : :  :  .:: .:.  ::        :. .:  ..  ::      ..::
NP_001 DYAFQYVQDNGGLDSEESYPYEATEESCK--------YNPKYSVANDTGFVDIPKQEKAL
           190       200       210               220       230     

     360       370        380       390       400       410        
pF1KB5 MKLELVHHGPMAVAFEV-YDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDS--
       ::  ..  ::..::... ...:: ::.:::      .:    :  .:.::.:::: .:  
NP_001 MK-AVATVGPISVAIDAGHESFLFYKEGIYF-----EPDCSSEDMDHGVLVVGYGFESTE
          240       250       260            270       280         

        420       430       440        450       460   
pF1KB5 ASGMDYWIVKNSWGTGWGENGYFRIRRGT-DECAIESIAVAATPIPKL
       ...  ::.::::::  :: .:: .. .   ..:.: : :         
NP_001 SDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV    
     290       300       310       320       330       

>>NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 prepro  (333 aa)
 initn: 341 init1: 165 opt: 386  Z-score: 484.1  bits: 98.4 E(85289): 3.2e-20
Smith-Waterman score: 432; 36.1% identity (63.5% similar) in 249 aa overlap (219-454:100-328)

      190       200       210       220       230         240      
pF1KB5 TYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILHL--PTSWDWRNVHGINFV
                                     :  ... :. :    : : :::. .:  .:
NP_666 KHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWRE-KG--YV
      70        80        90       100       110       120         

        250       260       270       280       290         300    
pF1KB5 SPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCS--QYAQGCEGGFPY
       .::.::..::::..:.. : ::..  .. ....   :: :..:.::  :  .::.::. .
NP_666 TPVKNQGQCGSCWAFSATGALEGQ--MFRKTGRLISLSEQNLVDCSGPQGNEGCNGGL-M
        130       140       150         160       170       180    

          310        320       330       340         350           
pF1KB5 LIAGKYAQDFG-LVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVG--GFYG--GCNEAL
         : .:.:: : :  :  .:: .:.  ::        :. .:  ..  ::      ..::
NP_666 DYAFQYVQDNGGLDSEESYPYEATEESCK--------YNPKYSVANDTGFVDIPKQEKAL
           190       200       210               220       230     

     360       370        380       390       400       410        
pF1KB5 MKLELVHHGPMAVAFEV-YDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDS--
       ::  ..  ::..::... ...:: ::.:::      .:    :  .:.::.:::: .:  
NP_666 MK-AVATVGPISVAIDAGHESFLFYKEGIYF-----EPDCSSEDMDHGVLVVGYGFESTE
          240       250       260            270       280         

        420       430       440        450       460   
pF1KB5 ASGMDYWIVKNSWGTGWGENGYFRIRRGT-DECAIESIAVAATPIPKL
       ...  ::.::::::  :: .:: .. .   ..:.: : :         
NP_666 SDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV    
     290       300       310       320       330       

>>NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 pre  (333 aa)
 initn: 341 init1: 165 opt: 386  Z-score: 484.1  bits: 98.4 E(85289): 3.2e-20
Smith-Waterman score: 432; 36.1% identity (63.5% similar) in 249 aa overlap (219-454:100-328)

      190       200       210       220       230         240      
pF1KB5 TYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILHL--PTSWDWRNVHGINFV
                                     :  ... :. :    : : :::. .:  .:
NP_001 KHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWRE-KG--YV
      70        80        90       100       110       120         

        250       260       270       280       290         300    
pF1KB5 SPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCS--QYAQGCEGGFPY
       .::.::..::::..:.. : ::..  .. ....   :: :..:.::  :  .::.::. .
NP_001 TPVKNQGQCGSCWAFSATGALEGQ--MFRKTGRLISLSEQNLVDCSGPQGNEGCNGGL-M
        130       140       150         160       170       180    

          310        320       330       340         350           
pF1KB5 LIAGKYAQDFG-LVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVG--GFYG--GCNEAL
         : .:.:: : :  :  .:: .:.  ::        :. .:  ..  ::      ..::
NP_001 DYAFQYVQDNGGLDSEESYPYEATEESCK--------YNPKYSVANDTGFVDIPKQEKAL
           190       200       210               220       230     

     360       370        380       390       400       410        
pF1KB5 MKLELVHHGPMAVAFEV-YDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDS--
       ::  ..  ::..::... ...:: ::.:::      .:    :  .:.::.:::: .:  
NP_001 MK-AVATVGPISVAIDAGHESFLFYKEGIYF-----EPDCSSEDMDHGVLVVGYGFESTE
          240       250       260            270       280         

        420       430       440        450       460   
pF1KB5 ASGMDYWIVKNSWGTGWGENGYFRIRRGT-DECAIESIAVAATPIPKL
       ...  ::.::::::  :: .:: .. .   ..:.: : :         
NP_001 SDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV    
     290       300       310       320       330       

>>XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L1 is  (333 aa)
 initn: 341 init1: 165 opt: 386  Z-score: 484.1  bits: 98.4 E(85289): 3.2e-20
Smith-Waterman score: 432; 36.1% identity (63.5% similar) in 249 aa overlap (219-454:100-328)

      190       200       210       220       230         240      
pF1KB5 TYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILHL--PTSWDWRNVHGINFV
                                     :  ... :. :    : : :::. .:  .:
XP_005 KHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWRE-KG--YV
      70        80        90       100       110       120         

        250       260       270       280       290         300    
pF1KB5 SPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCS--QYAQGCEGGFPY
       .::.::..::::..:.. : ::..  .. ....   :: :..:.::  :  .::.::. .
XP_005 TPVKNQGQCGSCWAFSATGALEGQ--MFRKTGRLISLSEQNLVDCSGPQGNEGCNGGL-M
        130       140       150         160       170       180    

          310        320       330       340         350           
pF1KB5 LIAGKYAQDFG-LVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVG--GFYG--GCNEAL
         : .:.:: : :  :  .:: .:.  ::        :. .:  ..  ::      ..::
XP_005 DYAFQYVQDNGGLDSEESYPYEATEESCK--------YNPKYSVANDTGFVDIPKQEKAL
           190       200       210               220       230     

     360       370        380       390       400       410        
pF1KB5 MKLELVHHGPMAVAFEV-YDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDS--
       ::  ..  ::..::... ...:: ::.:::      .:    :  .:.::.:::: .:  
XP_005 MK-AVATVGPISVAIDAGHESFLFYKEGIYF-----EPDCSSEDMDHGVLVVGYGFESTE
          240       250       260            270       280         

        420       430       440        450       460   
pF1KB5 ASGMDYWIVKNSWGTGWGENGYFRIRRGT-DECAIESIAVAATPIPKL
       ...  ::.::::::  :: .:: .. .   ..:.: : :         
XP_005 SDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV    
     290       300       310       320       330       




463 residues in 1 query   sequences
60827320 residues in 85289 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sat Nov  5 13:04:02 2016 done: Sat Nov  5 13:04:03 2016
 Total Scan time:  8.750 Total Display time:  0.030

Function used was FASTA [36.3.4 Apr, 2011]
Inquiries or Suggestions ?
Send a message to flexiclone AT kazusagt.com