FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE0209, 303 aa
 1>>>pF1KE0209 303 - 303 aa - 303 aa
Library: /omim/omim.rfq.tfa
 60827320 residues in 85289 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.3131+/-0.000336; mu= 15.3827+/- 0.021
 mean_var=71.9212+/-14.263, 0's: 0 Z-trim(115.0): 53 B-trim: 0 in 0/56
 Lambda= 0.151233
 statistics sampled from 25149 (25203) to 25149 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.663), E-opt: 0.2 (0.296), width: 16
 Scan time: 6.780

The best scores are: opt bits E(85289)
NP_001327 (OMIM: 603169) cathepsin Z preproprotein ( 303) 2187 486.3 3.3e-137
NP_001805 (OMIM: 170650,245000,245010,602365) dipe ( 463) 418 100.4 7.1e-21
NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 ( 333) 305 75.7 1.4e-13
NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333) 305 75.7 1.4e-13
NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333) 305 75.7 1.4e-13
NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 ( 333) 305 75.7 1.4e-13
XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L ( 333) 305 75.7 1.4e-13
NP_001188504 (OMIM: 603308) cathepsin L2 prepropro ( 334) 298 74.1 4.1e-13
NP_001324 (OMIM: 603308) cathepsin L2 preproprotei ( 334) 298 74.1 4.1e-13
NP_004070 (OMIM: 116845) cathepsin S isoform 1 pre ( 331) 270 68.0 2.8e-11
NP_001186668 (OMIM: 116845) cathepsin S isoform 2 ( 281) 254 64.5 2.8e-10
XP_016868590 (OMIM: 116810) PREDICTED: cathepsin B ( 245) 241 61.6 1.8e-09
NP_001304166 (OMIM: 116810) cathepsin B isoform 2 ( 215) 236 60.5 3.4e-09
NP_680090 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 236 60.6 4.9e-09
XP_011542114 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 236 60.6 4.9e-09
XP_016868588 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 236 60.6 4.9e-09
XP_016868586 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 236 60.6 4.9e-09
NP_680092 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 236 60.6 4.9e-09
NP_001899 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 236 60.6 4.9e-09
XP_006716308 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 236 60.6 4.9e-09
XP_016868589 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 236 60.6 4.9e-09
XP_016868587 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 236 60.6 4.9e-09
NP_680093 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 236 60.6 4.9e-09
XP_006716307 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 236 60.6 4.9e-09
NP_680091 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 236 60.6 4.9e-09
NP_000387 (OMIM: 265800,601105) cathepsin K prepro ( 329) 202 53.2 8.2e-07
NP_001325 (OMIM: 600550) cathepsin O preproprotein ( 321) 198 52.3 1.5e-06
XP_011516565 (OMIM: 116880) PREDICTED: cathepsin L ( 272) 193 51.2 2.8e-06
XP_016869782 (OMIM: 116880) PREDICTED: cathepsin L ( 272) 193 51.2 2.8e-06
NP_001244902 (OMIM: 116880) cathepsin L1 isoform 2 ( 151) 189 50.1 3.2e-06
NP_001306066 (OMIM: 116820) pro-cathepsin H isofor ( 201) 190 50.4 3.4e-06
XP_016877441 (OMIM: 116820) PREDICTED: pro-catheps ( 297) 185 49.5 9.9e-06
XP_005254238 (OMIM: 116820) PREDICTED: pro-catheps ( 297) 185 49.5 9.9e-06
XP_016877440 (OMIM: 116820) PREDICTED: pro-catheps ( 317) 185 49.5 1e-05
NP_004381 (OMIM: 116820) pro-cathepsin H isoform a ( 335) 185 49.5 1.1e-05
XP_011543630 (OMIM: 603539,615362) PREDICTED: cath ( 424) 160 44.1 0.00058
NP_003784 (OMIM: 603539,615362) cathepsin F precur ( 484) 160 44.1 0.00065
XP_016866234 (OMIM: 606749) PREDICTED: tubulointer ( 438) 157 43.5 0.00094
NP_001191344 (OMIM: 616064) tubulointerstitial nep ( 362) 147 41.2 0.0036
XP_005271164 (OMIM: 616064) PREDICTED: tubulointer ( 362) 147 41.2 0.0036
XP_011540248 (OMIM: 616064) PREDICTED: tubulointer ( 408) 147 41.3 0.004
NP_001191343 (OMIM: 616064) tubulointerstitial nep ( 436) 147 41.3 0.0042
NP_071447 (OMIM: 616064) tubulointerstitial nephri ( 467) 147 41.3 0.0045
XP_016866237 (OMIM: 606749) PREDICTED: tubulointer ( 309) 140 39.6 0.0093
>>NP_001327 (OMIM: 603169) cathepsin Z preproprotein [Ho (303 aa) initn: 2187 init1: 2187 opt: 2187 Z-score: 2584.4 bits: 486.3 E(85289): 3.3e-137 Smith-Waterman score: 2187; 100.0% identity (100.0% similar) in 303 aa overlap (1-303:1-303) 10 20 30 40 50 60 pF1KE0 MARRGPGWRPLLLLVLLAGAAQGGLYFRRGQTCYRPLRGDGLAPLGRSTYPRPHEYLSPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MARRGPGWRPLLLLVLLAGAAQGGLYFRRGQTCYRPLRGDGLAPLGRSTYPRPHEYLSPA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KE0 DLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 DLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSV 70 80 90 100 110 120 130 140 150 160 170 180 pF1KE0 QNVIDCGNAGSCEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQCGTCNEFKECH :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 QNVIDCGNAGSCEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQCGTCNEFKECH 130 140 150 160 170 180 190 200 210 220 230 240 pF1KE0 AIRNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGIYAEYQDTTYIN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 AIRNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGIYAEYQDTTYIN 190 200 210 220 230 240 250 260 270 280 290 300 pF1KE0 HVVSVAGWGISDGTEYWIVRNSWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGD :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 HVVSVAGWGISDGTEYWIVRNSWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGD 250 260 270 280 290 300 pF1KE0 PIV ::: NP_001 PIV >>NP_001805 (OMIM: 170650,245000,245010,602365) dipeptid (463 aa) initn: 279 init1: 131 opt: 418 Z-score: 495.8 bits: 100.4 E(85289): 7.1e-21 Smith-Waterman score: 418; 33.8% identity (60.1% similar) in 228 aa overlap (62-279:231-445) 40 50 60 70 80 90 pF1KE0 TCYRPLRGDGLAPLGRSTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGS :: ::::::: :.:..: .::: ::: NP_001 MIRRSGGHSRKIPRPKPAPLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQ---ASCGS 210 220 230 240 250 100 110 120 130 140 pF1KE0 CWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCGN-AGSCEGGND-LSVWDYAHQHGI 
:.. :: . . :: : ... . .:: :.:..:.. : .:::: : . ::.. :. NP_001 CYSFASMGMLEARIRILTNNSQ-TPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGL 260 270 280 290 300 310 150 160 170 180 190 200 pF1KE0 PDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREKMMAEIYAN .:.: : . :. : ..: :. . .: :: . . .. : :. . NP_001 VEEACFPYTGTDSPCKMKEDC-----FRYYSSEYHY----VGGFYGGCNEALMKLELVHH 320 330 340 350 360 210 220 230 240 250 260 pF1KE0 GPISCGIMATERLANYTGGIYAE------YQDTTYINHVVSVAGWGI--SDGTEYWIVRN ::.. .. . . . .: ::: . .. ::.: ..:.: ..: .::::.: NP_001 GPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKN 370 380 390 400 410 420 270 280 290 300 pF1KE0 SWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGDPIV ::: ::: :..:: .: NP_001 SWGTGWGENGYFRIRRGTDECAIESIAVAATPIPKL 430 440 450 460 >>NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 pre (333 aa) initn: 247 init1: 110 opt: 305 Z-score: 364.6 bits: 75.7 E(85289): 1.4e-13 Smith-Waterman score: 329; 33.2% identity (57.6% similar) in 229 aa overlap (63-275:115-314) 40 50 60 70 80 90 pF1KE0 CYRPLRGDGLAPLGRSTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGSC :.: :::. .:.. ..:: :::: NP_001 SEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREK---GYVTPVKNQG---QCGSC 90 100 110 120 130 100 110 120 130 140 pF1KE0 WAHASTSAMADRINIKRKGAWPSTLLSVQNVIDC----GNAGSCEGGNDLSVWDYAHQH- :: ..:.:. .. ... : : :: ::..:: :: : :.:: . ::: :. NP_001 WAFSATGALEGQM-FRKTGRLIS--LSEQNLVDCSGPQGNEG-CNGG----LMDYAFQYV 140 150 160 170 180 190 150 160 170 180 190 200 pF1KE0 ----GIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREK-M :. .: :.: .. : :.: ... : : . .. .:: . NP_001 QDNGGLDSEESYPYEATEESC-KYNPK---------YSVANDT-----GFVDIPKQEKAL 200 210 220 230 210 220 230 240 250 pF1KE0 MAEIYANGPISCGIMAT-ERLANYTGGIYAEYQDTTY-INHVVSVAGWGI----SDGTEY : . . :::: .: : : . : ::: : . .. ..: : :.:.:. ::...: NP_001 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKY 240 250 260 270 280 290 260 270 280 290 300 pF1KE0 WIVRNSWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGDPIV :.:.::::: :: :.... 
NP_001 WLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 300 310 320 330 >>NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 prepro (333 aa) initn: 247 init1: 110 opt: 305 Z-score: 364.6 bits: 75.7 E(85289): 1.4e-13 Smith-Waterman score: 329; 33.2% identity (57.6% similar) in 229 aa overlap (63-275:115-314) 40 50 60 70 80 90 pF1KE0 CYRPLRGDGLAPLGRSTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGSC :.: :::. .:.. ..:: :::: NP_666 SEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREK---GYVTPVKNQG---QCGSC 90 100 110 120 130 100 110 120 130 140 pF1KE0 WAHASTSAMADRINIKRKGAWPSTLLSVQNVIDC----GNAGSCEGGNDLSVWDYAHQH- :: ..:.:. .. ... : : :: ::..:: :: : :.:: . ::: :. NP_666 WAFSATGALEGQM-FRKTGRLIS--LSEQNLVDCSGPQGNEG-CNGG----LMDYAFQYV 140 150 160 170 180 190 150 160 170 180 190 200 pF1KE0 ----GIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREK-M :. .: :.: .. : :.: ... : : . .. .:: . NP_666 QDNGGLDSEESYPYEATEESC-KYNPK---------YSVANDT-----GFVDIPKQEKAL 200 210 220 230 210 220 230 240 250 pF1KE0 MAEIYANGPISCGIMAT-ERLANYTGGIYAEYQDTTY-INHVVSVAGWGI----SDGTEY : . . :::: .: : : . : ::: : . .. ..: : :.:.:. ::...: NP_666 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKY 240 250 260 270 280 290 260 270 280 290 300 pF1KE0 WIVRNSWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGDPIV :.:.::::: :: :.... NP_666 WLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 300 310 320 330 >>NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 prepro (333 aa) initn: 247 init1: 110 opt: 305 Z-score: 364.6 bits: 75.7 E(85289): 1.4e-13 Smith-Waterman score: 329; 33.2% identity (57.6% similar) in 229 aa overlap (63-275:115-314) 40 50 60 70 80 90 pF1KE0 CYRPLRGDGLAPLGRSTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGSC :.: :::. .:.. ..:: :::: NP_001 SEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREK---GYVTPVKNQG---QCGSC 90 100 110 120 130 100 110 120 130 140 pF1KE0 WAHASTSAMADRINIKRKGAWPSTLLSVQNVIDC----GNAGSCEGGNDLSVWDYAHQH- :: ..:.:. .. ... : : :: ::..:: :: : :.:: . ::: :. 
NP_001 WAFSATGALEGQM-FRKTGRLIS--LSEQNLVDCSGPQGNEG-CNGG----LMDYAFQYV 140 150 160 170 180 190 150 160 170 180 190 200 pF1KE0 ----GIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREK-M :. .: :.: .. : :.: ... : : . .. .:: . NP_001 QDNGGLDSEESYPYEATEESC-KYNPK---------YSVANDT-----GFVDIPKQEKAL 200 210 220 230 210 220 230 240 250 pF1KE0 MAEIYANGPISCGIMAT-ERLANYTGGIYAEYQDTTY-INHVVSVAGWGI----SDGTEY : . . :::: .: : : . : ::: : . .. ..: : :.:.:. ::...: NP_001 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKY 240 250 260 270 280 290 260 270 280 290 300 pF1KE0 WIVRNSWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGDPIV :.:.::::: :: :.... NP_001 WLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 300 310 320 330 >>NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 pre (333 aa) initn: 247 init1: 110 opt: 305 Z-score: 364.6 bits: 75.7 E(85289): 1.4e-13 Smith-Waterman score: 329; 33.2% identity (57.6% similar) in 229 aa overlap (63-275:115-314) 40 50 60 70 80 90 pF1KE0 CYRPLRGDGLAPLGRSTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGSC :.: :::. .:.. ..:: :::: NP_001 SEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREK---GYVTPVKNQG---QCGSC 90 100 110 120 130 100 110 120 130 140 pF1KE0 WAHASTSAMADRINIKRKGAWPSTLLSVQNVIDC----GNAGSCEGGNDLSVWDYAHQH- :: ..:.:. .. ... : : :: ::..:: :: : :.:: . ::: :. NP_001 WAFSATGALEGQM-FRKTGRLIS--LSEQNLVDCSGPQGNEG-CNGG----LMDYAFQYV 140 150 160 170 180 190 150 160 170 180 190 200 pF1KE0 ----GIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREK-M :. .: :.: .. : :.: ... : : . .. .:: . NP_001 QDNGGLDSEESYPYEATEESC-KYNPK---------YSVANDT-----GFVDIPKQEKAL 200 210 220 230 210 220 230 240 250 pF1KE0 MAEIYANGPISCGIMAT-ERLANYTGGIYAEYQDTTY-INHVVSVAGWGI----SDGTEY : . . :::: .: : : . : ::: : . .. ..: : :.:.:. ::...: NP_001 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKY 240 250 260 270 280 290 260 270 280 290 300 pF1KE0 WIVRNSWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGDPIV :.:.::::: :: :.... 
NP_001 WLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 300 310 320 330 >>XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L1 is (333 aa) initn: 247 init1: 110 opt: 305 Z-score: 364.6 bits: 75.7 E(85289): 1.4e-13 Smith-Waterman score: 329; 33.2% identity (57.6% similar) in 229 aa overlap (63-275:115-314) 40 50 60 70 80 90 pF1KE0 CYRPLRGDGLAPLGRSTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGSC :.: :::. .:.. ..:: :::: XP_005 SEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREK---GYVTPVKNQG---QCGSC 90 100 110 120 130 100 110 120 130 140 pF1KE0 WAHASTSAMADRINIKRKGAWPSTLLSVQNVIDC----GNAGSCEGGNDLSVWDYAHQH- :: ..:.:. .. ... : : :: ::..:: :: : :.:: . ::: :. XP_005 WAFSATGALEGQM-FRKTGRLIS--LSEQNLVDCSGPQGNEG-CNGG----LMDYAFQYV 140 150 160 170 180 190 150 160 170 180 190 200 pF1KE0 ----GIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREK-M :. .: :.: .. : :.: ... : : . .. .:: . XP_005 QDNGGLDSEESYPYEATEESC-KYNPK---------YSVANDT-----GFVDIPKQEKAL 200 210 220 230 210 220 230 240 250 pF1KE0 MAEIYANGPISCGIMAT-ERLANYTGGIYAEYQDTTY-INHVVSVAGWGI----SDGTEY : . . :::: .: : : . : ::: : . .. ..: : :.:.:. ::...: XP_005 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKY 240 250 260 270 280 290 260 270 280 290 300 pF1KE0 WIVRNSWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGDPIV :.:.::::: :: :.... XP_005 WLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 300 310 320 330 >>NP_001188504 (OMIM: 603308) cathepsin L2 preproprotein (334 aa) initn: 273 init1: 106 opt: 298 Z-score: 356.3 bits: 74.1 E(85289): 4.1e-13 Smith-Waterman score: 346; 31.0% identity (57.5% similar) in 261 aa overlap (27-275:88-315) 10 20 30 40 50 pF1KE0 MARRGPGWRPLLLLVLLAGAAQGGLYFRRGQTCYRPLRGDGLAPLGRSTYPRPHEY ::. . :.: . . .. : . NP_001 MIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQK------FRKGKVFREPLF 60 70 80 90 100 110 60 70 80 90 100 110 pF1KE0 LSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPST : ::::: :::. : :.. ..:: . :::::: ..:.:. .. ... 
: : NP_001 L---DLPKSVDWRK-KG--YVTPVKNQ---KQCGSCWAFSATGALEGQM-FRKTGKLVS- 120 130 140 150 160 120 130 140 150 160 170 pF1KE0 LLSVQNVIDC----GNAGSCEGGNDLSVWDYAHQHG-IPDETCNNYQAKDQECDKFNQCG :: ::..:: :: : :.:: ...:....: . .: : : :. : NP_001 -LSEQNLVDCSRPQGNQG-CNGGFMARAFQYVKENGGLDSEESYPYVAVDEIC------- 170 180 190 200 210 180 190 200 210 220 pF1KE0 TCNEFKECHAIRNYTLWRVGDYGSLSGREK-MMAEIYANGPISCGIMATER-LANYTGGI ... ... : : . : :.:: .: . . :::: .. : . . : .:: NP_001 ---KYRPENSVANDTGFTV----VAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGI 220 230 240 250 260 230 240 250 260 270 280 pF1KE0 YAEYQ-DTTYINHVVSVAGWGI----SDGTEYWIVRNSWGEPWGERGWLRIVTSTYKDGK : : . .. ..: : :.:.:. :....::.:.:::: :: :...: NP_001 YFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCG 270 280 290 300 310 320 290 300 pF1KE0 GARYNLAIEEHCTFGDPIV NP_001 IATAASYPNV 330 >>NP_001324 (OMIM: 603308) cathepsin L2 preproprotein [H (334 aa) initn: 273 init1: 106 opt: 298 Z-score: 356.3 bits: 74.1 E(85289): 4.1e-13 Smith-Waterman score: 346; 31.0% identity (57.5% similar) in 261 aa overlap (27-275:88-315) 10 20 30 40 50 pF1KE0 MARRGPGWRPLLLLVLLAGAAQGGLYFRRGQTCYRPLRGDGLAPLGRSTYPRPHEY ::. . :.: . . .. : . NP_001 MIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQK------FRKGKVFREPLF 60 70 80 90 100 110 60 70 80 90 100 110 pF1KE0 LSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPST : ::::: :::. : :.. ..:: . :::::: ..:.:. .. ... : : NP_001 L---DLPKSVDWRK-KG--YVTPVKNQ---KQCGSCWAFSATGALEGQM-FRKTGKLVS- 120 130 140 150 160 120 130 140 150 160 170 pF1KE0 LLSVQNVIDC----GNAGSCEGGNDLSVWDYAHQHG-IPDETCNNYQAKDQECDKFNQCG :: ::..:: :: : :.:: ...:....: . .: : : :. : NP_001 -LSEQNLVDCSRPQGNQG-CNGGFMARAFQYVKENGGLDSEESYPYVAVDEIC------- 170 180 190 200 210 180 190 200 210 220 pF1KE0 TCNEFKECHAIRNYTLWRVGDYGSLSGREK-MMAEIYANGPISCGIMATER-LANYTGGI ... ... : : . : :.:: .: . . :::: .. : . . 
: .:: NP_001 ---KYRPENSVANDTGFTV----VAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGI 220 230 240 250 260 230 240 250 260 270 280 pF1KE0 YAEYQ-DTTYINHVVSVAGWGI----SDGTEYWIVRNSWGEPWGERGWLRIVTSTYKDGK : : . .. ..: : :.:.:. :....::.:.:::: :: :...: NP_001 YFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCG 270 280 290 300 310 320 290 300 pF1KE0 GARYNLAIEEHCTFGDPIV NP_001 IATAASYPNV 330 >>NP_004070 (OMIM: 116845) cathepsin S isoform 1 preprop (331 aa) initn: 309 init1: 148 opt: 270 Z-score: 323.4 bits: 68.0 E(85289): 2.8e-11 Smith-Waterman score: 370; 33.9% identity (60.3% similar) in 224 aa overlap (62-275:115-312) 40 50 60 70 80 90 pF1KE0 TCYRPLRGDGLAPLGRSTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGS :: : ::: . . .:. .. . ::. NP_004 SEEVMSLMSSLRVPSQWQRNITYKSNPNRILPDSVDWR-----EKGCVTEVKYQGS-CGA 90 100 110 120 130 100 110 120 130 140 pF1KE0 CWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDC-----GNAGSCEGGNDLSVWDYA-H ::: ....:. ....: : : ::.::..:: :: : :.:: ....: NP_004 CWAFSAVGALEAQLKLKT-GKLVS--LSAQNLVDCSTEKYGNKG-CNGGFMTTAFQYIID 140 150 160 170 180 190 150 160 170 180 190 200 pF1KE0 QHGIPDETCNNYQAKDQEC--DKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREKMM ..:: ... :.: ::.: :. . .::... : :: :: .. NP_004 NKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTEL------------PYG----REDVL 200 210 220 230 210 220 230 240 250 260 pF1KE0 AEIYAN-GPISCGIMATE-RLANYTGGIYAEYQDTTYINHVVSVAGWGISDGTEYWIVRN : :: ::.: :. : . . : .:.: : . : .:: : :.:.: .: :::.:.: NP_004 KEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKN 240 250 260 270 280 290 270 280 290 300 pF1KE0 SWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGDPIV :::. .::.:..:. NP_004 SWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI 300 310 320 330 303 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Thu Nov 3 21:11:49 2016 done: Thu Nov 3 21:11:50 2016 Total Scan time: 6.780 Total Display time: 0.020 Function used was FASTA [36.3.4 Apr, 2011]