FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB5654, 331 aa
1>>>pF1KB5654 331 - 331 aa - 331 aa
Library: /omim/omim.rfq.tfa
60827320 residues in 85289 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.2812+/-0.000427; mu= 16.1503+/- 0.026
mean_var=70.0546+/-13.938, 0's: 0 Z-trim(111.4): 60 B-trim: 152 in 1/49
Lambda= 0.153234
statistics sampled from 19878 (19938) to 19878 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.612), E-opt: 0.2 (0.234), width: 16
Scan time: 6.890
The best scores are:                                       opt  bits E(85289)
NP_004070 (OMIM: 116845) cathepsin S isoform 1 pre ( 331) 2292 516.0 4.3e-146
NP_001186668 (OMIM: 116845) cathepsin S isoform 2  ( 281) 1378 313.9  2.5e-85
NP_000387 (OMIM: 265800,601105) cathepsin K prepro ( 329) 1241 283.7  3.7e-76
NP_001188504 (OMIM: 603308) cathepsin L2 prepropro ( 334) 1119 256.7    5e-68
NP_001324 (OMIM: 603308) cathepsin L2 preproprotei ( 334) 1119 256.7    5e-68
XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L ( 333) 1101 252.7  7.8e-67
NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333) 1101 252.7  7.8e-67
NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 ( 333) 1101 252.7  7.8e-67
NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 ( 333) 1101 252.7  7.8e-67
NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333) 1101 252.7  7.8e-67
XP_011516565 (OMIM: 116880) PREDICTED: cathepsin L ( 272)  815 189.4  7.2e-48
XP_016869782 (OMIM: 116880) PREDICTED: cathepsin L ( 272)  815 189.4  7.2e-48
NP_004381 (OMIM: 116820) pro-cathepsin H isoform a ( 335)  723 169.2  1.1e-41
XP_016877440 (OMIM: 116820) PREDICTED: pro-catheps ( 317)  713 166.9    5e-41
XP_005254238 (OMIM: 116820) PREDICTED: pro-catheps ( 297)  702 164.5  2.6e-40
XP_016877441 (OMIM: 116820) PREDICTED: pro-catheps ( 297)  702 164.5  2.6e-40
XP_011543630 (OMIM: 603539,615362) PREDICTED: cath ( 424)  561 133.4  8.2e-31
NP_003784 (OMIM: 603539,615362) cathepsin F precur ( 484)  561 133.5  9.2e-31
NP_001244902 (OMIM: 116880) cathepsin L1 isoform 2 ( 151)  543 129.2  5.6e-30
NP_001325 (OMIM: 600550) cathepsin O preproprotein ( 321)  502 120.3  5.6e-27
NP_001306066 (OMIM: 116820) pro-cathepsin H isofor ( 201)  451 108.9  9.4e-24
NP_001326 (OMIM: 602364) cathepsin W preproprotein ( 376)  399  97.6  4.5e-20
XP_011519578 (OMIM: 116820) PREDICTED: pro-catheps ( 169)  272  69.3  6.7e-12
NP_001327 (OMIM: 603169) cathepsin Z preproprotein ( 303)  272  69.4  1.1e-11
NP_001805 (OMIM: 170650,245000,245010,602365) dipe ( 463)  239  62.3  2.4e-09
NP_001304166 (OMIM: 116810) cathepsin B isoform 2  ( 215)  228  59.6  6.9e-09
XP_016868590 (OMIM: 116810) PREDICTED: cathepsin B ( 245)  228  59.6  7.7e-09
XP_011542114 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228  59.7    1e-08
XP_006716308 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228  59.7    1e-08
XP_016868587 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228  59.7    1e-08
NP_680093 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  228  59.7    1e-08
NP_680090 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  228  59.7    1e-08
XP_016868586 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228  59.7    1e-08
NP_680092 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  228  59.7    1e-08
NP_680091 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  228  59.7    1e-08
NP_001899 (OMIM: 116810) cathepsin B isoform 1 pre ( 339)  228  59.7    1e-08
XP_016868588 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228  59.7    1e-08
XP_006716307 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228  59.7    1e-08
XP_016868589 (OMIM: 116810) PREDICTED: cathepsin B ( 339)  228  59.7    1e-08
XP_005271163 (OMIM: 616064) PREDICTED: tubulointer ( 440)  194  52.3  2.3e-06
XP_016866234 (OMIM: 606749) PREDICTED: tubulointer ( 438)  150  42.6   0.0019
>>NP_004070 (OMIM: 116845) cathepsin S isoform 1 preprop (331 aa)
initn: 2292 init1: 2292 opt: 2292 Z-score: 2743.8 bits: 516.0 E(85289): 4.3e-146
Smith-Waterman score: 2292; 99.7% identity (99.7% similar) in 331 aa overlap (1-331:1-331)
10 20 30 40 50 60
pF1KB5 MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVM
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 LHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNITYKSNPNWILPDSVD
:::::::::::::::::::::::::::::::::::::::::::::::::::: :::::::
NP_004 LHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNITYKSNPNRILPDSVD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB5 WREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 WREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGC
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB5 NGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 NGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKE
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB5 AVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKNSW
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_004 AVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKNSW
250 260 270 280 290 300
310 320 330
pF1KB5 GHNFGEEGYIRMARNKGNHCGIASFPSYPEI
:::::::::::::::::::::::::::::::
NP_004 GHNFGEEGYIRMARNKGNHCGIASFPSYPEI
310 320 330
>>NP_001186668 (OMIM: 116845) cathepsin S isoform 2 prep (281 aa)
initn: 1377 init1: 1377 opt: 1378 Z-score: 1652.8 bits: 313.9 E(85289): 2.5e-85
Smith-Waterman score: 1853; 84.9% identity (84.9% similar) in 331 aa overlap (1-331:1-281)
10 20 30 40 50 60
pF1KB5 MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVM
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB5 LHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNITYKSNPNWILPDSVD
:::::::::::::::::::::::
NP_001 LHNLEHSMGMHSYDLGMNHLGDM-------------------------------------
70 80
130 140 150 160 170 180
pF1KB5 WREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGC
:::::::::::::::::::::::::::::::::::::::::::::::
NP_001 -------------GSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGC
90 100 110 120 130
190 200 210 220 230 240
pF1KB5 NGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 NGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKE
140 150 160 170 180 190
250 260 270 280 290 300
pF1KB5 AVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKNSW
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
NP_001 AVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKNSW
200 210 220 230 240 250
310 320 330
pF1KB5 GHNFGEEGYIRMARNKGNHCGIASFPSYPEI
:::::::::::::::::::::::::::::::
NP_001 GHNFGEEGYIRMARNKGNHCGIASFPSYPEI
260 270 280
>>NP_000387 (OMIM: 265800,601105) cathepsin K preproprot (329 aa)
initn: 1097 init1: 467 opt: 1241 Z-score: 1488.2 bits: 283.7 E(85289): 3.7e-76
Smith-Waterman score: 1241; 55.8% identity (79.9% similar) in 328 aa overlap (7-331:6-329)
10 20 30 40 50 60
pF1KB5 MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVM
:::. . : :. . :: ::.:::::. :::..: .: ::::::::::..
NP_000 MWGLKVLLLPVVSFA-LYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYIS
10 20 30 40 50
70 80 90 100 110
pF1KB5 LHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNITYKSNPNWI--LPDS
.:::: :.:.:.:.:.::::::::::::.. :..:.:: . .:. :.: :::
NP_000 IHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDS
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB5 VDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNK
::.:.:: :: :: ::.::.:::::.:::::.::: :::::..:: :::::: .: :
NP_000 VDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSE---ND
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB5 GCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGREDVL
::.::.::.::::. :.::::. .::: .....:.:. .:: : : :.: : : .:
NP_000 GCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL
180 190 200 210 220 230
240 250 260 270 280 290
pF1KB5 KEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSC-TQNVNHGVLVVGYGDLNGKEYWLVK
:.::: :::::..:: :: .: .::::. :: ..:.::.::.:::: .:...:..:
NP_000 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIK
240 250 260 270 280 290
300 310 320 330
pF1KB5 NSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI
::::.:.:..::: :::::.: ::::.. :.:..
NP_000 NSWGENWGNKGYILMARNKNNACGIANLASFPKM
300 310 320
>>NP_001188504 (OMIM: 603308) cathepsin L2 preproprotein (334 aa)
initn: 1056 init1: 363 opt: 1119 Z-score: 1342.3 bits: 256.7 E(85289): 5e-68
Smith-Waterman score: 1119; 50.6% identity (78.5% similar) in 326 aa overlap (12-331:15-334)
10 20 30 40 50
pF1KB5 MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK
.::: .. : .:: .:. :: :. . : :::. :: .::::.:
NP_001 MNLSLVLAAFCLGIASAVPKF--DQNLDTKWYQWKATHRRLYGA-NEEGWRRAVWEKNMK
10 20 30 40 50
60 70 80 90 100 110
pF1KB5 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNITYKSNPNWI-LP
.. ::: :.:.: :.. ..:: .::::.:: ..:. .: ...... ... .: .. ::
NP_001 MIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFR-NQKFRKGKVFR-EPLFLDLP
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB5 DSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYG
:::::.:: :: :: : .::.::::::.::::.:. ::::::::: ::::::: . :
NP_001 KSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQ-G
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB5 NKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGRED
:.:::::::. ::::. .: :.::. :::: :.:. :.: . .:. . .: . :.:
NP_001 NQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEK
180 190 200 210 220 230
240 250 260 270 280 290
pF1KB5 VLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCT-QNVNHGVLVVGYG----DLNGK
.: .:::. ::.::..:: : :: .:.::.:.::.:. .:..::::::::: . :..
NP_001 ALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNS
240 250 260 270 280 290
300 310 320 330
pF1KB5 EYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI
.::::::::: ..: .::...:..:.::::::. :::..
NP_001 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV
300 310 320 330
>>NP_001324 (OMIM: 603308) cathepsin L2 preproprotein [H (334 aa)
initn: 1056 init1: 363 opt: 1119 Z-score: 1342.3 bits: 256.7 E(85289): 5e-68
Smith-Waterman score: 1119; 50.6% identity (78.5% similar) in 326 aa overlap (12-331:15-334)
10 20 30 40 50
pF1KB5 MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK
.::: .. : .:: .:. :: :. . : :::. :: .::::.:
NP_001 MNLSLVLAAFCLGIASAVPKF--DQNLDTKWYQWKATHRRLYGA-NEEGWRRAVWEKNMK
10 20 30 40 50
60 70 80 90 100 110
pF1KB5 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNITYKSNPNWI-LP
.. ::: :.:.: :.. ..:: .::::.:: ..:. .: ...... ... .: .. ::
NP_001 MIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFR-NQKFRKGKVFR-EPLFLDLP
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB5 DSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYG
:::::.:: :: :: : .::.::::::.::::.:. ::::::::: ::::::: . :
NP_001 KSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQ-G
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB5 NKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGRED
:.:::::::. ::::. .: :.::. :::: :.:. :.: . .:. . .: . :.:
NP_001 NQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKYRPENSVANDTGFTVVAPGKEK
180 190 200 210 220 230
240 250 260 270 280 290
pF1KB5 VLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCT-QNVNHGVLVVGYG----DLNGK
.: .:::. ::.::..:: : :: .:.::.:.::.:. .:..::::::::: . :..
NP_001 ALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNS
240 250 260 270 280 290
300 310 320 330
pF1KB5 EYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI
.::::::::: ..: .::...:..:.::::::. :::..
NP_001 KYWLVKNSWGPEWGSNGYVKIAKDKNNHCGIATAASYPNV
300 310 320 330
>>XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L1 is (333 aa)
initn: 937 init1: 293 opt: 1101 Z-score: 1320.8 bits: 252.7 E(85289): 7.8e-67
Smith-Waterman score: 1101; 49.7% identity (75.6% similar) in 328 aa overlap (14-331:15-333)
10 20 30 40 50
pF1KB5 MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV
: : : : .:. .: :: ... : :::. :: .::::.:..
XP_005 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGM-NEEGWRRAVWEKNMKMI
10 20 30 40 50
60 70 80 90 100 110
pF1KB5 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSL--RVPSQ---WQRNITYKSNPNWI
::: :. : ::. ..:: .::::::: ..:... : : . .:. . :..
XP_005 ELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA-----
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB5 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK
: :::::::: :: :: ::.::.::::::.::::.:. :::.:.::: ::::::: .
XP_005 -PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQ
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB5 YGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGR
::.:::::.: ::::. :: :.::. ::::.: ...:.:. :: .:. . ....: .
XP_005 -GNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-Q
180 190 200 210 220 230
240 250 260 270 280
pF1KB5 EDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCT-QNVNHGVLVVGYG----DLN
: .: .:::. ::.::..:: : ::..:. :.:.::.:. ....::::::::: . .
XP_005 EKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESD
240 250 260 270 280 290
290 300 310 320 330
pF1KB5 GKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI
...:::::::::...: ::..::... ::::::: ::: .
XP_005 NNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV
300 310 320 330
>>NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 prepro (333 aa)
initn: 937 init1: 293 opt: 1101 Z-score: 1320.8 bits: 252.7 E(85289): 7.8e-67
Smith-Waterman score: 1101; 49.7% identity (75.6% similar) in 328 aa overlap (14-331:15-333)
10 20 30 40 50
pF1KB5 MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV
: : : : .:. .: :: ... : :::. :: .::::.:..
NP_001 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGM-NEEGWRRAVWEKNMKMI
10 20 30 40 50
60 70 80 90 100 110
pF1KB5 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSL--RVPSQ---WQRNITYKSNPNWI
::: :. : ::. ..:: .::::::: ..:... : : . .:. . :..
NP_001 ELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA-----
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB5 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK
: :::::::: :: :: ::.::.::::::.::::.:. :::.:.::: ::::::: .
NP_001 -PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQ
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB5 YGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGR
::.:::::.: ::::. :: :.::. ::::.: ...:.:. :: .:. . ....: .
NP_001 -GNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-Q
180 190 200 210 220 230
240 250 260 270 280
pF1KB5 EDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCT-QNVNHGVLVVGYG----DLN
: .: .:::. ::.::..:: : ::..:. :.:.::.:. ....::::::::: . .
NP_001 EKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESD
240 250 260 270 280 290
290 300 310 320 330
pF1KB5 GKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI
...:::::::::...: ::..::... ::::::: ::: .
NP_001 NNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV
300 310 320 330
>>NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 pre (333 aa)
initn: 937 init1: 293 opt: 1101 Z-score: 1320.8 bits: 252.7 E(85289): 7.8e-67
Smith-Waterman score: 1101; 49.7% identity (75.6% similar) in 328 aa overlap (14-331:15-333)
10 20 30 40 50
pF1KB5 MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV
: : : : .:. .: :: ... : :::. :: .::::.:..
NP_001 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGM-NEEGWRRAVWEKNMKMI
10 20 30 40 50
60 70 80 90 100 110
pF1KB5 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSL--RVPSQ---WQRNITYKSNPNWI
::: :. : ::. ..:: .::::::: ..:... : : . .:. . :..
NP_001 ELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA-----
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB5 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK
: :::::::: :: :: ::.::.::::::.::::.:. :::.:.::: ::::::: .
NP_001 -PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQ
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB5 YGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGR
::.:::::.: ::::. :: :.::. ::::.: ...:.:. :: .:. . ....: .
NP_001 -GNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-Q
180 190 200 210 220 230
240 250 260 270 280
pF1KB5 EDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCT-QNVNHGVLVVGYG----DLN
: .: .:::. ::.::..:: : ::..:. :.:.::.:. ....::::::::: . .
NP_001 EKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESD
240 250 260 270 280 290
290 300 310 320 330
pF1KB5 GKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI
...:::::::::...: ::..::... ::::::: ::: .
NP_001 NNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV
300 310 320 330
>>NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 pre (333 aa)
initn: 937 init1: 293 opt: 1101 Z-score: 1320.8 bits: 252.7 E(85289): 7.8e-67
Smith-Waterman score: 1101; 49.7% identity (75.6% similar) in 328 aa overlap (14-331:15-333)
10 20 30 40 50
pF1KB5 MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV
: : : : .:. .: :: ... : :::. :: .::::.:..
NP_001 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGM-NEEGWRRAVWEKNMKMI
10 20 30 40 50
60 70 80 90 100 110
pF1KB5 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSL--RVPSQ---WQRNITYKSNPNWI
::: :. : ::. ..:: .::::::: ..:... : : . .:. . :..
NP_001 ELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA-----
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB5 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK
: :::::::: :: :: ::.::.::::::.::::.:. :::.:.::: ::::::: .
NP_001 -PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQ
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB5 YGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGR
::.:::::.: ::::. :: :.::. ::::.: ...:.:. :: .:. . ....: .
NP_001 -GNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-Q
180 190 200 210 220 230
240 250 260 270 280
pF1KB5 EDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCT-QNVNHGVLVVGYG----DLN
: .: .:::. ::.::..:: : ::..:. :.:.::.:. ....::::::::: . .
NP_001 EKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESD
240 250 260 270 280 290
290 300 310 320 330
pF1KB5 GKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI
...:::::::::...: ::..::... ::::::: ::: .
NP_001 NNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV
300 310 320 330
>>NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 prepro (333 aa)
initn: 937 init1: 293 opt: 1101 Z-score: 1320.8 bits: 252.7 E(85289): 7.8e-67
Smith-Waterman score: 1101; 49.7% identity (75.6% similar) in 328 aa overlap (14-331:15-333)
10 20 30 40 50
pF1KB5 MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFV
: : : : .:. .: :: ... : :::. :: .::::.:..
NP_666 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGM-NEEGWRRAVWEKNMKMI
10 20 30 40 50
60 70 80 90 100 110
pF1KB5 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSL--RVPSQ---WQRNITYKSNPNWI
::: :. : ::. ..:: .::::::: ..:... : : . .:. . :..
NP_666 ELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA-----
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB5 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK
: :::::::: :: :: ::.::.::::::.::::.:. :::.:.::: ::::::: .
NP_666 -PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQ
120 130 140 150 160 170
180 190 200 210 220 230
pF1KB5 YGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGR
::.:::::.: ::::. :: :.::. ::::.: ...:.:. :: .:. . ....: .
NP_666 -GNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-Q
180 190 200 210 220 230
240 250 260 270 280
pF1KB5 EDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCT-QNVNHGVLVVGYG----DLN
: .: .:::. ::.::..:: : ::..:. :.:.::.:. ....::::::::: . .
NP_666 EKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESD
240 250 260 270 280 290
290 300 310 320 330
pF1KB5 GKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI
...:::::::::...: ::..::... ::::::: ::: .
NP_666 NNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV
300 310 320 330
331 residues in 1 query sequences
60827320 residues in 85289 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 13:10:24 2016 done: Sat Nov 5 13:10:25 2016
Total Scan time: 6.890 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]