FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB5535, 463 aa 1>>>pF1KB5535 463 - 463 aa - 463 aa Library: /omim/omim.rfq.tfa 60827320 residues in 85289 sequences Statistics: Expectation_n fit: rho(ln(x))= 5.2378+/-0.000379; mu= 17.8542+/- 0.023 mean_var=63.7592+/-13.045, 0's: 0 Z-trim(111.8): 54 B-trim: 103 in 1/50 Lambda= 0.160621 statistics sampled from 20463 (20517) to 20463 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.6), E-opt: 0.2 (0.241), width: 16 Scan time: 8.750 The best scores are: opt bits E(85289) NP_001805 (OMIM: 170650,245000,245010,602365) dipe ( 463) 3212 753.3 3.1e-217 NP_680475 (OMIM: 170650,245000,245010,602365) dipe ( 137) 719 175.3 8.9e-44 NP_001107645 (OMIM: 170650,245000,245010,602365) d ( 141) 719 175.4 9.1e-44 NP_001188504 (OMIM: 603308) cathepsin L2 prepropro ( 334) 452 113.7 7.9e-25 NP_001324 (OMIM: 603308) cathepsin L2 preproprotei ( 334) 452 113.7 7.9e-25 NP_001327 (OMIM: 603169) cathepsin Z preproprotein ( 303) 418 105.8 1.7e-22 NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333) 386 98.4 3.2e-20 NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 pr ( 333) 386 98.4 3.2e-20 NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 ( 333) 386 98.4 3.2e-20 XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L ( 333) 386 98.4 3.2e-20 NP_001244901 (OMIM: 116880) cathepsin L1 isoform 1 ( 333) 386 98.4 3.2e-20 XP_005254238 (OMIM: 116820) PREDICTED: pro-catheps ( 297) 383 97.7 4.7e-20 XP_016877441 (OMIM: 116820) PREDICTED: pro-catheps ( 297) 383 97.7 4.7e-20 NP_004381 (OMIM: 116820) pro-cathepsin H isoform a ( 335) 383 97.7 5.2e-20 XP_016877440 (OMIM: 116820) PREDICTED: pro-catheps ( 317) 382 97.4 5.8e-20 NP_001304166 (OMIM: 116810) cathepsin B isoform 2 ( 215) 323 83.7 5.5e-16 XP_016868590 (OMIM: 116810) PREDICTED: cathepsin B ( 245) 323 83.7 6.1e-16 XP_006716308 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 323 83.8 8e-16 XP_016868586 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 323 83.8 8e-16 XP_011542114 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 323 83.8 8e-16 XP_016868589 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 323 83.8 8e-16 NP_680092 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 323 83.8 8e-16 NP_680093 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 323 83.8 8e-16 XP_006716307 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 323 83.8 8e-16 XP_016868588 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 323 83.8 8e-16 NP_001899 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 323 83.8 8e-16 NP_680090 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 323 83.8 8e-16 XP_016868587 (OMIM: 116810) PREDICTED: cathepsin B ( 339) 323 83.8 8e-16 NP_680091 (OMIM: 116810) cathepsin B isoform 1 pre ( 339) 323 83.8 8e-16 NP_001306066 (OMIM: 116820) pro-cathepsin H isofor ( 201) 295 77.2 4.6e-14 XP_016866237 (OMIM: 606749) PREDICTED: tubulointer ( 309) 297 77.7 4.8e-14 XP_016866236 (OMIM: 606749) PREDICTED: tubulointer ( 351) 297 77.8 5.4e-14 XP_016866235 (OMIM: 606749) PREDICTED: tubulointer ( 401) 297 77.8 6e-14 XP_011512799 (OMIM: 606749) PREDICTED: tubulointer ( 426) 297 77.8 6.3e-14 XP_006715125 (OMIM: 606749) PREDICTED: tubulointer ( 458) 297 77.8 6.7e-14 NP_055279 (OMIM: 606749) tubulointerstitial nephri ( 476) 297 77.8 6.9e-14 NP_000387 (OMIM: 265800,601105) cathepsin K prepro ( 329) 286 75.2 3e-13 NP_001191344 (OMIM: 616064) tubulointerstitial nep ( 362) 277 73.1 1.4e-12 XP_005271164 (OMIM: 616064) PREDICTED: tubulointer ( 362) 277 73.1 1.4e-12 XP_011540248 (OMIM: 616064) PREDICTED: tubulointer ( 408) 277 73.2 1.5e-12 NP_001191343 (OMIM: 616064) tubulointerstitial nep ( 436) 277 73.2 1.6e-12 NP_071447 (OMIM: 616064) tubulointerstitial nephri ( 467) 277 73.2 1.7e-12 XP_016869782 (OMIM: 116880) PREDICTED: cathepsin L ( 272) 266 70.5 6.3e-12 XP_011516565 (OMIM: 116880) PREDICTED: cathepsin L ( 272) 266 70.5 6.3e-12 NP_001244902 (OMIM: 116880) cathepsin L1 isoform 2 ( 151) 261 69.2 8.6e-12 XP_011543630 (OMIM: 603539,615362) PREDICTED: cath ( 424) 257 68.5 3.9e-11 NP_003784 (OMIM: 603539,615362) cathepsin F precur ( 484) 257 68.6 4.3e-11 NP_004070 (OMIM: 116845) cathepsin S isoform 1 pre ( 331) 235 63.4 1.1e-09 NP_001186668 (OMIM: 116845) cathepsin S isoform 2 ( 281) 229 62.0 2.5e-09 XP_005271163 (OMIM: 616064) PREDICTED: tubulointer ( 440) 227 61.6 5e-09 >>NP_001805 (OMIM: 170650,245000,245010,602365) dipeptid (463 aa) initn: 3212 init1: 3212 opt: 3212 Z-score: 4021.1 bits: 753.3 E(85289): 3.1e-217 Smith-Waterman score: 3212; 100.0% identity (100.0% similar) in 463 aa overlap (1-463:1-463) 10 20 30 40 50 60 pF1KB5 MGAGPSLLLAALLLLLSGDGAVRCDTPANCTYLDLLGTWVFQVGSSGSQRDVNCSVMGPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MGAGPSLLLAALLLLLSGDGAVRCDTPANCTYLDLLGTWVFQVGSSGSQRDVNCSVMGPQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB5 EKKVVVYLQKLDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWFAFFKYKEEGSKVTTYCNE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 EKKVVVYLQKLDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWFAFFKYKEEGSKVTTYCNE 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB5 TMTGWVHDVLGRNWACFTGKKVGTASENVYVNTAHLKNSQEKYSNRLYKYDHNFVKAINA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 TMTGWVHDVLGRNWACFTGKKVGTASENVYVNTAHLKNSQEKYSNRLYKYDHNFVKAINA 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB5 IQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILHLPTSWDWRNV :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 IQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILHLPTSWDWRNV 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB5 HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 HGINFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEG 250 260 270 280 290 300 310 320 330 340 350 360 pF1KB5 GFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 GFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALM 310 320 330 340 350 360 370 380 390 400 410 420 pF1KB5 KLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGM :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 KLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGM 370 380 390 400 410 420 430 440 450 460 pF1KB5 DYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPKL ::::::::::::::::::::::::::::::::::::::::::: NP_001 DYWIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPIPKL 430 440 450 460 >>NP_680475 (OMIM: 170650,245000,245010,602365) dipeptid (137 aa) initn: 719 init1: 719 opt: 719 Z-score: 906.9 bits: 175.3 E(85289): 8.9e-44 Smith-Waterman score: 719; 100.0% identity (100.0% similar) in 106 aa overlap (1-106:1-106) 10 20 30 40 50 60 pF1KB5 MGAGPSLLLAALLLLLSGDGAVRCDTPANCTYLDLLGTWVFQVGSSGSQRDVNCSVMGPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_680 MGAGPSLLLAALLLLLSGDGAVRCDTPANCTYLDLLGTWVFQVGSSGSQRDVNCSVMGPQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB5 EKKVVVYLQKLDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWFAFFKYKEEGSKVTTYCNE :::::::::::::::::::::::::::::::::::::::::::::: NP_680 EKKVVVYLQKLDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWFAFFKDVTDFISHLFMQLG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB5 TMTGWVHDVLGRNWACFTGKKVGTASENVYVNTAHLKNSQEKYSNRLYKYDHNFVKAINA NP_680 TVGIYDLPHLRNKLVIK 130 >>NP_001107645 (OMIM: 170650,245000,245010,602365) dipep (141 aa) initn: 719 init1: 719 opt: 719 Z-score: 906.7 bits: 175.4 E(85289): 9.1e-44 Smith-Waterman score: 719; 100.0% identity (100.0% similar) in 106 aa overlap (1-106:1-106) 10 20 30 40 50 60 pF1KB5 MGAGPSLLLAALLLLLSGDGAVRCDTPANCTYLDLLGTWVFQVGSSGSQRDVNCSVMGPQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: NP_001 MGAGPSLLLAALLLLLSGDGAVRCDTPANCTYLDLLGTWVFQVGSSGSQRDVNCSVMGPQ 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB5 EKKVVVYLQKLDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWFAFFKYKEEGSKVTTYCNE :::::::::::::::::::::::::::::::::::::::::::::: NP_001 EKKVVVYLQKLDTAYDDLGNSGHFTIIYNQGFEIVLNDYKWFAFFKDVTDFISHLFMQLG 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB5 TMTGWVHDVLGRNWACFTGKKVGTASENVYVNTAHLKNSQEKYSNRLYKYDHNFVKAINA NP_001 TVGIYDLPHLRNKLAMNRRWG 130 140 >>NP_001188504 (OMIM: 603308) cathepsin L2 preproprotein (334 aa) initn: 394 init1: 170 opt: 452 Z-score: 566.7 bits: 113.7 E(85289): 7.9e-25 Smith-Waterman score: 454; 31.3% identity (60.0% similar) in 335 aa overlap (128-454:26-329) 100 110 120 130 140 150 pF1KB5 DYKWFAFFKYKEEGSKVTTYCNETMTGWVHDVLGRNWACFTGKKVGTASENVYVNTAHLK :. .: : ... :.:. . .. : NP_001 MNLSLVLAAFCLGIASAVPKFDQNLDTKWYQWKA-THRRLYGANEEGWRRAVWEK 10 20 30 40 50 160 170 180 190 200 210 pF1KB5 NSQ--EKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRP : . : .... . :.:. :.::. : :.. . .: .: . .. :. : NP_001 NMKMIELHNGEYSQGKHGFTMAMNAFGD----MTNEEFRQM-MG-CFRNQKFRKGKVFRE 60 70 80 90 100 220 230 240 250 260 270 pF1KB5 KPAPLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILT :: .: :: : :::. .: .:.::.:: .::::..:.. : ::... . NP_001 ---PL-------FLDLPKSVDWRK-KG--YVTPVKNQKQCGSCWAFSATGALEGQM--FR 110 120 130 140 150 280 290 300 310 320 330 pF1KB5 NNSQTPILSPQEVVSCS--QYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKM .... :: :..:.:: : :::.::: .. :: : .::...: ::. NP_001 KTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKY 160 170 180 190 200 210 340 350 360 370 380 390 pF1KB5 KEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEV-YDDFLHYKKGIYHHTG . . .. . :. : ..:::: .. ::..::... ...: ::.::: NP_001 RPENSVANDTGFTVVAP---GKEKALMK-AVATVGPISVAMDAGHSSFQFYKSGIYF--- 220 230 240 250 260 400 410 420 430 440 pF1KB5 LRDPFNPFELTNHAVLLVGYGTDSASGMD--YWIVKNSWGTGWGENGYFRIRRG-TDECA .: . .:.::.:::: ..:.. . ::.:::::: :: ::: .: . ...:. NP_001 --EPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCG 270 280 290 300 310 320 450 460 pF1KB5 IESIAVAATPIPKL : . : NP_001 IATAASYPNV 330 >>NP_001324 (OMIM: 603308) cathepsin L2 preproprotein [H (334 aa) initn: 394 init1: 170 opt: 452 Z-score: 566.7 bits: 113.7 E(85289): 7.9e-25 Smith-Waterman score: 454; 31.3% identity (60.0% similar) in 335 aa overlap (128-454:26-329) 100 110 120 130 140 150 pF1KB5 DYKWFAFFKYKEEGSKVTTYCNETMTGWVHDVLGRNWACFTGKKVGTASENVYVNTAHLK :. .: : ... :.:. . .. : NP_001 MNLSLVLAAFCLGIASAVPKFDQNLDTKWYQWKA-THRRLYGANEEGWRRAVWEK 10 20 30 40 50 160 170 180 190 200 210 pF1KB5 NSQ--EKYSNRLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRP : . : .... . :.:. :.::. : :.. . .: .: . .. :. : NP_001 NMKMIELHNGEYSQGKHGFTMAMNAFGD----MTNEEFRQM-MG-CFRNQKFRKGKVFRE 60 70 80 90 100 220 230 240 250 260 270 pF1KB5 KPAPLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILT :: .: :: : :::. .: .:.::.:: .::::..:.. : ::... . NP_001 ---PL-------FLDLPKSVDWRK-KG--YVTPVKNQKQCGSCWAFSATGALEGQM--FR 110 120 130 140 150 280 290 300 310 320 330 pF1KB5 NNSQTPILSPQEVVSCS--QYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYTGTDSPCKM .... :: :..:.:: : :::.::: .. :: : .::...: ::. NP_001 KTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICKY 160 170 180 190 200 210 340 350 360 370 380 390 pF1KB5 KEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEV-YDDFLHYKKGIYHHTG . . .. . :. : ..:::: .. ::..::... ...: ::.::: NP_001 RPENSVANDTGFTVVAP---GKEKALMK-AVATVGPISVAMDAGHSSFQFYKSGIYF--- 220 230 240 250 260 400 410 420 430 440 pF1KB5 LRDPFNPFELTNHAVLLVGYGTDSASGMD--YWIVKNSWGTGWGENGYFRIRRG-TDECA .: . .:.::.:::: ..:.. . ::.:::::: :: ::: .: . ...:. NP_001 --EPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCG 270 280 290 300 310 320 450 460 pF1KB5 IESIAVAATPIPKL : . : NP_001 IATAASYPNV 330 >>NP_001327 (OMIM: 603169) cathepsin Z preproprotein [Ho (303 aa) initn: 300 init1: 131 opt: 418 Z-score: 524.8 bits: 105.8 E(85289): 1.7e-22 Smith-Waterman score: 418; 33.8% identity (60.1% similar) in 228 aa overlap (231-445:62-279) 210 220 230 240 250 pF1KB5 MIRRSGGHSRKIPRPKPAPLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQ---ASCGS :: ::::::: :.:..: .::: ::: NP_001 TCYRPLRGDGLAPLGRSTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGS 40 50 60 70 80 90 260 270 280 290 300 310 pF1KB5 CYSFASMGMLEARIRILTNNSQ-TPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGL :.. :: . . :: : ... . .:: :.:..:.. : .:::: : . ::.. :. NP_001 CWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCGN-AGSCEGGND-LSVWDYAHQHGI 100 110 120 130 140 320 330 340 350 360 pF1KB5 VEEACFPYTGTDSPCKMKEDC-----FRYYSSEYHY----VGGFYGGCNEALMKLELVHH .:.: : . :. : ..: :. . .: :: . . .. : :. . NP_001 PDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREKMMAEIYAN 150 160 170 180 190 200 370 380 390 400 410 420 pF1KB5 GPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKN ::.. .. . . . .: ::: . .. ::.: ..:.: ..: .::::.: NP_001 GPISCGIMATERLANYTGGIYAE------YQDTTYINHVVSVAGWGI--SDGTEYWIVRN 210 220 230 240 250 260 430 440 450 460 pF1KB5 SWGTGWGENGYFRIRRGTDECAIESIAVAATPIPKL ::: ::: :..:: .: NP_001 SWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGDPIV 270 280 290 300 >>NP_001903 (OMIM: 116880) cathepsin L1 isoform 1 prepro (333 aa) initn: 341 init1: 165 opt: 386 Z-score: 484.1 bits: 98.4 E(85289): 3.2e-20 Smith-Waterman score: 432; 36.1% identity (63.5% similar) in 249 aa overlap (219-454:100-328) 190 200 210 220 230 240 pF1KB5 TYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILHL--PTSWDWRNVHGINFV : ... :. : : : :::. .: .: NP_001 KHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWRE-KG--YV 70 80 90 100 110 120 250 260 270 280 290 300 pF1KB5 SPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCS--QYAQGCEGGFPY .::.::..::::..:.. : ::.. .. .... :: :..:.:: : .::.::. . NP_001 TPVKNQGQCGSCWAFSATGALEGQ--MFRKTGRLISLSEQNLVDCSGPQGNEGCNGGL-M 130 140 150 160 170 180 310 320 330 340 350 pF1KB5 LIAGKYAQDFG-LVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVG--GFYG--GCNEAL : .:.:: : : : .:: .:. :: :. .: .. :: ..:: NP_001 DYAFQYVQDNGGLDSEESYPYEATEESCK--------YNPKYSVANDTGFVDIPKQEKAL 190 200 210 220 230 360 370 380 390 400 410 pF1KB5 MKLELVHHGPMAVAFEV-YDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDS-- :: .. ::..::... ...:: ::.::: .: : .:.::.:::: .: NP_001 MK-AVATVGPISVAIDAGHESFLFYKEGIYF-----EPDCSSEDMDHGVLVVGYGFESTE 240 250 260 270 280 420 430 440 450 460 pF1KB5 ASGMDYWIVKNSWGTGWGENGYFRIRRGT-DECAIESIAVAATPIPKL ... ::.:::::: :: .:: .. . ..:.: : : NP_001 SDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 290 300 310 320 330 >>NP_666023 (OMIM: 116880) cathepsin L1 isoform 1 prepro (333 aa) initn: 341 init1: 165 opt: 386 Z-score: 484.1 bits: 98.4 E(85289): 3.2e-20 Smith-Waterman score: 432; 36.1% identity (63.5% similar) in 249 aa overlap (219-454:100-328) 190 200 210 220 230 240 pF1KB5 TYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILHL--PTSWDWRNVHGINFV : ... :. : : : :::. .: .: NP_666 KHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWRE-KG--YV 70 80 90 100 110 120 250 260 270 280 290 300 pF1KB5 SPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCS--QYAQGCEGGFPY .::.::..::::..:.. : ::.. .. .... :: :..:.:: : .::.::. . NP_666 TPVKNQGQCGSCWAFSATGALEGQ--MFRKTGRLISLSEQNLVDCSGPQGNEGCNGGL-M 130 140 150 160 170 180 310 320 330 340 350 pF1KB5 LIAGKYAQDFG-LVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVG--GFYG--GCNEAL : .:.:: : : : .:: .:. :: :. .: .. :: ..:: NP_666 DYAFQYVQDNGGLDSEESYPYEATEESCK--------YNPKYSVANDTGFVDIPKQEKAL 190 200 210 220 230 360 370 380 390 400 410 pF1KB5 MKLELVHHGPMAVAFEV-YDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDS-- :: .. ::..::... ...:: ::.::: .: : .:.::.:::: .: NP_666 MK-AVATVGPISVAIDAGHESFLFYKEGIYF-----EPDCSSEDMDHGVLVVGYGFESTE 240 250 260 270 280 420 430 440 450 460 pF1KB5 ASGMDYWIVKNSWGTGWGENGYFRIRRGT-DECAIESIAVAATPIPKL ... ::.:::::: :: .:: .. . ..:.: : : NP_666 SDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 290 300 310 320 330 >>NP_001244900 (OMIM: 116880) cathepsin L1 isoform 1 pre (333 aa) initn: 341 init1: 165 opt: 386 Z-score: 484.1 bits: 98.4 E(85289): 3.2e-20 Smith-Waterman score: 432; 36.1% identity (63.5% similar) in 249 aa overlap (219-454:100-328) 190 200 210 220 230 240 pF1KB5 TYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILHL--PTSWDWRNVHGINFV : ... :. : : : :::. .: .: NP_001 KHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWRE-KG--YV 70 80 90 100 110 120 250 260 270 280 290 300 pF1KB5 SPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCS--QYAQGCEGGFPY .::.::..::::..:.. : ::.. .. .... :: :..:.:: : .::.::. . NP_001 TPVKNQGQCGSCWAFSATGALEGQ--MFRKTGRLISLSEQNLVDCSGPQGNEGCNGGL-M 130 140 150 160 170 180 310 320 330 340 350 pF1KB5 LIAGKYAQDFG-LVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVG--GFYG--GCNEAL : .:.:: : : : .:: .:. :: :. .: .. :: ..:: NP_001 DYAFQYVQDNGGLDSEESYPYEATEESCK--------YNPKYSVANDTGFVDIPKQEKAL 190 200 210 220 230 360 370 380 390 400 410 pF1KB5 MKLELVHHGPMAVAFEV-YDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDS-- :: .. ::..::... ...:: ::.::: .: : .:.::.:::: .: NP_001 MK-AVATVGPISVAIDAGHESFLFYKEGIYF-----EPDCSSEDMDHGVLVVGYGFESTE 240 250 260 270 280 420 430 440 450 460 pF1KB5 ASGMDYWIVKNSWGTGWGENGYFRIRRGT-DECAIESIAVAATPIPKL ... ::.:::::: :: .:: .. . ..:.: : : NP_001 SDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 290 300 310 320 330 >>XP_005251773 (OMIM: 116880) PREDICTED: cathepsin L1 is (333 aa) initn: 341 init1: 165 opt: 386 Z-score: 484.1 bits: 98.4 E(85289): 3.2e-20 Smith-Waterman score: 432; 36.1% identity (63.5% similar) in 249 aa overlap (219-454:100-328) 190 200 210 220 230 240 pF1KB5 TYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILHL--PTSWDWRNVHGINFV : ... :. : : : :::. .: .: XP_005 KHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWRE-KG--YV 70 80 90 100 110 120 250 260 270 280 290 300 pF1KB5 SPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCS--QYAQGCEGGFPY .::.::..::::..:.. : ::.. .. .... :: :..:.:: : .::.::. . XP_005 TPVKNQGQCGSCWAFSATGALEGQ--MFRKTGRLISLSEQNLVDCSGPQGNEGCNGGL-M 130 140 150 160 170 180 310 320 330 340 350 pF1KB5 LIAGKYAQDFG-LVEEACFPYTGTDSPCKMKEDCFRYYSSEYHYVG--GFYG--GCNEAL : .:.:: : : : .:: .:. :: :. .: .. :: ..:: XP_005 DYAFQYVQDNGGLDSEESYPYEATEESCK--------YNPKYSVANDTGFVDIPKQEKAL 190 200 210 220 230 360 370 380 390 400 410 pF1KB5 MKLELVHHGPMAVAFEV-YDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDS-- :: .. ::..::... ...:: ::.::: .: : .:.::.:::: .: XP_005 MK-AVATVGPISVAIDAGHESFLFYKEGIYF-----EPDCSSEDMDHGVLVVGYGFESTE 240 250 260 270 280 420 430 440 450 460 pF1KB5 ASGMDYWIVKNSWGTGWGENGYFRIRRGT-DECAIESIAVAATPIPKL ... ::.:::::: :: .:: .. . ..:.: : : XP_005 SDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV 290 300 310 320 330 463 residues in 1 query sequences 60827320 residues in 85289 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Sat Nov 5 13:04:02 2016 done: Sat Nov 5 13:04:03 2016 Total Scan time: 8.750 Total Display time: 0.030 Function used was FASTA [36.3.4 Apr, 2011]