FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0209, 303 aa
1>>>pF1KE0209 303 - 303 aa - 303 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.4329+/-0.00079; mu= 14.8302+/- 0.048
mean_var=69.3112+/-13.805, 0's: 0 Z-trim(108.8): 22 B-trim: 190 in 1/49
Lambda= 0.154054
statistics sampled from 10403 (10421) to 10403 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.693), E-opt: 0.2 (0.32), width: 16
Scan time: 2.340
The best scores are:                               opt  bits E(32554)
CCDS13474.1 CTSZ gene_id:1522|Hs108|chr20  ( 303) 2187 494.8 3.3e-140
CCDS8282.1 CTSC gene_id:1075|Hs108|chr11   ( 463)  418 101.8 1.1e-21
CCDS6675.1 CTSL gene_id:1514|Hs108|chr9    ( 333)  305  76.6 2.9e-14
CCDS6723.1 CTSV gene_id:1515|Hs108|chr9    ( 334)  298  75.0 8.6e-14
CCDS968.1 CTSS gene_id:1520|Hs108|chr1     ( 331)  270  68.8 6.4e-12
CCDS55634.1 CTSS gene_id:1520|Hs108|chr1   ( 281)  254  65.2 6.5e-11
>>CCDS13474.1 CTSZ gene_id:1522|Hs108|chr20 (303 aa)
initn: 2187 init1: 2187 opt: 2187 Z-score: 2630.7 bits: 494.8 E(32554): 3.3e-140
Smith-Waterman score: 2187; 100.0% identity (100.0% similar) in 303 aa overlap (1-303:1-303)
               10        20        30        40        50        60
pF1KE0 MARRGPGWRPLLLLVLLAGAAQGGLYFRRGQTCYRPLRGDGLAPLGRSTYPRPHEYLSPA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MARRGPGWRPLLLLVLLAGAAQGGLYFRRGQTCYRPLRGDGLAPLGRSTYPRPHEYLSPA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 DLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 DLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLLSV
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 QNVIDCGNAGSCEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQCGTCNEFKECH
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 QNVIDCGNAGSCEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQCGTCNEFKECH
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 AIRNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGIYAEYQDTTYIN
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 AIRNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGIYAEYQDTTYIN
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE0 HVVSVAGWGISDGTEYWIVRNSWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGD
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 HVVSVAGWGISDGTEYWIVRNSWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGD
              250       260       270       280       290       300

pF1KE0 PIV
       :::
CCDS13 PIV
>>CCDS8282.1 CTSC gene_id:1075|Hs108|chr11 (463 aa)
initn: 279 init1: 131 opt: 418 Z-score: 503.1 bits: 101.8 E(32554): 1.1e-21
Smith-Waterman score: 418; 33.8% identity (60.1% similar) in 228 aa overlap (62-279:231-445)
40 50 60 70 80 90
pF1KE0 TCYRPLRGDGLAPLGRSTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGS
:: ::::::: :.:..: .::: :::
CCDS82 MIRRSGGHSRKIPRPKPAPLTAEIQQKILHLPTSWDWRNVHGINFVSPVRNQ---ASCGS
210 220 230 240 250
100 110 120 130 140
pF1KE0 CWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDCGN-AGSCEGGND-LSVWDYAHQHGI
:.. :: . . :: : ... . .:: :.:..:.. : .:::: : . ::.. :.
CCDS82 CYSFASMGMLEARIRILTNNSQ-TPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGL
260 270 280 290 300 310
150 160 170 180 190 200
pF1KE0 PDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREKMMAEIYAN
.:.: : . :. : ..: :. . .: :: . . .. : :. .
CCDS82 VEEACFPYTGTDSPCKMKEDC-----FRYYSSEYHY----VGGFYGGCNEALMKLELVHH
320 330 340 350 360
210 220 230 240 250 260
pF1KE0 GPISCGIMATERLANYTGGIYAE------YQDTTYINHVVSVAGWGI--SDGTEYWIVRN
::.. .. . . . .: ::: . .. ::.: ..:.: ..: .::::.:
CCDS82 GPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKN
370 380 390 400 410 420
270 280 290 300
pF1KE0 SWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGDPIV
::: ::: :..:: .:
CCDS82 SWGTGWGENGYFRIRRGTDECAIESIAVAATPIPKL
430 440 450 460
>>CCDS6675.1 CTSL gene_id:1514|Hs108|chr9 (333 aa)
initn: 247 init1: 110 opt: 305 Z-score: 369.5 bits: 76.6 E(32554): 2.9e-14
Smith-Waterman score: 329; 33.2% identity (57.6% similar) in 229 aa overlap (63-275:115-314)
40 50 60 70 80 90
pF1KE0 CYRPLRGDGLAPLGRSTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGSC
:.: :::. .:.. ..:: ::::
CCDS66 SEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREK---GYVTPVKNQG---QCGSC
90 100 110 120 130
100 110 120 130 140
pF1KE0 WAHASTSAMADRINIKRKGAWPSTLLSVQNVIDC----GNAGSCEGGNDLSVWDYAHQH-
:: ..:.:. .. ... : : :: ::..:: :: : :.:: . ::: :.
CCDS66 WAFSATGALEGQM-FRKTGRLIS--LSEQNLVDCSGPQGNEG-CNGG----LMDYAFQYV
140 150 160 170 180 190
150 160 170 180 190 200
pF1KE0 ----GIPDETCNNYQAKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREK-M
:. .: :.: .. : :.: ... : : . .. .:: .
CCDS66 QDNGGLDSEESYPYEATEESC-KYNPK---------YSVANDT-----GFVDIPKQEKAL
200 210 220 230
210 220 230 240 250
pF1KE0 MAEIYANGPISCGIMAT-ERLANYTGGIYAEYQDTTY-INHVVSVAGWGI----SDGTEY
: . . :::: .: : : . : ::: : . .. ..: : :.:.:. ::...:
CCDS66 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKY
240 250 260 270 280 290
260 270 280 290 300
pF1KE0 WIVRNSWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGDPIV
:.:.::::: :: :....
CCDS66 WLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV
300 310 320 330
>>CCDS6723.1 CTSV gene_id:1515|Hs108|chr9 (334 aa)
initn: 273 init1: 106 opt: 298 Z-score: 361.1 bits: 75.0 E(32554): 8.6e-14
Smith-Waterman score: 346; 31.0% identity (57.5% similar) in 261 aa overlap (27-275:88-315)
10 20 30 40 50
pF1KE0 MARRGPGWRPLLLLVLLAGAAQGGLYFRRGQTCYRPLRGDGLAPLGRSTYPRPHEY
::. . :.: . . .. : .
CCDS67 MIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQMMGCFRNQK------FRKGKVFREPLF
60 70 80 90 100 110
60 70 80 90 100 110
pF1KE0 LSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPST
: ::::: :::. : :.. ..:: . :::::: ..:.:. .. ... : :
CCDS67 L---DLPKSVDWRK-KG--YVTPVKNQ---KQCGSCWAFSATGALEGQM-FRKTGKLVS-
120 130 140 150 160
120 130 140 150 160 170
pF1KE0 LLSVQNVIDC----GNAGSCEGGNDLSVWDYAHQHG-IPDETCNNYQAKDQECDKFNQCG
:: ::..:: :: : :.:: ...:....: . .: : : :. :
CCDS67 -LSEQNLVDCSRPQGNQG-CNGGFMARAFQYVKENGGLDSEESYPYVAVDEIC-------
170 180 190 200 210
180 190 200 210 220
pF1KE0 TCNEFKECHAIRNYTLWRVGDYGSLSGREK-MMAEIYANGPISCGIMATER-LANYTGGI
... ... : : . : :.:: .: . . :::: .. : . . : .::
CCDS67 ---KYRPENSVANDTGFTV----VAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGI
220 230 240 250 260
230 240 250 260 270 280
pF1KE0 YAEYQ-DTTYINHVVSVAGWGI----SDGTEYWIVRNSWGEPWGERGWLRIVTSTYKDGK
: : . .. ..: : :.:.:. :....::.:.:::: :: :...:
CCDS67 YFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNHCG
270 280 290 300 310 320
290 300
pF1KE0 GARYNLAIEEHCTFGDPIV
CCDS67 IATAASYPNV
330
>>CCDS968.1 CTSS gene_id:1520|Hs108|chr1 (331 aa)
initn: 309 init1: 148 opt: 270 Z-score: 327.5 bits: 68.8 E(32554): 6.4e-12
Smith-Waterman score: 370; 33.9% identity (60.3% similar) in 224 aa overlap (62-275:115-312)
40 50 60 70 80 90
pF1KE0 TCYRPLRGDGLAPLGRSTYPRPHEYLSPADLPKSWDWRNVDGVNYASITRNQHIPQYCGS
:: : ::: . . .:. .. . ::.
CCDS96 SEEVMSLMSSLRVPSQWQRNITYKSNPNRILPDSVDWR-----EKGCVTEVKYQGS-CGA
90 100 110 120 130
100 110 120 130 140
pF1KE0 CWAHASTSAMADRINIKRKGAWPSTLLSVQNVIDC-----GNAGSCEGGNDLSVWDYA-H
::: ....:. ....: : : ::.::..:: :: : :.:: ....:
CCDS96 CWAFSAVGALEAQLKLKT-GKLVS--LSAQNLVDCSTEKYGNKG-CNGGFMTTAFQYIID
140 150 160 170 180 190
150 160 170 180 190 200
pF1KE0 QHGIPDETCNNYQAKDQEC--DKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREKMM
..:: ... :.: ::.: :. . .::... : :: :: ..
CCDS96 NKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTEL------------PYG----REDVL
200 210 220 230
210 220 230 240 250 260
pF1KE0 AEIYAN-GPISCGIMATE-RLANYTGGIYAEYQDTTYINHVVSVAGWGISDGTEYWIVRN
: :: ::.: :. : . . : .:.: : . : .:: : :.:.: .: :::.:.:
CCDS96 KEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKN
240 250 260 270 280 290
270 280 290 300
pF1KE0 SWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGDPIV
:::. .::.:..:.
CCDS96 SWGHNFGEEGYIRMARNKGNHCGIASFPSYPEI
300 310 320 330
>>CCDS55634.1 CTSS gene_id:1520|Hs108|chr1 (281 aa)
initn: 277 init1: 148 opt: 254 Z-score: 309.4 bits: 65.2 E(32554): 6.5e-11
Smith-Waterman score: 354; 35.0% identity (61.4% similar) in 197 aa overlap (89-275:86-262)
60 70 80 90 100 110
pF1KE0 PADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLL
::.::: ....:. ....: : : :
CCDS55 LKFVMLHNLEHSMGMHSYDLGMNHLGDMGSCGACWAFSAVGALEAQLKLKT-GKLVS--L
60 70 80 90 100 110
120 130 140 150 160 170
pF1KE0 SVQNVIDC-----GNAGSCEGGNDLSVWDYA-HQHGIPDETCNNYQAKDQEC--DKFNQC
:.::..:: :: : :.:: ....: ..:: ... :.: ::.: :. .
CCDS55 SAQNLVDCSTEKYGNKG-CNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRA
120 130 140 150 160 170
180 190 200 210 220
pF1KE0 GTCNEFKECHAIRNYTLWRVGDYGSLSGREKMMAEIYAN-GPISCGIMATE-RLANYTGG
.::... : :: :: .. : :: ::.: :. : . . : .:
CCDS55 ATCSKYTEL------------PYG----REDVLKEAVANKGPVSVGVDARHPSFFLYRSG
180 190 200 210
230 240 250 260 270 280
pF1KE0 IYAEYQDTTYINHVVSVAGWGISDGTEYWIVRNSWGEPWGERGWLRIVTSTYKDGKGARY
.: : . : .:: : :.:.: .: :::.:.::::. .::.:..:.
CCDS55 VYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASF
220 230 240 250 260 270
290 300
pF1KE0 NLAIEEHCTFGDPIV
CCDS55 PSYPEI
280
303 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 21:11:49 2016 done: Thu Nov 3 21:11:49 2016
Total Scan time: 2.340 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]