FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0217, 172 aa
1>>>pF1KE0217 172 - 172 aa - 172 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.1861+/-0.000706; mu= 12.8071+/- 0.043
mean_var=58.3777+/-11.667, 0's: 0 Z-trim(108.8): 24 B-trim: 0 in 0/52
Lambda= 0.167861
statistics sampled from 10457 (10481) to 10457 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.727), E-opt: 0.2 (0.322), width: 16
Scan time: 1.830
The best scores are: opt bits E(32554)
CCDS1877.1 LGALSL gene_id:29094|Hs108|chr2 ( 172) 1166 290.1 4.4e-79
CCDS12521.1 LGALS4 gene_id:3960|Hs108|chr19 ( 323) 305 81.7 4.5e-16
CCDS1611.1 LGALS8 gene_id:3964|Hs108|chr1 ( 359) 304 81.5 5.8e-16
CCDS1612.1 LGALS8 gene_id:3964|Hs108|chr1 ( 317) 303 81.2 6.2e-16
CCDS32592.1 LGALS9 gene_id:3965|Hs108|chr17 ( 323) 301 80.8 8.8e-16
CCDS11222.1 LGALS9 gene_id:3965|Hs108|chr17 ( 355) 301 80.8 9.6e-16
CCDS42283.1 LGALS9B gene_id:284194|Hs108|chr17 ( 355) 295 79.3 2.6e-15
CCDS32587.1 LGALS9C gene_id:654346|Hs108|chr17 ( 356) 289 77.9 7.2e-15
>>CCDS1877.1 LGALSL gene_id:29094|Hs108|chr2 (172 aa)
initn: 1166 init1: 1166 opt: 1166 Z-score: 1533.2 bits: 290.1 E(32554): 4.4e-79
Smith-Waterman score: 1166; 100.0% identity (100.0% similar) in 172 aa overlap (1-172:1-172)
10 20 30 40 50 60
pF1KE0 MAGSVADSDAVVKLDDGHLNNSLSSPVQADVYFPRLIVPFCGHIKGGMRPGKKVLVMGIV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS18 MAGSVADSDAVVKLDDGHLNNSLSSPVQADVYFPRLIVPFCGHIKGGMRPGKKVLVMGIV
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE0 DLNPESFAISLTCGDSEDPPADVAIELKAVFTDRQLLRNSCISGERGEEQSAIPYFPFIP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS18 DLNPESFAISLTCGDSEDPPADVAIELKAVFTDRQLLRNSCISGERGEEQSAIPYFPFIP
70 80 90 100 110 120
130 140 150 160 170
pF1KE0 DQPFRVEILCEHPRFRVFVDGHQLFDFYHRIQTLSAIDTIKINGDLQITKLG
::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS18 DQPFRVEILCEHPRFRVFVDGHQLFDFYHRIQTLSAIDTIKINGDLQITKLG
130 140 150 160 170
>>CCDS12521.1 LGALS4 gene_id:3960|Hs108|chr19 (323 aa)
initn: 317 init1: 209 opt: 305 Z-score: 402.0 bits: 81.7 E(32554): 4.5e-16
Smith-Waterman score: 305; 31.8% identity (66.9% similar) in 154 aa overlap (17-169:170-319)
10 20 30 40
pF1KE0 MAGSVADSDAVVKLDDGHLNNSLSS-PVQADVYFPRLIVPFCGHIK
:: ...:.: :.. ::. :...
CCDS12 GDLQLQSINFIGGQPLRPQGPPMMPPYPGPGHCHQQLNSLPTMEGPPTFNPPVPYFGRLQ
140 150 160 170 180 190
50 60 70 80 90 100
pF1KE0 GGMRPGKKVLVMGIVDLNPESFAISLTCGDSEDPPADVAIELKAVFTDRQLLRNSCISGE
::. . ... : : . .::::.. :.: .:.:.... . . ..::: ..:
CCDS12 GGLTARRTIIIKGYVPPTGKSFAINFKVGSS----GDIALHINPRMGNGTVVRNSLLNGS
200 210 220 230 240 250
110 120 130 140 150 160
pF1KE0 RGEEQSAIPYFPFIPDQPFRVEILCEHPRFRVFVDGHQLFDFYHRIQTLSAIDTIKINGD
: :.. : . :: : : : . : : ::.:...:..:::: ::..... .::..:.::
CCDS12 WGSEEKKITHNPFGPGQFFDLSIRCGLDRFKVYANGQHLFDFAHRLSAFQRVDTLEIQGD
260 270 280 290 300 310
170
pF1KE0 LQITKLG
. ..
CCDS12 VTLSYVQI
320
>>CCDS1611.1 LGALS8 gene_id:3964|Hs108|chr1 (359 aa)
initn: 353 init1: 191 opt: 304 Z-score: 400.0 bits: 81.5 E(32554): 5.8e-16
Smith-Waterman score: 304; 31.1% identity (64.1% similar) in 167 aa overlap (5-171:194-356)
10 20 30
pF1KE0 MAGSVADSDAVVKLDDGHLNNSLSSPVQADVYFP
.: . .: :. .:..:. . .
CCDS16 SLELTEISRENVPKSGTPQLPSNRGGDISKIAPRTVYTKSKDSTVNHTLTCTKIPPMNYV
170 180 190 200 210 220
40 50 60 70 80 90
pF1KE0 RLIVPFCGHIKGGMRPGKKVLVMGIVDLNPESFAISLTCGDSEDPPADVAIELKAVFTDR
.:: .... : ::. :.: : :. : .:: ..: : :. :.:..:. .. .
CCDS16 SKRLPFAARLNTPMGPGRTVVVKGEVNANAKSFNVDLLAGKSK----DIALHLNPRLNIK
230 240 250 260 270
100 110 120 130 140 150
pF1KE0 QLLRNSCISGERGEEQSAIPYFPFIPDQPFRVEILCEHPRFRVFVDGHQLFDFYHRIQTL
..::: .. :::. : ::: : . :.. : :. .:.: :.: . ... ::.. :
CCDS16 AFVRNSFLQESWGEEERNITSFPFSPGMYFEMIIYCDVREFKVAVNGVHSLEYKHRFKEL
280 290 300 310 320 330
160 170
pF1KE0 SAIDTIKINGDLQITKLG
:.:::..::::... ..
CCDS16 SSIDTLEINGDIHLLEVRSW
340 350
>>CCDS1612.1 LGALS8 gene_id:3964|Hs108|chr1 (317 aa)
initn: 353 init1: 191 opt: 303 Z-score: 399.6 bits: 81.2 E(32554): 6.2e-16
Smith-Waterman score: 303; 35.5% identity (68.8% similar) in 138 aa overlap (34-171:181-314)
10 20 30 40 50 60
pF1KE0 SVADSDAVVKLDDGHLNNSLSSPVQADVYFPRLIVPFCGHIKGGMRPGKKVLVMGIVDLN
:.: .:: .... : ::. :.: : :. :
CCDS16 FSFSSDLQSTQASSLELTEISRENVPKSGTPQLRLPFAARLNTPMGPGRTVVVKGEVNAN
160 170 180 190 200 210
70 80 90 100 110 120
pF1KE0 PESFAISLTCGDSEDPPADVAIELKAVFTDRQLLRNSCISGERGEEQSAIPYFPFIPDQP
.:: ..: : :.: .:..:. .. . ..::: .. :::. : ::: : .
CCDS16 AKSFNVDLLAGKSKD----IALHLNPRLNIKAFVRNSFLQESWGEEERNITSFPFSPGMY
220 230 240 250 260
130 140 150 160 170
pF1KE0 FRVEILCEHPRFRVFVDGHQLFDFYHRIQTLSAIDTIKINGDLQITKLG
:.. : :. .:.: :.: . ... ::.. ::.:::..::::... ..
CCDS16 FEMIIYCDVREFKVAVNGVHSLEYKHRFKELSSIDTLEINGDIHLLEVRSW
270 280 290 300 310
>>CCDS32592.1 LGALS9 gene_id:3965|Hs108|chr17 (323 aa)
initn: 363 init1: 188 opt: 301 Z-score: 396.8 bits: 80.8 E(32554): 8.8e-16
Smith-Waterman score: 301; 32.9% identity (66.4% similar) in 152 aa overlap (23-171:176-321)
10 20 30 40 50
pF1KE0 MAGSVADSDAVVKLDDGHLNNSLSSPVQADVYFPR--LIVPFCGHIKGGMRP
.:.:. ...:. .:: : ::. :
CCDS32 SFQPPGVWPANPAPITQTVIHTVQSAPGQMFSTPAIPPMMYPHPAYPMPFITTILGGLYP
150 160 170 180 190 200
60 70 80 90 100 110
pF1KE0 GKKVLVMGIVDLNPESFAISLTCGDSEDPPADVAIELKAVFTDRQLLRNSCISGERGEEQ
.:..:. : : . . : :.: :. .:..:. : . ..::. :.. : :.
CCDS32 SKSILLSGTVLPSAQRFHINLCSGNH------IAFHLNPRFDENAVVRNTQIDNSWGSEE
210 220 230 240 250
120 130 140 150 160
pF1KE0 SAIPY-FPFIPDQPFRVEILCEHPRFRVFVDGHQLFDFYHRIQTLSAIDTIKINGDLQIT
..: .::. : : : :::: ..: :::..::..:::...: .:. ....::.:.:
CCDS32 RSLPRKMPFVRGQSFSVWILCEAHCLKVAVDGQHLFEYYHRLRNLPTINRLEVGGDIQLT
260 270 280 290 300 310
170
pF1KE0 KLG
..
CCDS32 HVQT
320
>>CCDS11222.1 LGALS9 gene_id:3965|Hs108|chr17 (355 aa)
initn: 363 init1: 188 opt: 301 Z-score: 396.2 bits: 80.8 E(32554): 9.6e-16
Smith-Waterman score: 301; 32.9% identity (66.4% similar) in 152 aa overlap (23-171:208-353)
10 20 30 40 50
pF1KE0 MAGSVADSDAVVKLDDGHLNNSLSSPVQADVYFPR--LIVPFCGHIKGGMRP
.:.:. ...:. .:: : ::. :
CCDS11 RQKPPGVWPANPAPITQTVIHTVQSAPGQMFSTPAIPPMMYPHPAYPMPFITTILGGLYP
180 190 200 210 220 230
60 70 80 90 100 110
pF1KE0 GKKVLVMGIVDLNPESFAISLTCGDSEDPPADVAIELKAVFTDRQLLRNSCISGERGEEQ
.:..:. : : . . : :.: :. .:..:. : . ..::. :.. : :.
CCDS11 SKSILLSGTVLPSAQRFHINLCSGNH------IAFHLNPRFDENAVVRNTQIDNSWGSEE
240 250 260 270 280 290
120 130 140 150 160
pF1KE0 SAIPY-FPFIPDQPFRVEILCEHPRFRVFVDGHQLFDFYHRIQTLSAIDTIKINGDLQIT
..: .::. : : : :::: ..: :::..::..:::...: .:. ....::.:.:
CCDS11 RSLPRKMPFVRGQSFSVWILCEAHCLKVAVDGQHLFEYYHRLRNLPTINRLEVGGDIQLT
300 310 320 330 340 350
170
pF1KE0 KLG
..
CCDS11 HVQT
>>CCDS42283.1 LGALS9B gene_id:284194|Hs108|chr17 (355 aa)
initn: 361 init1: 184 opt: 295 Z-score: 388.3 bits: 79.3 E(32554): 2.6e-15
Smith-Waterman score: 295; 31.6% identity (67.8% similar) in 152 aa overlap (23-171:208-353)
10 20 30 40 50
pF1KE0 MAGSVADSDAVVKLDDGHLNNSLSSPVQADVYFPR--LIVPFCGHIKGGMRP
.:.:. ...:. .:: : ::. :
CCDS42 RQKPPSVRPANPAPITQTVIHTVQSASGQMFSTPAIPPMMYPHPAYPMPFITTIPGGLYP
180 190 200 210 220 230
60 70 80 90 100 110
pF1KE0 GKKVLVMGIVDLNPESFAISLTCGDSEDPPADVAIELKAVFTDRQLLRNSCISGERGEEQ
.:.... : : . . : :.: :. :. .:.... : . ..::. :.. : :.
CCDS42 SKSIILSGTVLPSAQRFHINL-CSGSH-----IAFHMNPRFDENAVVRNTQINNSWGSEE
240 250 260 270 280 290
120 130 140 150 160
pF1KE0 SAIPY-FPFIPDQPFRVEILCEHPRFRVFVDGHQLFDFYHRIQTLSAIDTIKINGDLQIT
..: .::. : : : :::: ..: :::...:..:::...: .:. ....::.:.:
CCDS42 RSLPRKMPFVRGQSFSVWILCEAHCLKVAVDGQHVFEYYHRLRNLPTINKLEVGGDIQLT
300 310 320 330 340 350
170
pF1KE0 KLG
..
CCDS42 HVQT
>>CCDS32587.1 LGALS9C gene_id:654346|Hs108|chr17 (356 aa)
initn: 361 init1: 184 opt: 289 Z-score: 380.4 bits: 77.9 E(32554): 7.2e-15
Smith-Waterman score: 289; 33.1% identity (67.6% similar) in 139 aa overlap (34-171:222-354)
10 20 30 40 50 60
pF1KE0 SVADSDAVVKLDDGHLNNSLSSPVQADVYFPRLIVPFCGHIKGGMRPGKKVLVMGIVDLN
: .:: : ::. :.:.... : : .
CCDS32 ITQTVIHTVQSASGQMFSQTPAIPPMMYPHPAYPMPFITTIPGGLYPSKSIILSGTVLPS
200 210 220 230 240 250
70 80 90 100 110 120
pF1KE0 PESFAISLTCGDSEDPPADVAIELKAVFTDRQLLRNSCISGERGEEQSAIPY-FPFIPDQ
. : :.: :. :. .:.... : . ..::. :.. : :. ..: .::. :
CCDS32 AQRFHINL-CSGSH-----IAFHMNPRFDENAVVRNTQINNSWGSEERSLPRKMPFVRGQ
260 270 280 290 300
130 140 150 160 170
pF1KE0 PFRVEILCEHPRFRVFVDGHQLFDFYHRIQTLSAIDTIKINGDLQITKLG
: : :::: ..: :::...:..:::...: .:. ....::.:.:..
CCDS32 SFSVWILCEAHCLKVAVDGQHVFEYYHRLRNLPTINKLEVGGDIQLTHVQT
310 320 330 340 350
172 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 20:53:21 2016 done: Thu Nov 3 20:53:21 2016
Total Scan time: 1.830 Total Display time: 0.000
Function used was FASTA [36.3.4 Apr, 2011]