FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB6938, 229 aa
1>>>pF1KB6938 229 - 229 aa - 229 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.1896+/-0.000899; mu= 15.1880+/- 0.054
mean_var=74.0425+/-14.632, 0's: 0 Z-trim(106.4): 94 B-trim: 0 in 0/50
Lambda= 0.149051
statistics sampled from 8893 (8990) to 8893 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.651), E-opt: 0.2 (0.276), width: 16
Scan time: 1.490
The best scores are:                                    opt  bits E(32554)
CCDS41752.1 CLEC1B gene_id:51266|Hs108|chr12    ( 229) 1588 350.5 5.2e-97
CCDS41751.1 CLEC1B gene_id:51266|Hs108|chr12    ( 196) 1238 275.2 2.1e-74
CCDS73443.1 CLEC1A gene_id:51267|Hs108|chr12    ( 247)  387  92.3 3.1e-19
CCDS8612.1 CLEC1A gene_id:51267|Hs108|chr12     ( 280)  387  92.3 3.4e-19
CCDS44830.1 CLEC12B gene_id:387837|Hs108|chr12  ( 276)  369  88.4 4.9e-18
CCDS41753.1 CLEC7A gene_id:64581|Hs108|chr12    ( 247)  364  87.3 9.4e-18
CCDS76528.1 CLEC1A gene_id:51267|Hs108|chr12    ( 188)  348  83.8 8.3e-17
CCDS8613.1 CLEC7A gene_id:64581|Hs108|chr12     ( 201)  302  73.9 8.3e-14
CCDS8614.1 CLEC7A gene_id:64581|Hs108|chr12     ( 168)  301  73.7 8.4e-14
CCDS8609.1 CLEC12A gene_id:160364|Hs108|chr12   ( 232)  301  73.8 1.1e-13
CCDS8608.1 CLEC12A gene_id:160364|Hs108|chr12   ( 265)  301  73.8 1.2e-13
CCDS55803.1 CLEC12A gene_id:160364|Hs108|chr12  ( 275)  301  73.8 1.2e-13
CCDS8623.1 KLRK1 gene_id:22914|Hs108|chr12      ( 216)  285  70.3 1.1e-12
CCDS8618.1 OLR1 gene_id:4973|Hs108|chr12        ( 273)  283  70.0 1.8e-12
CCDS8610.1 CLEC12B gene_id:387837|Hs108|chr12   ( 232)  280  69.3 2.5e-12
CCDS8622.1 KLRD1 gene_id:3824|Hs108|chr12       ( 148)  265  65.9 1.6e-11
CCDS8621.1 KLRD1 gene_id:3824|Hs108|chr12       ( 179)  265  65.9 1.9e-11
CCDS8611.1 CLEC9A gene_id:283420|Hs108|chr12    ( 241)  265  66.0 2.4e-11
CCDS73442.1 CLEC12A gene_id:160364|Hs108|chr12  ( 213)  258  64.5 6.1e-11
>>CCDS41752.1 CLEC1B gene_id:51266|Hs108|chr12 (229 aa)
initn: 1588 init1: 1588 opt: 1588 Z-score: 1855.1 bits: 350.5 E(32554): 5.2e-97
Smith-Waterman score: 1588; 98.7% identity (99.1% similar) in 229 aa overlap (1-229:1-229)
               10        20        30        40        50        60
pF1KB6 MQDEDGYITLNIKTRKPALVSVGPASSSWWRVMALILLILCVGMVVGLVALGIWSVMQRN
       :::::::::::::::::::.::: ::::::::::::::::::::::::::::::::::::
CCDS41 MQDEDGYITLNIKTRKPALISVGSASSSWWRVMALILLILCVGMVVGLVALGIWSVMQRN
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB6 YLQDENENRTGTLQQLAKRFCQYVVKQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHN
       ::: ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 YLQGENENRTGTLQQLAKRFCQYVVKQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHN
               70        80        90       100       110       120
              130       140       150       160       170       180
pF1KB6 LTWEESKQYCTDMNATLLKIDNRNIVEYIKARTHLIRWVGLSRQKSNEVWKWEDGSVISE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 LTWEESKQYCTDMNATLLKIDNRNIVEYIKARTHLIRWVGLSRQKSNEVWKWEDGSVISE
              130       140       150       160       170       180
              190       200       210       220
pF1KB6 NMFEFLEDGKGNMNCAYFHNGKMHPTFCENKHYLMCERKAGMTKVDQLP
       :::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 NMFEFLEDGKGNMNCAYFHNGKMHPTFCENKHYLMCERKAGMTKVDQLP
              190       200       210       220
>>CCDS41751.1 CLEC1B gene_id:51266|Hs108|chr12 (196 aa)
initn: 1238 init1: 1238 opt: 1238 Z-score: 1449.3 bits: 275.2 E(32554): 2.1e-74
Smith-Waterman score: 1293; 84.3% identity (85.2% similar) in 229 aa overlap (1-229:1-196)
               10        20        30        40        50        60
pF1KB6 MQDEDGYITLNIKTRKPALVSVGPASSSWWRVMALILLILCVGMVVGLVALGIWSVMQRN
       :::::::::::::::::::.:                                 .:::::
CCDS41 MQDEDGYITLNIKTRKPALIS---------------------------------AVMQRN
               10        20
               70        80        90       100       110       120
pF1KB6 YLQDENENRTGTLQQLAKRFCQYVVKQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHN
       ::: ::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 YLQGENENRTGTLQQLAKRFCQYVVKQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHN
        30        40        50        60        70        80
              130       140       150       160       170       180
pF1KB6 LTWEESKQYCTDMNATLLKIDNRNIVEYIKARTHLIRWVGLSRQKSNEVWKWEDGSVISE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 LTWEESKQYCTDMNATLLKIDNRNIVEYIKARTHLIRWVGLSRQKSNEVWKWEDGSVISE
        90       100       110       120       130       140
              190       200       210       220
pF1KB6 NMFEFLEDGKGNMNCAYFHNGKMHPTFCENKHYLMCERKAGMTKVDQLP
       :::::::::::::::::::::::::::::::::::::::::::::::::
CCDS41 NMFEFLEDGKGNMNCAYFHNGKMHPTFCENKHYLMCERKAGMTKVDQLP
       150       160       170       180       190
>>CCDS73443.1 CLEC1A gene_id:51267|Hs108|chr12 (247 aa)
initn: 340 init1: 214 opt: 387 Z-score: 458.9 bits: 92.3 E(32554): 3.1e-19
Smith-Waterman score: 387; 31.2% identity (67.6% similar) in 173 aa overlap (62-228:70-236)
40 50 60 70 80 90
pF1KB6 VMALILLILCVGMVVGLVALGIWSVMQRNYLQDENENRTGTLQQLAKRFCQYVVKQSELK
:: .: . .:.::..:...:. ::
CCDS73 FQYYQLSNTGQDTISQMEERLGNTSQELQSLQVQNIKLAGSLQHVAEKLCR------ELY
40 50 60 70 80 90
100 110 120 130 140 150
pF1KB6 GTFKGHKCSPCDTNWRYYGDSCYGFFRHNLTWEESKQYCTDMNATLLKIDNRNIVEYIKA
. .:.:::: .:...::.:: :.. . .::. : .: . :.:.:::.... .:. .
CCDS73 NKAGAHRCSPCTEQWKWHGDNCYQFYKDSKSWEDCKYFCLSENSTMLKINKQEDLEFAAS
100 110 120 130 140 150
160 170 180 190 200
pF1KB6 RTH----LIRWVGLSRQKSNEVWKWEDGSVISENMFEFLED--GKGNMNCAYFHNGKMHP
... :.:: : :...: : ::. .. ..:... : . . .:. . :: .
CCDS73 QSYSEFFYSYWTGLLRPDSGKAWLWMDGTPFTSELFHIIIDVTSPRSRDCVAILNGMIFS
160 170 180 190 200 210
210 220
pF1KB6 TFCENKHYLMCERKAGMTKVDQLP
:.. . .:::.:::.: ..:
CCDS73 KDCKELKRCVCERRAGMVKPESLHVPPETLGEGD
220 230 240
>>CCDS8612.1 CLEC1A gene_id:51267|Hs108|chr12 (280 aa)
initn: 431 init1: 214 opt: 387 Z-score: 458.1 bits: 92.3 E(32554): 3.4e-19
Smith-Waterman score: 444; 30.2% identity (60.0% similar) in 265 aa overlap (1-228:11-269)
10 20 30 40
pF1KB6 MQDEDGYITLNIK------TRKPA-LVSVGPASSSWWRVMALILLILCVG
: :.:: :.... ::.: . : :: :: .:: :: ::.
CCDS86 MQAKYSSTRDMLDDDGDTTMSLHSQGSATTRHPEPRRTEHRAPSSTWRPVALTLLTLCLV
10 20 30 40 50 60
50 60 70
pF1KB6 MVVGLVALGIW---------------SVMQRNY---------LQDENENRTGTLQQLAKR
...::.:::. : :.. :: .: . .:.::..:..
CCDS86 LLIGLAALGLLFFQYYQLSNTGQDTISQMEERLGNTSQELQSLQVQNIKLAGSLQHVAEK
70 80 90 100 110 120
80 90 100 110 120 130
pF1KB6 FCQYVVKQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHNLTWEESKQYCTDMNATLLK
.:. :: . .:.:::: .:...::.:: :.. . .::. : .: . :.:.::
CCDS86 LCR------ELYNKAGAHRCSPCTEQWKWHGDNCYQFYKDSKSWEDCKYFCLSENSTMLK
130 140 150 160 170
140 150 160 170 180 190
pF1KB6 IDNRNIVEYIKARTH----LIRWVGLSRQKSNEVWKWEDGSVISENMFEFLED--GKGNM
:.... .:. .... :.:: : :...: : ::. .. ..:... : . .
CCDS86 INKQEDLEFAASQSYSEFFYSYWTGLLRPDSGKAWLWMDGTPFTSELFHIIIDVTSPRSR
180 190 200 210 220 230
200 210 220
pF1KB6 NCAYFHNGKMHPTFCENKHYLMCERKAGMTKVDQLP
.:. . :: . :.. . .:::.:::.: ..:
CCDS86 DCVAILNGMIFSKDCKELKRCVCERRAGMVKPESLHVPPETLGEGD
240 250 260 270 280
>>CCDS44830.1 CLEC12B gene_id:387837|Hs108|chr12 (276 aa)
initn: 437 init1: 103 opt: 369 Z-score: 437.3 bits: 88.4 E(32554): 4.9e-18
Smith-Waterman score: 392; 30.9% identity (60.1% similar) in 243 aa overlap (24-228:33-275)
10 20 30 40 50
pF1KB6 MQDEDGYITLNIKTRKPALVSVGPASSSWWRVMALILLILCVGMVVGLVALGI
:: : :: :: :. ::. ...:::.::.
CCDS44 EEVTYATLTFQDSAGARNNRDGNNLRKRGHPAPSPIWRHAALGLVTLCLMLLIGLVTLGM
10 20 30 40 50 60
60 70 80
pF1KB6 WSVMQRNYLQDENENRTG---TLQQ----------------------------LAKRFCQ
.. : .....:. . :.:: . :: :
CCDS44 MFLQISNDINSDSEKLSQLQKTIQQQQDNLSQQLGNSNNLSMEEEFLKSQISSVLKRQEQ
70 80 90 100 110 120
90 100 110 120 130 140
pF1KB6 YVVKQ-SELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHNL-TWEESKQYCTDMNATLLKI
...: .:: . :.:.:: :..: .::: : .. :: .:.. : : :.::.::
CCDS44 MAIKLCQELIIHTSDHRCNPCPKMWQWYQNSCYYFTTNEEKTWANSRKDCIDKNSTLVKI
130 140 150 160 170 180
150 160 170 180 190
pF1KB6 DNRNIVEYIKARTHLIR---WVGLSRQKSNEVWKWEDGSVISENMF--EFLEDGKGNMNC
:. . ... .. :. :.::: ..:.. : :::::: : ..: . :.. .:. .:
CCDS44 DSLEEKDFLMSQPLLMFSFFWLGLSWDSSGRSWFWEDGSVPSPSLFSTKELDQINGSKGC
190 200 210 220 230 240
200 210 220
pF1KB6 AYFHNGKMHPTFCENKHYLMCERKAGMTKVDQLP
:::..:... . : . . .::. :. .:...:
CCDS44 AYFQKGNIYISRCSAEIFWICEKTAAPVKTEDLD
250 260 270
>>CCDS41753.1 CLEC7A gene_id:64581|Hs108|chr12 (247 aa)
initn: 416 init1: 140 opt: 364 Z-score: 432.2 bits: 87.3 E(32554): 9.4e-18
Smith-Waterman score: 390; 32.6% identity (59.4% similar) in 239 aa overlap (3-222:11-247)
10 20 30 40
pF1KB6 MQDEDGYITLNIKTRKPALVSV----GP-ASSSWWRVMALILLILCVGMVVG
::::: :.. ... . ..: : :.: ::..:.:: :::. ..:
CCDS41 MEYHPDLENLDEDGYTQLHFDSQSNTRIAVVSEKGSCAASPPWRLIAVILGILCLVILVI
10 20 30 40 50 60
50 60 70 80 90 100
pF1KB6 LVALG---IW-SVMQRNYLQDE---NENRTGTLQQLAKRFCQYVVKQSELKGTFKGHKCS
:.:: :: : : :.. ..:. . : . . . :. . .: : : :
CCDS41 AVVLGTMAIWRSNSGSNTLENGYFLSRNKENHSQPTQSSLEDSVTPTKAVKTT--GVLSS
70 80 90 100 110
110 120 130 140 150
pF1KB6 PCDTNWRYYGDSCYGFFRHNLTWEESKQYCTDMNATLLKIDNRN----IVEYIKARTHLI
:: :: : ::: : .:. ::. : .....:::::. : ::. ....
CCDS41 PCPPNWIIYEKSCYLFSMSLNSWDGSKRQCWQLGSNLLKIDSSNELGFIVKQVSSQPDNS
120 130 140 150 160 170
160 170 180 190 200 210
pF1KB6 RWVGLSRQKSNEVWKWEDGSVISENMFEFLEDG---KGNMNCAYFHNGKMHPTFCENKHY
:.:::: ... : :::::..: :.:.. . . . ::...: . .. .: :
CCDS41 FWIGLSRPQTEVPWLWEDGSTFSSNLFQIRTTATQENPSPNCVWIHVSVIYDQLCSVPSY
180 190 200 210 220 230
220
pF1KB6 LMCERKAGMTKVDQLP
.::.: .:
CCDS41 SICEKKFSM
240
>>CCDS76528.1 CLEC1A gene_id:51267|Hs108|chr12 (188 aa)
initn: 324 init1: 198 opt: 348 Z-score: 415.2 bits: 83.8 E(32554): 8.3e-17
Smith-Waterman score: 348; 31.9% identity (68.8% similar) in 138 aa overlap (97-228:40-177)
70 80 90 100 110 120
pF1KB6 ENRTGTLQQLAKRFCQYVVKQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHNLTWEES
:.:::: .:...::.:: :.. . .::.
CCDS76 DMLDDDGDTTMSLHSQGSATTRHPEPRRTAHRCSPCTEQWKWHGDNCYQFYKDSKSWEDC
10 20 30 40 50 60
130 140 150 160 170 180
pF1KB6 KQYCTDMNATLLKIDNRNIVEYIKARTH----LIRWVGLSRQKSNEVWKWEDGSVISENM
: .: . :.:.:::.... .:. .... :.:: : :...: : ::. .. ..
CCDS76 KYFCLSENSTMLKINKQEDLEFAASQSYSEFFYSYWTGLLRPDSGKAWLWMDGTPFTSEL
70 80 90 100 110 120
190 200 210 220
pF1KB6 FEFLED--GKGNMNCAYFHNGKMHPTFCENKHYLMCERKAGMTKVDQLP
:... : . . .:. . :: . :.. . .:::.:::.: ..:
CCDS76 FHIIIDVTSPRSRDCVAILNGMIFSKDCKELKRCVCERRAGMVKPESLHVPPETLGEGD
130 140 150 160 170 180
>>CCDS8613.1 CLEC7A gene_id:64581|Hs108|chr12 (201 aa)
initn: 409 init1: 140 opt: 302 Z-score: 361.4 bits: 73.9 E(32554): 8.3e-14
Smith-Waterman score: 318; 29.3% identity (53.0% similar) in 232 aa overlap (3-222:11-201)
10 20 30 40
pF1KB6 MQDEDGYITLNIKTRKPALVSV----GP-ASSSWWRVMALILLILCVGMVVG
::::: :.. ... . ..: : :.: ::..:.:: :::. ..:
CCDS86 MEYHPDLENLDEDGYTQLHFDSQSNTRIAVVSEKGSCAASPPWRLIAVILGILCLVILVI
10 20 30 40 50 60
50 60 70 80 90 100
pF1KB6 LVALGIWSVMQRNYLQDENENRTGTLQQLAKRFCQYVVKQSELKGTFKGHKCSPCDTNWR
:.:: .:.. ::: ::
CCDS86 AVVLGTMGVLS-----------------------------------------SPCPPNWI
70
110 120 130 140 150 160
pF1KB6 YYGDSCYGFFRHNLTWEESKQYCTDMNATLLKIDNRN----IVEYIKARTHLIRWVGLSR
: ::: : .:. ::. : .....:::::. : ::. .... :.::::
CCDS86 IYEKSCYLFSMSLNSWDGSKRQCWQLGSNLLKIDSSNELGFIVKQVSSQPDNSFWIGLSR
80 90 100 110 120 130
170 180 190 200 210 220
pF1KB6 QKSNEVWKWEDGSVISENMFEFLEDG---KGNMNCAYFHNGKMHPTFCENKHYLMCERKA
... : :::::..: :.:.. . . . ::...: . .. .: : .::.:
CCDS86 PQTEVPWLWEDGSTFSSNLFQIRTTATQENPSPNCVWIHVSVIYDQLCSVPSYSICEKKF
140 150 160 170 180 190
pF1KB6 GMTKVDQLP
.:
CCDS86 SM
200
>>CCDS8614.1 CLEC7A gene_id:64581|Hs108|chr12 (168 aa)
initn: 346 init1: 146 opt: 301 Z-score: 361.3 bits: 73.7 E(32554): 8.4e-14
Smith-Waterman score: 301; 34.8% identity (61.5% similar) in 135 aa overlap (95-222:34-168)
70 80 90 100 110 120
pF1KB6 ENENRTGTLQQLAKRFCQYVVKQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHNLTWE
:: ::: :: : ::: : .:.
CCDS86 HPDLENLDEDGYTQLHFDSQSNTRIAVVSEKGVLSSPCPPNWIIYEKSCYLFSMSLNSWD
10 20 30 40 50 60
130 140 150 160 170 180
pF1KB6 ESKQYCTDMNATLLKIDNRN----IVEYIKARTHLIRWVGLSRQKSNEVWKWEDGSVISE
::. : .....:::::. : ::. .... :.:::: ... : :::::..:
CCDS86 GSKRQCWQLGSNLLKIDSSNELGFIVKQVSSQPDNSFWIGLSRPQTEVPWLWEDGSTFSS
70 80 90 100 110 120
190 200 210 220
pF1KB6 NMFEFLEDG---KGNMNCAYFHNGKMHPTFCENKHYLMCERKAGMTKVDQLP
:.:.. . . . ::...: . .. .: : .::.: .:
CCDS86 NLFQIRTTATQENPSPNCVWIHVSVIYDQLCSVPSYSICEKKFSM
130 140 150 160
>>CCDS8609.1 CLEC12A gene_id:160364|Hs108|chr12 (232 aa)
initn: 267 init1: 213 opt: 301 Z-score: 359.3 bits: 73.8 E(32554): 1.1e-13
Smith-Waterman score: 301; 31.1% identity (61.1% similar) in 167 aa overlap (56-220:60-219)
30 40 50 60 70 80
pF1KB6 SSSWWRVMALILLILCVGMVVGLVALGIWSVMQRNYLQDENENRTGTLQQLAKRFCQYVV
.:. .... .: . ::: .: ..:.
CCDS86 KVHVTLKIEMKKMNKLQNISEELQRNISLQLMSNMNISNKIRNLSTTLQTIATKLCR---
30 40 50 60 70 80
90 100 110 120 130 140
pF1KB6 KQSELKGTFKGHKCSPCDTNWRYYGDSCYGFFRHNL-TWEESKQYCTDMNATLLKIDNRN
:: . . :::.:: : .. :::: :. .. ::.:::. :. .::.::::.:.:
CCDS86 ---ELYSKEQEHKCKPCPRRWIWHKDSCY-FLSDDVQTWQESKMACAAQNASLLKINNKN
90 100 110 120 130 140
150 160 170 180 190 200
pF1KB6 IVEYIKARTHLI-RWVGLSRQKSNEVWKWEDGSVISENMFEFLEDGKGNMNCAYFHNGKM
.:.::.... :.::: .... :. . : .:: :.:.. .
CCDS86 ALEFIKSQSRSYDYWLGLSPEEDSTRGMRVDNIINSSAWVIRNAPDLNNMYCGYINRLYV
150 160 170 180 190 200
210 220
pF1KB6 HPTFCENKHYLMCERKAGMTKVDQLP
. : :. ..::. :
CCDS86 QYYHCTYKKRMICEKMANPVQLGSTYFREA
210 220 230
229 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 11:47:31 2016 done: Fri Nov 4 11:47:31 2016
Total Scan time: 1.490 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]