FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KSDA0022, 232 aa
1>>>pF1KSDA0022 232 - 232 aa - 232 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 4.7069+/-0.000837; mu= 17.2145+/- 0.050
mean_var=62.4992+/-12.406, 0's: 0 Z-trim(106.7): 65 B-trim: 0 in 0/49
Lambda= 0.162232
statistics sampled from 9070 (9136) to 9070 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.659), E-opt: 0.2 (0.281), width: 16
Scan time: 1.580
The best scores are: opt bits E(32554)
CCDS33308.1 CD302 gene_id:9936|Hs108|chr2 ( 232) 1572 376.3 9.4e-105
CCDS56140.1 CD302 gene_id:100526664|Hs108|chr2 (1817) 1455 349.6 7.9e-96
CCDS56141.1 CD302 gene_id:100526664|Hs108|chr2 (1873) 1445 347.3 4.1e-95
CCDS74595.1 CD302 gene_id:9936|Hs108|chr2 ( 195) 1179 284.2 4e-77
CCDS56139.1 CD302 gene_id:9936|Hs108|chr2 ( 174) 660 162.7 1.4e-40
>>CCDS33308.1 CD302 gene_id:9936|Hs108|chr2 (232 aa)
initn: 1572 init1: 1572 opt: 1572 Z-score: 1994.2 bits: 376.3 E(32554): 9.4e-105
Smith-Waterman score: 1572; 100.0% identity (100.0% similar) in 232 aa overlap (1-232:1-232)
10 20 30 40 50 60
pF1KSD MLRAALPALLLPLLGLAAAAVADCPSSTWIQFQDSCYIFLQEAIKVESIEDVRNQCTDHG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MLRAALPALLLPLLGLAAAAVADCPSSTWIQFQDSCYIFLQEAIKVESIEDVRNQCTDHG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KSD ADMISIHNEEENAFILDTLKKQWKGPDDILLGMFYDTDDASFKWFDNSNMTFDKWTDQDD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 ADMISIHNEEENAFILDTLKKQWKGPDDILLGMFYDTDDASFKWFDNSNMTFDKWTDQDD
70 80 90 100 110 120
130 140 150 160 170 180
pF1KSD DEDLVDTCAFLHIKTGEWKKGNCEVSSVEGTLCKTAIPYKRKYLSDNHILISALVIASTV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 DEDLVDTCAFLHIKTGEWKKGNCEVSSVEGTLCKTAIPYKRKYLSDNHILISALVIASTV
130 140 150 160 170 180
190 200 210 220 230
pF1KSD ILTVLGAIIWFLYKKHSDSRFTTVFSTAPQSPYNEDCVLVVGEENEYPVQFD
::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 ILTVLGAIIWFLYKKHSDSRFTTVFSTAPQSPYNEDCVLVVGEENEYPVQFD
190 200 210 220 230
>>CCDS56140.1 CD302 gene_id:100526664|Hs108|chr2 (1817 aa)
initn: 1445 init1: 1445 opt: 1455 Z-score: 1833.9 bits: 349.6 E(32554): 7.9e-96
Smith-Waterman score: 1455; 97.7% identity (98.6% similar) in 219 aa overlap (14-232:1600-1817)
10 20 30 40
pF1KSD MLRAALPALLLPLLGLAAAAVADCPSSTWIQFQDSCYIFLQEA
:::. .: :::::::::::::::::::::
CCDS56 ATIVSIKDEDENKFVSRLMRENNNITMRVWLGLSQHSV-DCPSSTWIQFQDSCYIFLQEA
1570 1580 1590 1600 1610 1620
50 60 70 80 90 100
pF1KSD IKVESIEDVRNQCTDHGADMISIHNEEENAFILDTLKKQWKGPDDILLGMFYDTDDASFK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 IKVESIEDVRNQCTDHGADMISIHNEEENAFILDTLKKQWKGPDDILLGMFYDTDDASFK
1630 1640 1650 1660 1670 1680
110 120 130 140 150 160
pF1KSD WFDNSNMTFDKWTDQDDDEDLVDTCAFLHIKTGEWKKGNCEVSSVEGTLCKTAIPYKRKY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 WFDNSNMTFDKWTDQDDDEDLVDTCAFLHIKTGEWKKGNCEVSSVEGTLCKTAIPYKRKY
1690 1700 1710 1720 1730 1740
170 180 190 200 210 220
pF1KSD LSDNHILISALVIASTVILTVLGAIIWFLYKKHSDSRFTTVFSTAPQSPYNEDCVLVVGE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 LSDNHILISALVIASTVILTVLGAIIWFLYKKHSDSRFTTVFSTAPQSPYNEDCVLVVGE
1750 1760 1770 1780 1790 1800
230
pF1KSD ENEYPVQFD
:::::::::
CCDS56 ENEYPVQFD
1810
>>CCDS56141.1 CD302 gene_id:100526664|Hs108|chr2 (1873 aa)
initn: 1445 init1: 1445 opt: 1445 Z-score: 1821.1 bits: 347.3 E(32554): 4.1e-95
Smith-Waterman score: 1445; 100.0% identity (100.0% similar) in 210 aa overlap (23-232:1664-1873)
10 20 30 40 50
pF1KSD MLRAALPALLLPLLGLAAAAVADCPSSTWIQFQDSCYIFLQEAIKVESIEDV
::::::::::::::::::::::::::::::
CCDS56 RCSMLIASNETWKKVECEHGFGRVVCKVPLDCPSSTWIQFQDSCYIFLQEAIKVESIEDV
1640 1650 1660 1670 1680 1690
60 70 80 90 100 110
pF1KSD RNQCTDHGADMISIHNEEENAFILDTLKKQWKGPDDILLGMFYDTDDASFKWFDNSNMTF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 RNQCTDHGADMISIHNEEENAFILDTLKKQWKGPDDILLGMFYDTDDASFKWFDNSNMTF
1700 1710 1720 1730 1740 1750
120 130 140 150 160 170
pF1KSD DKWTDQDDDEDLVDTCAFLHIKTGEWKKGNCEVSSVEGTLCKTAIPYKRKYLSDNHILIS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 DKWTDQDDDEDLVDTCAFLHIKTGEWKKGNCEVSSVEGTLCKTAIPYKRKYLSDNHILIS
1760 1770 1780 1790 1800 1810
180 190 200 210 220 230
pF1KSD ALVIASTVILTVLGAIIWFLYKKHSDSRFTTVFSTAPQSPYNEDCVLVVGEENEYPVQFD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 ALVIASTVILTVLGAIIWFLYKKHSDSRFTTVFSTAPQSPYNEDCVLVVGEENEYPVQFD
1820 1830 1840 1850 1860 1870
>>CCDS74595.1 CD302 gene_id:9936|Hs108|chr2 (195 aa)
initn: 1292 init1: 1179 opt: 1179 Z-score: 1498.1 bits: 284.2 E(32554): 4e-77
Smith-Waterman score: 1222; 84.1% identity (84.1% similar) in 232 aa overlap (1-232:1-195)
10 20 30 40 50 60
pF1KSD MLRAALPALLLPLLGLAAAAVADCPSSTWIQFQDSCYIFLQEAIKVESIEDVRNQCTDHG
:::::::::::::::::::::: :
CCDS74 MLRAALPALLLPLLGLAAAAVA-------------------------------------G
10 20
70 80 90 100 110 120
pF1KSD ADMISIHNEEENAFILDTLKKQWKGPDDILLGMFYDTDDASFKWFDNSNMTFDKWTDQDD
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 ADMISIHNEEENAFILDTLKKQWKGPDDILLGMFYDTDDASFKWFDNSNMTFDKWTDQDD
30 40 50 60 70 80
130 140 150 160 170 180
pF1KSD DEDLVDTCAFLHIKTGEWKKGNCEVSSVEGTLCKTAIPYKRKYLSDNHILISALVIASTV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 DEDLVDTCAFLHIKTGEWKKGNCEVSSVEGTLCKTAIPYKRKYLSDNHILISALVIASTV
90 100 110 120 130 140
190 200 210 220 230
pF1KSD ILTVLGAIIWFLYKKHSDSRFTTVFSTAPQSPYNEDCVLVVGEENEYPVQFD
::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS74 ILTVLGAIIWFLYKKHSDSRFTTVFSTAPQSPYNEDCVLVVGEENEYPVQFD
150 160 170 180 190
>>CCDS56139.1 CD302 gene_id:9936|Hs108|chr2 (174 aa)
initn: 660 init1: 660 opt: 660 Z-score: 842.3 bits: 162.7 E(32554): 1.4e-40
Smith-Waterman score: 1028; 74.6% identity (75.0% similar) in 232 aa overlap (1-232:1-174)
10 20 30 40 50 60
pF1KSD MLRAALPALLLPLLGLAAAAVADCPSSTWIQFQDSCYIFLQEAIKVESIEDVRNQCTDHG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 MLRAALPALLLPLLGLAAAAVADCPSSTWIQFQDSCYIFLQEAIKVESIEDVRNQCTDHG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KSD ADMISIHNEEENAFILDTLKKQWKGPDDILLGMFYDTDDASFKWFDNSNMTFDKWTDQDD
::::::::::::::::::::::::::::::::::::::
CCDS56 ADMISIHNEEENAFILDTLKKQWKGPDDILLGMFYDTD----------------------
70 80 90
130 140 150 160 170 180
pF1KSD DEDLVDTCAFLHIKTGEWKKGNCEVSSVEGTLCKTAIPYKRKYLSDNHILISALVIASTV
.:::::::::::::::::::::::
CCDS56 ------------------------------------VPYKRKYLSDNHILISALVIASTV
100 110 120
190 200 210 220 230
pF1KSD ILTVLGAIIWFLYKKHSDSRFTTVFSTAPQSPYNEDCVLVVGEENEYPVQFD
::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS56 ILTVLGAIIWFLYKKHSDSRFTTVFSTAPQSPYNEDCVLVVGEENEYPVQFD
130 140 150 160 170
232 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Wed Nov 2 23:23:34 2016 done: Wed Nov 2 23:23:34 2016
Total Scan time: 1.580 Total Display time: -0.020
Function used was FASTA [36.3.4 Apr, 2011]