FASTA searches a protein or DNA sequence data bank
 version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KSDA0025, 391 aa
1>>>pF1KSDA0025 391 - 391 aa - 391 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.8731+/-0.000726; mu= 11.1078+/- 0.044
mean_var=118.5032+/-24.035, 0's: 0 Z-trim(113.7): 16 B-trim: 379 in 1/51
Lambda= 0.117817
statistics sampled from 14308 (14318) to 14308 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.78), E-opt: 0.2 (0.44), width: 16
Scan time: 2.780
The best scores are: opt bits E(32554)
CCDS10771.1 HERPUD1 gene_id:9709|Hs108|chr16 ( 391) 2718 472.4 3.2e-133
CCDS45492.1 HERPUD1 gene_id:9709|Hs108|chr16 ( 390) 2699 469.1 3e-132
CCDS5446.1 HERPUD2 gene_id:64224|Hs108|chr7 ( 406) 363 72.1 1e-12
>>CCDS10771.1 HERPUD1 gene_id:9709|Hs108|chr16 (391 aa)
initn: 2718 init1: 2718 opt: 2718 Z-score: 2505.3 bits: 472.4 E(32554): 3.2e-133
Smith-Waterman score: 2718; 100.0% identity (100.0% similar) in 391 aa overlap (1-391:1-391)
10 20 30 40 50 60
pF1KSD MESETEPEPVTLLVKSPNQRHRDLELSGDRGWSVGHLKAHLSRVYPERPRPEDQRLIYSG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 MESETEPEPVTLLVKSPNQRHRDLELSGDRGWSVGHLKAHLSRVYPERPRPEDQRLIYSG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KSD KLLLDHQCLRDLLPKQEKRHVLHLVCNVKSPSKMPEINAKVAESTEEPAGSNRGQYPEDS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 KLLLDHQCLRDLLPKQEKRHVLHLVCNVKSPSKMPEINAKVAESTEEPAGSNRGQYPEDS
70 80 90 100 110 120
130 140 150 160 170 180
pF1KSD SSDGLRQREVLRNLSSPGWENISRPEAAQQAFQGLGPGFSGYTPYGWLQLSWFQQIYARQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 SSDGLRQREVLRNLSSPGWENISRPEAAQQAFQGLGPGFSGYTPYGWLQLSWFQQIYARQ
130 140 150 160 170 180
190 200 210 220 230 240
pF1KSD YYMQYLAATAASGAFVPPPSAQEIPVVSAPAPAPIHNQFPAENQPANQNAAPQVVVNPGA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 YYMQYLAATAASGAFVPPPSAQEIPVVSAPAPAPIHNQFPAENQPANQNAAPQVVVNPGA
190 200 210 220 230 240
250 260 270 280 290 300
pF1KSD NQNLRMNAQGGPIVEEDDEINRDWLDWTYSAATFSVFLSILYFYSSLSRFLMVMGATVVM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 NQNLRMNAQGGPIVEEDDEINRDWLDWTYSAATFSVFLSILYFYSSLSRFLMVMGATVVM
250 260 270 280 290 300
310 320 330 340 350 360
pF1KSD YLHHVGWFPFRPRPVQNFPNDGPPPDVVNQDPNNNLQEGTDPETEDPNHLPPDRDVLDGE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS10 YLHHVGWFPFRPRPVQNFPNDGPPPDVVNQDPNNNLQEGTDPETEDPNHLPPDRDVLDGE
310 320 330 340 350 360
370 380 390
pF1KSD QTSPSFMSTAWLVFKTFFASLLPEGPPAIAN
:::::::::::::::::::::::::::::::
CCDS10 QTSPSFMSTAWLVFKTFFASLLPEGPPAIAN
370 380 390
>>CCDS45492.1 HERPUD1 gene_id:9709|Hs108|chr16 (390 aa)
initn: 2195 init1: 2195 opt: 2699 Z-score: 2487.9 bits: 469.1 E(32554): 3e-132
Smith-Waterman score: 2699; 99.7% identity (99.7% similar) in 391 aa overlap (1-391:1-390)
10 20 30 40 50 60
pF1KSD MESETEPEPVTLLVKSPNQRHRDLELSGDRGWSVGHLKAHLSRVYPERPRPEDQRLIYSG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 MESETEPEPVTLLVKSPNQRHRDLELSGDRGWSVGHLKAHLSRVYPERPRPEDQRLIYSG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KSD KLLLDHQCLRDLLPKQEKRHVLHLVCNVKSPSKMPEINAKVAESTEEPAGSNRGQYPEDS
::::::::::::::: ::::::::::::::::::::::::::::::::::::::::::::
CCDS45 KLLLDHQCLRDLLPK-EKRHVLHLVCNVKSPSKMPEINAKVAESTEEPAGSNRGQYPEDS
70 80 90 100 110
130 140 150 160 170 180
pF1KSD SSDGLRQREVLRNLSSPGWENISRPEAAQQAFQGLGPGFSGYTPYGWLQLSWFQQIYARQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 SSDGLRQREVLRNLSSPGWENISRPEAAQQAFQGLGPGFSGYTPYGWLQLSWFQQIYARQ
120 130 140 150 160 170
190 200 210 220 230 240
pF1KSD YYMQYLAATAASGAFVPPPSAQEIPVVSAPAPAPIHNQFPAENQPANQNAAPQVVVNPGA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 YYMQYLAATAASGAFVPPPSAQEIPVVSAPAPAPIHNQFPAENQPANQNAAPQVVVNPGA
180 190 200 210 220 230
250 260 270 280 290 300
pF1KSD NQNLRMNAQGGPIVEEDDEINRDWLDWTYSAATFSVFLSILYFYSSLSRFLMVMGATVVM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 NQNLRMNAQGGPIVEEDDEINRDWLDWTYSAATFSVFLSILYFYSSLSRFLMVMGATVVM
240 250 260 270 280 290
310 320 330 340 350 360
pF1KSD YLHHVGWFPFRPRPVQNFPNDGPPPDVVNQDPNNNLQEGTDPETEDPNHLPPDRDVLDGE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 YLHHVGWFPFRPRPVQNFPNDGPPPDVVNQDPNNNLQEGTDPETEDPNHLPPDRDVLDGE
300 310 320 330 340 350
370 380 390
pF1KSD QTSPSFMSTAWLVFKTFFASLLPEGPPAIAN
:::::::::::::::::::::::::::::::
CCDS45 QTSPSFMSTAWLVFKTFFASLLPEGPPAIAN
360 370 380 390
>>CCDS5446.1 HERPUD2 gene_id:64224|Hs108|chr7 (406 aa)
initn: 878 init1: 334 opt: 363 Z-score: 341.7 bits: 72.1 E(32554): 1e-12
Smith-Waterman score: 941; 41.0% identity (65.6% similar) in 410 aa overlap (9-391:9-406)
10 20 30 40 50 60
pF1KSD MESETEPEPVTLLVKSPNQRHRDLELSGDRGWSVGHLKAHLSRVYPERPRPEDQRLIYSG
::::..:.:::.. : .: .:.::.::.::: ::: .: .::::.:::
CCDS54 MDQSGMEIPVTLIIKAPNQKYSDQTISCFLNWTVGKLKTHLSNVYPSKPLTKDQRLVYSG
10 20 30 40 50 60
70 80 90 100 110
pF1KSD KLLLDHQCLRDLLPKQEKRHVLHLVCNVKSPSKMP---------EINAKVAESTEEPAGS
.:: :: :.:.: ::.. :..::::. ..: . : : :. ..:. . .::
CCDS54 RLLPDHLQLKDILRKQDEYHMVHLVCTSRTPPSSPKSSTNRESHEALASSSNSSSDHSGS
70 80 90 100 110 120
120 130 140 150
pF1KSD ---NRGQYPED----SSSDGLRQREVLRNLSSPGWENISRPEAAQ----QAFQGLG--PG
. :: . :::.::::: .: . .. .. . : . : . : : . ::
CCDS54 TTPSSGQETLSLAVGSSSEGLRQR-TLPQAQTDQAQSHQFPYVMQGNVDNQFPGQAAPPG
130 140 150 160 170
160 170 180 190 200 210
pF1KSD FSGYTPYGWLQLSWFQQIYARQYYMQYLAATAASGAFVPPPSAQEIPVVSAPAPAPIHNQ
: : .. ::. :.::.::.:::::: ::..:... :. :..: : .
CCDS54 FPVYPAFSPLQMLWWQQMYAHQYYMQYQAAVSAQATSNVNPTQ---PTTSQPLNLA---H
180 190 200 210 220 230
220 230 240 250 260 270
pF1KSD FPAENQPANQNAAPQVVVNPGANQNLRMNAQGGPIVEEDDEINRDWLDWTYSAATFSVFL
:.:. : : . : : :.:..:::::::...:.: .::::::: :. . ...:
CCDS54 VPGEEPPPAPNLVAQ--ENRPMNENVQMNAQGGPVLNEED-FNRDWLDWMYTFSRAAILL
240 250 260 270 280 290
280 290 300 310 320 330
pF1KSD SILYFYSSLSRFLMVMGATVVMYLHHVGWFPFRPRPV-QNFPNDGPPPDVVNQDPNNNLQ
::.:::::.:::.::::: ...:::..:::::: . :. ::.. .: :. : :
CCDS54 SIVYFYSSFSRFIMVMGAMLLVYLHQAGWFPFRQEGGHQQAPNNNA--EVNNDGQNANNL
300 310 320 330 340
340 350 360 370 380 390
pF1KSD EGTDPETEDPNHLPPDRDVLDGEQTS----PSFMSTAWLVFKTFFASLLPEGPPAIAN
: . : . : . ::..: :..:..:: . :::.::.::::: .::
CCDS54 ELEEMERLMDDGLEDESGEDGGEDASAIQRPGLMASAWSFITTFFTSLIPEGPPQVAN
350 360 370 380 390 400
391 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Wed Nov 2 23:26:19 2016 done: Wed Nov 2 23:26:20 2016
Total Scan time: 2.780 Total Display time: 0.010
Function used was FASTA [36.3.4 Apr, 2011]