FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9682, 149 aa
1>>>pF1KB9682 149 - 149 aa - 149 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.3364+/-0.000626; mu= 12.0495+/- 0.038
mean_var=67.7534+/-13.502, 0's: 0 Z-trim(111.4): 29 B-trim: 0 in 0/53
Lambda= 0.155815
statistics sampled from 12349 (12373) to 12349 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.755), E-opt: 0.2 (0.38), width: 16
Scan time: 1.690
The best scores are:                               opt  bits E(32554)
CCDS13186.1 ID1 gene_id:3397|Hs108|chr20   ( 149)  950 221.5 1.5e-58
CCDS13185.1 ID1 gene_id:3397|Hs108|chr20   ( 155)  903 211.0 2.4e-55
CCDS1659.1 ID2 gene_id:3398|Hs108|chr2     ( 134)  286  72.2 1.2e-13
CCDS237.1 ID3 gene_id:3399|Hs108|chr1      ( 119)  275  69.7 5.9e-13
CCDS4544.1 ID4 gene_id:3400|Hs108|chr6     ( 161)  257  65.7 1.3e-11
>>CCDS13186.1 ID1 gene_id:3397|Hs108|chr20 (149 aa)
initn: 950 init1: 950 opt: 950 Z-score: 1164.6 bits: 221.5 E(32554): 1.5e-58
Smith-Waterman score: 950; 100.0% identity (100.0% similar) in 149 aa overlap (1-149:1-149)
               10        20        30        40        50        60
pF1KB9 MKVASGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISRCAGGAGARLPALLDEQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MKVASGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISRCAGGAGARLPALLDEQ
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB9 QVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSESEVGTPGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 QVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSESEVGTPGG
               70        80        90       100       110       120
              130       140
pF1KB9 RGLPVRAPLSTLNGEISALTAEVRSRSDH
       :::::::::::::::::::::::::::::
CCDS13 RGLPVRAPLSTLNGEISALTAEVRSRSDH
              130       140
>>CCDS13185.1 ID1 gene_id:3397|Hs108|chr20 (155 aa)
initn: 903 init1: 903 opt: 903 Z-score: 1107.2 bits: 211.0 E(32554): 2.4e-55
Smith-Waterman score: 903; 100.0% identity (100.0% similar) in 142 aa overlap (1-142:1-142)
               10        20        30        40        50        60
pF1KB9 MKVASGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISRCAGGAGARLPALLDEQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 MKVASGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISRCAGGAGARLPALLDEQ
               10        20        30        40        50        60
               70        80        90       100       110       120
pF1KB9 QVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSESEVGTPGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS13 QVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSESEVGTPGG
               70        80        90       100       110       120
              130       140
pF1KB9 RGLPVRAPLSTLNGEISALTAEVRSRSDH
       ::::::::::::::::::::::
CCDS13 RGLPVRAPLSTLNGEISALTAEAACVPADDRILCR
              130       140       150
>>CCDS1659.1 ID2 gene_id:3398|Hs108|chr2 (134 aa)
initn: 307 init1: 249 opt: 286 Z-score: 358.6 bits: 72.2 E(32554): 1.2e-13
Smith-Waterman score: 293; 49.5% identity (73.0% similar) in 111 aa overlap (35-140:15-114)
10 20 30 40 50 60
pF1KB9 SGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISRCAGGAGARLPALLDEQQVNV
::..:..::: .. : .:. .
CCDS16 MKAFSPVRSVRKNSLSDHSLGISR------SKTP--VDDPM--S
10 20 30
70 80 90 100 110
pF1KB9 LLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSESEVGT-----PG
:::.:: :::.::::::..:::.::::.:::::::::: :::. :.:. . . ::
CCDS16 LLYNMNDCYSKLKELVPSIPQNKKVSKMEILQHVIDYILDLQIALDSHPTIVSLHHQRPG
40 50 60 70 80 90
120 130 140
pF1KB9 GRGLPVRAPLSTLNGEISALTAEVRSRSDH
.. :.::.::: .:: :.
CCDS16 -QNQASRTPLTTLNTDISILSLQASEFPSELMSNDSKALCG
100 110 120 130
>>CCDS237.1 ID3 gene_id:3399|Hs108|chr1 (119 aa)
initn: 248 init1: 192 opt: 275 Z-score: 346.0 bits: 69.7 E(32554): 5.9e-13
Smith-Waterman score: 276; 49.1% identity (69.1% similar) in 110 aa overlap (19-126:1-100)
10 20 30 40 50
pF1KB9 MKVASGSTATAAAGPSCALKAGKTASGAGEVVRCLSEQSVAISRCAG-GAGARLP-ALLD
.:: . . : :.: ::::.:.::.: : : .:. : .:::
CCDS23 MKALSPVRGCYEAVCCLSERSLAIARGRGKGPAAEEPLSLLD
10 20 30 40
60 70 80 90 100 110
pF1KB9 EQQVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSESEVGTP
::: :::::.:::: .:.. ..:.:::::.::::: :::. : .: : :
CCDS23 ---------DMNHCYSRLRELVPGVPRGTQLSQVEILQRVIDYILDLQVVL-AEPAPGPP
50 60 70 80 90
120 130 140
pF1KB9 GGRGLPVRAPLSTLNGEISALTAEVRSRSDH
: ::..
CCDS23 DGPHLPIQTAELTPELVISNDKRSFCH
100 110
>>CCDS4544.1 ID4 gene_id:3400|Hs108|chr6 (161 aa)
initn: 300 init1: 227 opt: 257 Z-score: 322.2 bits: 65.7 E(32554): 1.3e-11
Smith-Waterman score: 266; 42.5% identity (64.9% similar) in 134 aa overlap (20-133:12-144)
10 20 30 40 50
pF1KB9 MKVASGSTATAAAGPSCALKAGKTASGAGEV-VRCLSEQSVAI----SRCAGGAGARLPA
. . .. :.::. .:::.:.. .. . :..:.:: :
CCDS45 MKAVSPVRPSGRKAPSGCGGGELALRCLAEHGHSLGGSAAAAAAAAAARCKA
10 20 30 40 50
60 70 80 90 100 110
pF1KB9 L---LDEQQVNVLLYDMNGCYSRLKELVPTLPQNRKVSKVEILQHVIDYIRDLQLELNSE
:: . : ::: :::::..::::.: :.::::::::::::::: :::: :...
CCDS45 AEAAADEPAL-CLQCDMNDCYSRLRRLVPTIPPNKKVSKVEILQHVIDYILDLQLALETH
60 70 80 90 100 110
120 130 140
pF1KB9 SEV----------GTPGGR--GLPVRAPLSTLNGEISALTAEVRSRSDH
. :.: . : :.::..::
CCDS45 PALLRQPPPPAPPHHPAGTCPAAPPRTPLTALNTDPAGAVNKQGDSILCR
120 130 140 150 160
149 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 18:17:44 2016 done: Fri Nov 4 18:17:44 2016
Total Scan time: 1.690 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]