FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9805, 225 aa
1>>>pF1KB9805 225 - 225 aa - 225 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.4397+/-0.000702; mu= 13.1053+/- 0.043
mean_var=159.8311+/-31.815, 0's: 0 Z-trim(116.1): 35 B-trim: 0 in 0/54
Lambda= 0.101448
statistics sampled from 16686 (16721) to 16686 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.816), E-opt: 0.2 (0.514), width: 16
Scan time: 2.670
The best scores are: opt bits E(32554)
CCDS33507.2 BHLHE23 gene_id:128408|Hs108|chr20 ( 241) 1512 231.7 3.1e-61
CCDS6179.1 BHLHE22 gene_id:27319|Hs108|chr8 ( 381) 487 81.9 6e-16
>>CCDS33507.2 BHLHE23 gene_id:128408|Hs108|chr20 (241 aa)
initn: 1512 init1: 1512 opt: 1512 Z-score: 1212.9 bits: 231.7 E(32554): 3.1e-61
Smith-Waterman score: 1512; 100.0% identity (100.0% similar) in 225 aa overlap (1-225:17-241)
10 20 30 40
pF1KB9 MAELKSLSGDAYLALSHGYAAAAAGLAYGAAREPEAARGYGTPG
::::::::::::::::::::::::::::::::::::::::::::
CCDS33 MSIRPPGEPPSPGGAAMAELKSLSGDAYLALSHGYAAAAAGLAYGAAREPEAARGYGTPG
10 20 30 40 50 60
50 60 70 80 90 100
pF1KB9 PGGDLPAAPAPRAPAQAAESSGEQSGDEDDAFEQRRRRRGPGSAADGRRRPREQRSLRLS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 PGGDLPAAPAPRAPAQAAESSGEQSGDEDDAFEQRRRRRGPGSAADGRRRPREQRSLRLS
70 80 90 100 110 120
110 120 130 140 150 160
pF1KB9 INARERRRMHDLNDALDGLRAVIPYAHSPSVRKLSKIATLLLAKNYILMQAQALDEMRRL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 INARERRRMHDLNDALDGLRAVIPYAHSPSVRKLSKIATLLLAKNYILMQAQALDEMRRL
130 140 150 160 170 180
170 180 190 200 210 220
pF1KB9 VAFLNQGQGLAAPVNAAPLTPFGQATVCPFSAGAALGPCPDKCAAFSGTPSALCKHCHEK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS33 VAFLNQGQGLAAPVNAAPLTPFGQATVCPFSAGAALGPCPDKCAAFSGTPSALCKHCHEK
190 200 210 220 230 240
pF1KB9 P
:
CCDS33 P
>>CCDS6179.1 BHLHE22 gene_id:27319|Hs108|chr8 (381 aa)
initn: 618 init1: 466 opt: 487 Z-score: 399.8 bits: 81.9 E(32554): 6e-16
Smith-Waterman score: 599; 50.2% identity (68.5% similar) in 235 aa overlap (9-225:154-381)
10 20 30
pF1KB9 MAELKSLSGDAYLALSHGYAA--AAAGLAYGAAREPEA
: :.: : : :. : . :.:. :.
CCDS61 ALCLKYGESASRGSVAESSGGEQSPDDDSDGRCELVLRAGVADPRASPGAGGGGAKAAEG
130 140 150 160 170 180
40 50 60 70 80 90
pF1KB9 ARGYGTPGPGGDLPAAPAPRAPAQAAESSGEQSGDEDDAFEQRRRRRGPGSAADGRRRPR
. : :...: :. . . .. ::. .:: . . .:.... .. .
CCDS61 CSNAHLHG-GASVP--PGGLGGGGGGGSSSGSSGGGGGS--GSGSGGSSSSSSSSSKKSK
190 200 210 220 230
100 110 120 130 140 150
pF1KB9 EQRSLRLSINARERRRMHDLNDALDGLRAVIPYAHSPSVRKLSKIATLLLAKNYILMQAQ
::..:::.::::::::::::::::: ::::::::::::::::::::::::::::::::::
CCDS61 EQKALRLNINARERRRMHDLNDALDELRAVIPYAHSPSVRKLSKIATLLLAKNYILMQAQ
240 250 260 270 280 290
160 170 180 190 200
pF1KB9 ALDEMRRLVAFLNQGQGL---------AAPVNAAPLTP----FGQATVCPFSAGAALGP-
::.:::::::.:::::.. :: . :: : : . ::. ::::: : :
CCDS61 ALEEMRRLVAYLNQGQAISAASLPSSAAAAAAAAALHPALGAYEQAAGYPFSAG--LPPA
300 310 320 330 340 350
210 220
pF1KB9 --CPDKCAAFSGTPSALCKHCHEKP
::.::: :... :.:::.: :::
CCDS61 ASCPEKCALFNSVSSSLCKQCTEKP
360 370 380
225 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 19:07:04 2016 done: Fri Nov 4 19:07:04 2016
Total Scan time: 2.670 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]