FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB9829, 340 aa
1>>>pF1KB9829 340 - 340 aa - 340 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 8.2485+/-0.000719; mu= 4.0679+/- 0.043
mean_var=169.9488+/-35.360, 0's: 0 Z-trim(115.9): 4 B-trim: 0 in 0/51
Lambda= 0.098382
statistics sampled from 16464 (16468) to 16464 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.817), E-opt: 0.2 (0.506), width: 16
Scan time: 3.330
The best scores are:                                  opt bits E(32554)
CCDS73120.1 ZNF488 gene_id:118738|Hs108|chr10 ( 340) 2365 346.9 1.4e-95
CCDS43243.1 PRDM8 gene_id:56978|Hs108|chr4    ( 689)  497  82.0 1.6e-15
>>CCDS73120.1 ZNF488 gene_id:118738|Hs108|chr10 (340 aa)
initn: 2365 init1: 2365 opt: 2365 Z-score: 1829.5 bits: 346.9 E(32554): 1.4e-95
Smith-Waterman score: 2365; 100.0% identity (100.0% similar) in 340 aa overlap (1-340:1-340)
               10        20        30        40        50        60
pF1KB9 MPEWPPCLSVAPALVITMAAGKGAPLSPSAENRWRLSEPELGRGCKPVLLEKTNRLGPEA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 MPEWPPCLSVAPALVITMAAGKGAPLSPSAENRWRLSEPELGRGCKPVLLEKTNRLGPEA
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB9 AVGRAGRDVGSAELALLVAPGKPRPGKPLPPKTRGEQRQSAFTELPRMKDRQVDAQAQER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 AVGRAGRDVGSAELALLVAPGKPRPGKPLPPKTRGEQRQSAFTELPRMKDRQVDAQAQER
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB9 EHDDPTGQPGAPQLTQNIPRGPAGSKVFSVWPSGARSEQRSAFSKPTKRPAERPELTSVF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 EHDDPTGQPGAPQLTQNIPRGPAGSKVFSVWPSGARSEQRSAFSKPTKRPAERPELTSVF
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB9 PAGESADALGELSGLLNTTDLACWGRLSTPKLLVGDLWNLQALPQNAPLCSTFLGAPTLW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 PAGESADALGELSGLLNTTDLACWGRLSTPKLLVGDLWNLQALPQNAPLCSTFLGAPTLW
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB9 LEHTQAQVPPPSSSSTTSWALLPPTLTSLGLSTQNWCAKCNLSFRLTSDLVFHMRSHHKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 LEHTQAQVPPPSSSSTTSWALLPPTLTSLGLSTQNWCAKCNLSFRLTSDLVFHMRSHHKK
              250       260       270       280       290       300

              310       320       330       340
pF1KB9 EHAGPDPHSQKRREEALACPVCQEHFRERHHLSRHMTSHS
       ::::::::::::::::::::::::::::::::::::::::
CCDS73 EHAGPDPHSQKRREEALACPVCQEHFRERHHLSRHMTSHS
              310       320       330       340
>>CCDS43243.1 PRDM8 gene_id:56978|Hs108|chr4 (689 aa)
initn: 465 init1: 249 opt: 497 Z-score: 392.1 bits: 82.0 E(32554): 1.6e-15
Smith-Waterman score: 515; 35.6% identity (55.7% similar) in 343 aa overlap (24-340:372-689)
10 20 30 40 50
pF1KB9 MPEWPPCLSVAPALVITMAAGKGAPLSPSAENRWRLSEPELGRGCKPVLLEK-
:: :: . . : . :. .. . . :..
CCDS43 PASKEDLVCTPQQYRASGSYFGLEENGRLFAPPSPETGEAKRSAFVEVKKAARAASLQEE
350 360 370 380 390 400
60 70 80 90 100
pF1KB9 --TNRLGPEAAVGRAGRDVGSAELALLVAPGK-----PRPGKPLPPKTRG--EQRQSAFT
.. : . :: ::. : : :::: ::: . .: : ::::
CCDS43 GTADGAGVASEDQDAGGGGGSSTPAAASPVGAEKLLAPRPGGPLPSRLEGGSPARGSAFT
410 420 430 440 450 460
110 120 130 140 150 160
pF1KB9 ELPRMKDRQVDAQAQEREHDDPTGQPGAPQLTQNIPRGPAGSKVFSVWPSGARSEQR-SA
.:.. ..: :: .: : ::. .:: :..: ::
CCDS43 SVPQL------GSAGSTSGGGGTGAGAA---------GGAGG------GQGAASDERKSA
470 480 490 500
170 180 190 200 210
pF1KB9 FSKPTKRPAE-RP-----ELTSVFPAGESADALGELSGLLNTTD-LACWGRLSTPKLLVG
::.:.. .. : .: .. : . ::..: ..: :: .:. : :
CCDS43 FSQPARSFSQLSPLVLGQKLGALEPC-HPADGVGPTRLYPAAADPLAV--KLQGAADLNG
510 520 530 540 550
220 230 240 250 260
pF1KB9 DLWNLQALPQNAPLCSTFLGAPTLWLEHTQAQVPPPSSSST--------TSWALLPPTLT
.: . . : : :: : ..: . . : . ..... .. .::::..:
CCDS43 GCGSLPSGGGGLPKQSPFLYATAFWPKSSAAAAAAAAAAAAGPLQLQLPSALTLLPPSFT
560 570 580 590 600 610
270 280 290 300 310 320
pF1KB9 SLGLSTQNWCAKCNLSFRLTSDLVFHMRSHHKKEHAGPDPHSQKRREEALACPVCQEHFR
:: : .:::::::: :::.:::::.:::::::::.: .: ..:::: : ::.:.: ::
CCDS43 SLCLPAQNWCAKCNASFRMTSDLVYHMRSHHKKEYAM-EPLVKRRREEKLKCPICNESFR
620 630 640 650 660 670
330 340
pF1KB9 ERHHLSRHMTSHS
::::::::::::.
CCDS43 ERHHLSRHMTSHN
680
340 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 19:32:09 2016 done: Fri Nov 4 19:32:09 2016
Total Scan time: 3.330 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]