FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB9829, 340 aa 1>>>pF1KB9829 340 - 340 aa - 340 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 8.2485+/-0.000719; mu= 4.0679+/- 0.043 mean_var=169.9488+/-35.360, 0's: 0 Z-trim(115.9): 4 B-trim: 0 in 0/51 Lambda= 0.098382 statistics sampled from 16464 (16468) to 16464 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.817), E-opt: 0.2 (0.506), width: 16 Scan time: 3.330 The best scores are: opt bits E(32554) CCDS73120.1 ZNF488 gene_id:118738|Hs108|chr10 ( 340) 2365 346.9 1.4e-95 CCDS43243.1 PRDM8 gene_id:56978|Hs108|chr4 ( 689) 497 82.0 1.6e-15 >>CCDS73120.1 ZNF488 gene_id:118738|Hs108|chr10 (340 aa) initn: 2365 init1: 2365 opt: 2365 Z-score: 1829.5 bits: 346.9 E(32554): 1.4e-95 Smith-Waterman score: 2365; 100.0% identity (100.0% similar) in 340 aa overlap (1-340:1-340) 10 20 30 40 50 60 pF1KB9 MPEWPPCLSVAPALVITMAAGKGAPLSPSAENRWRLSEPELGRGCKPVLLEKTNRLGPEA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 MPEWPPCLSVAPALVITMAAGKGAPLSPSAENRWRLSEPELGRGCKPVLLEKTNRLGPEA 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB9 AVGRAGRDVGSAELALLVAPGKPRPGKPLPPKTRGEQRQSAFTELPRMKDRQVDAQAQER :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 AVGRAGRDVGSAELALLVAPGKPRPGKPLPPKTRGEQRQSAFTELPRMKDRQVDAQAQER 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB9 EHDDPTGQPGAPQLTQNIPRGPAGSKVFSVWPSGARSEQRSAFSKPTKRPAERPELTSVF :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 EHDDPTGQPGAPQLTQNIPRGPAGSKVFSVWPSGARSEQRSAFSKPTKRPAERPELTSVF 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB9 PAGESADALGELSGLLNTTDLACWGRLSTPKLLVGDLWNLQALPQNAPLCSTFLGAPTLW :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 PAGESADALGELSGLLNTTDLACWGRLSTPKLLVGDLWNLQALPQNAPLCSTFLGAPTLW 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB9 LEHTQAQVPPPSSSSTTSWALLPPTLTSLGLSTQNWCAKCNLSFRLTSDLVFHMRSHHKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS73 LEHTQAQVPPPSSSSTTSWALLPPTLTSLGLSTQNWCAKCNLSFRLTSDLVFHMRSHHKK 250 260 270 280 290 300 310 320 330 340 pF1KB9 EHAGPDPHSQKRREEALACPVCQEHFRERHHLSRHMTSHS :::::::::::::::::::::::::::::::::::::::: CCDS73 EHAGPDPHSQKRREEALACPVCQEHFRERHHLSRHMTSHS 310 320 330 340 >>CCDS43243.1 PRDM8 gene_id:56978|Hs108|chr4 (689 aa) initn: 465 init1: 249 opt: 497 Z-score: 392.1 bits: 82.0 E(32554): 1.6e-15 Smith-Waterman score: 515; 35.6% identity (55.7% similar) in 343 aa overlap (24-340:372-689) 10 20 30 40 50 pF1KB9 MPEWPPCLSVAPALVITMAAGKGAPLSPSAENRWRLSEPELGRGCKPVLLEK- :: :: . . : . :. .. . . :.. CCDS43 PASKEDLVCTPQQYRASGSYFGLEENGRLFAPPSPETGEAKRSAFVEVKKAARAASLQEE 350 360 370 380 390 400 60 70 80 90 100 pF1KB9 --TNRLGPEAAVGRAGRDVGSAELALLVAPGK-----PRPGKPLPPKTRG--EQRQSAFT .. : . :: ::. : : :::: ::: . .: : :::: CCDS43 GTADGAGVASEDQDAGGGGGSSTPAAASPVGAEKLLAPRPGGPLPSRLEGGSPARGSAFT 410 420 430 440 450 460 110 120 130 140 150 160 pF1KB9 ELPRMKDRQVDAQAQEREHDDPTGQPGAPQLTQNIPRGPAGSKVFSVWPSGARSEQR-SA .:.. ..: :: .: : ::. .:: :..: :: CCDS43 SVPQL------GSAGSTSGGGGTGAGAA---------GGAGG------GQGAASDERKSA 470 480 490 500 170 180 190 200 210 pF1KB9 FSKPTKRPAE-RP-----ELTSVFPAGESADALGELSGLLNTTD-LACWGRLSTPKLLVG ::.:.. .. : .: .. : . ::..: ..: :: .:. : : CCDS43 FSQPARSFSQLSPLVLGQKLGALEPC-HPADGVGPTRLYPAAADPLAV--KLQGAADLNG 510 520 530 540 550 220 230 240 250 260 pF1KB9 DLWNLQALPQNAPLCSTFLGAPTLWLEHTQAQVPPPSSSST--------TSWALLPPTLT .: . . : : :: : ..: . . : . ..... .. .::::..: CCDS43 GCGSLPSGGGGLPKQSPFLYATAFWPKSSAAAAAAAAAAAAGPLQLQLPSALTLLPPSFT 560 570 580 590 600 610 270 280 290 300 310 320 pF1KB9 SLGLSTQNWCAKCNLSFRLTSDLVFHMRSHHKKEHAGPDPHSQKRREEALACPVCQEHFR :: : .:::::::: :::.:::::.:::::::::.: .: ..:::: : ::.:.: :: CCDS43 SLCLPAQNWCAKCNASFRMTSDLVYHMRSHHKKEYAM-EPLVKRRREEKLKCPICNESFR 620 630 640 650 660 670 330 340 pF1KB9 ERHHLSRHMTSHS ::::::::::::. CCDS43 ERHHLSRHMTSHN 680 340 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 19:32:09 2016 done: Fri Nov 4 19:32:09 2016 Total Scan time: 3.330 Total Display time: -0.030 Function used was FASTA [36.3.4 Apr, 2011]