FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 Query: pF1KB6396, 304 aa 1>>>pF1KB6396 304 - 304 aa - 304 aa Library: human.CCDS.faa 18511270 residues in 32554 sequences Statistics: Expectation_n fit: rho(ln(x))= 7.0597+/-0.000684; mu= 8.7209+/- 0.042 mean_var=113.9448+/-22.511, 0's: 0 Z-trim(114.0): 6 B-trim: 0 in 0/51 Lambda= 0.120151 statistics sampled from 14605 (14609) to 14605 sequences Algorithm: FASTA (3.7 Nov 2010) [optimized] Parameters: BL50 matrix (15:-5), open/ext: -10/-2 ktup: 2, E-join: 1 (0.789), E-opt: 0.2 (0.449), width: 16 Scan time: 2.990 The best scores are: opt bits E(32554) CCDS33814.1 DPPA4 gene_id:55211|Hs108|chr3 ( 304) 2047 364.8 4.6e-101 CCDS2956.1 DPPA2 gene_id:151871|Hs108|chr3 ( 298) 574 109.5 3.3e-24 >>CCDS33814.1 DPPA4 gene_id:55211|Hs108|chr3 (304 aa) initn: 2047 init1: 2047 opt: 2047 Z-score: 1927.9 bits: 364.8 E(32554): 4.6e-101 Smith-Waterman score: 2047; 99.7% identity (100.0% similar) in 304 aa overlap (1-304:1-304) 10 20 30 40 50 60 pF1KB6 MLRGSASSTSMEKAKGKEWTSTEKSREEDQQASNQPNSIALPGTSAKRTKEKMSVKGSKV ::::::::::::::::::::::::::::::::::::::::::::::::::::::.::::: CCDS33 MLRGSASSTSMEKAKGKEWTSTEKSREEDQQASNQPNSIALPGTSAKRTKEKMSIKGSKV 10 20 30 40 50 60 70 80 90 100 110 120 pF1KB6 LCPKKKAEHTDNPRPQKKIPIPPLPSKLPPVNLIHRDILRAWCQQLKLSSKGQKLDAYKR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 LCPKKKAEHTDNPRPQKKIPIPPLPSKLPPVNLIHRDILRAWCQQLKLSSKGQKLDAYKR 70 80 90 100 110 120 130 140 150 160 170 180 pF1KB6 LCAFAYPNQKDFPSTAKEAKIRKSLQKKLKVEKGETSLQSSETHPPEVALPPVGEPPALE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 LCAFAYPNQKDFPSTAKEAKIRKSLQKKLKVEKGETSLQSSETHPPEVALPPVGEPPALE 130 140 150 160 170 180 190 200 210 220 230 240 pF1KB6 NSTALLEGVNTVVVTTSAPEALLASWARISARARTPEAVESPQEASGVRWCVVHGKSLPA :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 NSTALLEGVNTVVVTTSAPEALLASWARISARARTPEAVESPQEASGVRWCVVHGKSLPA 190 200 210 220 230 240 250 260 270 280 290 300 pF1KB6 DTDGWVHLQFHAGQAWVPEKQEGRVSALFLLPASNFPPPHLEDNMLCPKCVHRNKVLIKS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS33 DTDGWVHLQFHAGQAWVPEKQEGRVSALFLLPASNFPPPHLEDNMLCPKCVHRNKVLIKS 250 260 270 280 290 300 pF1KB6 LQWE :::: CCDS33 LQWE >>CCDS2956.1 DPPA2 gene_id:151871|Hs108|chr3 (298 aa) initn: 691 init1: 222 opt: 574 Z-score: 548.1 bits: 109.5 E(32554): 3.3e-24 Smith-Waterman score: 693; 49.8% identity (67.2% similar) in 241 aa overlap (75-301:73-293) 50 60 70 80 90 100 pF1KB6 SAKRTKEKMSVKGSKVLCPKKKAEHTDNPRPQK---KIPIPPLPSKLPPVNLIHRDILRA ::: ::: :::. :::.: . :: :: CCDS29 SVSSTSDVKLEKPKKYNPGHLLQTNEQFTAPQKARCKIPALPLPTILPPINKVCRDTLRD 50 60 70 80 90 100 110 120 130 140 150 160 pF1KB6 WCQQLKLSSKGQKLDAYKRLCAFAYPNQK-DFPSTAKEAKIRKSLQKKLKVEKGETSLQS ::::: ::..:.:...: :: :::.:. :.: ..:..... .:. : : .. :: CCDS29 WCQQLGLSTNGKKIEVYLRLHRHAYPEQRQDMPEMSQETRLQRCSRKRKAVTK-RARLQR 110 120 130 140 150 160 170 180 190 200 210 220 pF1KB6 SETHPPEVALPPVGEPPALENSTALLEGVNTVVVTTSAPEALLASWARISARARTPEAVE : : : : .::: : :::: :.:::::::.::: :.:.. CCDS29 SYEM----------------NERA--EETNTVEVITSAPGAMLASWARIAARAVQPKALN 170 180 190 200 230 240 250 260 270 pF1KB6 S---P-------QEASGVRWCVVHGKSLPADTDGWVHLQFHAGQAWVPEKQEGRVSALFL : : ..:::::::::::. : ::: :::.::::::::::: .. :. .::: CCDS29 SCSIPVSVEAFLMQASGVRWCVVHGRLLSADTKGWVRLQFHAGQAWVPTTHR-RMISLFL 210 220 230 240 250 260 280 290 300 pF1KB6 LPASNFPPPHLEDNMLCPKCVHRNKVLIKSLQWE ::: :: : .::::::: :..::: ..: : CCDS29 LPACIFPSPGIEDNMLCPDCAKRNKKMMKRLMTVEK 270 280 290 304 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 17:06:11 2016 done: Fri Nov 4 17:06:11 2016 Total Scan time: 2.990 Total Display time: -0.010 Function used was FASTA [36.3.4 Apr, 2011]