FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8169, 332 aa
  1>>>pF1KB8169 332 - 332 aa - 332 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 6.6638+/-0.000664; mu= 11.3192+/- 0.040
 mean_var=110.9851+/-22.049, 0's: 0 Z-trim(114.2): 6 B-trim: 4 in 1/50
 Lambda= 0.121742
 statistics sampled from 14747 (14753) to 14747 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.793), E-opt: 0.2 (0.453), width: 16
 Scan time: 2.660

The best scores are:                                 opt  bits E(32554)
CCDS5984.1 NEIL2 gene_id:252969|Hs108|chr8   ( 332) 2329 419.0 2.6e-117
CCDS47803.1 NEIL2 gene_id:252969|Hs108|chr8  ( 271) 1916 346.4 1.5e-95
CCDS47802.1 NEIL2 gene_id:252969|Hs108|chr8  ( 216) 1206 221.6 4.4e-58

>>CCDS5984.1 NEIL2 gene_id:252969|Hs108|chr8 (332 aa)
 initn: 2329 init1: 2329 opt: 2329 Z-score: 2219.4 bits: 419.0 E(32554): 2.6e-117
Smith-Waterman score: 2329; 100.0% identity (100.0% similar) in 332 aa overlap (1-332:1-332)

               10        20        30        40        50        60
pF1KB8 MPEGPLVRKFHHLVSPFVGQQVVKTGGSSKKLQPASLQSLWLQDTQVHGKKLFLRFDLDE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 MPEGPLVRKFHHLVSPFVGQQVVKTGGSSKKLQPASLQSLWLQDTQVHGKKLFLRFDLDE
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 EMGPPGSSPTPEPPQKEVQKEGAADPKQVGEPSGQKTLDGSSRSAELVPQGEDDSEYLER
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 EMGPPGSSPTPEPPQKEVQKEGAADPKQVGEPSGQKTLDGSSRSAELVPQGEDDSEYLER
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 DAPAGDAGRWLRVSFGLFGSVWVNDFSRAKKANKRGDWRDPSPRLVLHFGGGGFLAFYNC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 DAPAGDAGRWLRVSFGLFGSVWVNDFSRAKKANKRGDWRDPSPRLVLHFGGGGFLAFYNC
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 QLSWSSSPVVTPTCDILSEKFHRGQALEALGQAQPVCYTLLDQRYFSGLGNIIKNEALYR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 QLSWSSSPVVTPTCDILSEKFHRGQALEALGQAQPVCYTLLDQRYFSGLGNIIKNEALYR
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB8 AGIHPLSLGSVLSASRREVLVDHVVEFSTAWLQGKFQGRPQHTQVYQKEQCPAGHQVMKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 AGIHPLSLGSVLSASRREVLVDHVVEFSTAWLQGKFQGRPQHTQVYQKEQCPAGHQVMKE
              250       260       270       280       290       300

              310       320       330
pF1KB8 AFGPEDGLQRLTWWCPQCQPQLSEEPEQCQFS
       ::::::::::::::::::::::::::::::::
CCDS59 AFGPEDGLQRLTWWCPQCQPQLSEEPEQCQFS
              310       320       330

>>CCDS47803.1 NEIL2 gene_id:252969|Hs108|chr8 (271 aa)
 initn: 1916 init1: 1916 opt: 1916 Z-score: 1828.7 bits: 346.4 E(32554): 1.5e-95
Smith-Waterman score: 1916; 100.0% identity (100.0% similar) in 271 aa overlap (62-332:1-271)

              40        50        60        70        80        90
pF1KB8 LQPASLQSLWLQDTQVHGKKLFLRFDLDEEMGPPGSSPTPEPPQKEVQKEGAADPKQVGE
                                     ::::::::::::::::::::::::::::::
CCDS47                               MGPPGSSPTPEPPQKEVQKEGAADPKQVGE
                                             10        20        30

             100       110       120       130       140       150
pF1KB8 PSGQKTLDGSSRSAELVPQGEDDSEYLERDAPAGDAGRWLRVSFGLFGSVWVNDFSRAKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 PSGQKTLDGSSRSAELVPQGEDDSEYLERDAPAGDAGRWLRVSFGLFGSVWVNDFSRAKK
               40        50        60        70        80        90

             160       170       180       190       200       210
pF1KB8 ANKRGDWRDPSPRLVLHFGGGGFLAFYNCQLSWSSSPVVTPTCDILSEKFHRGQALEALG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 ANKRGDWRDPSPRLVLHFGGGGFLAFYNCQLSWSSSPVVTPTCDILSEKFHRGQALEALG
              100       110       120       130       140       150

             220       230       240       250       260       270
pF1KB8 QAQPVCYTLLDQRYFSGLGNIIKNEALYRAGIHPLSLGSVLSASRREVLVDHVVEFSTAW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 QAQPVCYTLLDQRYFSGLGNIIKNEALYRAGIHPLSLGSVLSASRREVLVDHVVEFSTAW
              160       170       180       190       200       210

             280       290       300       310       320       330
pF1KB8 LQGKFQGRPQHTQVYQKEQCPAGHQVMKEAFGPEDGLQRLTWWCPQCQPQLSEEPEQCQF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 LQGKFQGRPQHTQVYQKEQCPAGHQVMKEAFGPEDGLQRLTWWCPQCQPQLSEEPEQCQF
              220       230       240       250       260       270

pF1KB8 S
       :
CCDS47 S

>>CCDS47802.1 NEIL2 gene_id:252969|Hs108|chr8 (216 aa)
 initn: 1201 init1: 1201 opt: 1206 Z-score: 1156.2 bits: 221.6 E(32554): 4.4e-58
Smith-Waterman score: 1271; 65.1% identity (65.1% similar) in 332 aa overlap (1-332:1-216)

               10        20        30        40        50        60
pF1KB8 MPEGPLVRKFHHLVSPFVGQQVVKTGGSSKKLQPASLQSLWLQDTQVHGKKLFLRFDLDE
       :::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 MPEGPLVRKFHHLVSPFVGQQVVKTGGSSKKLQPASLQSLWLQDTQV-------------
               10        20        30        40

               70        80        90       100       110       120
pF1KB8 EMGPPGSSPTPEPPQKEVQKEGAADPKQVGEPSGQKTLDGSSRSAELVPQGEDDSEYLER

CCDS47 ------------------------------------------------------------


              130       140       150       160       170       180
pF1KB8 DAPAGDAGRWLRVSFGLFGSVWVNDFSRAKKANKRGDWRDPSPRLVLHFGGGGFLAFYNC
                                                  :::::::::::::::::
CCDS47 -------------------------------------------RLVLHFGGGGFLAFYNC
                                                   50        60

              190       200       210       220       230       240
pF1KB8 QLSWSSSPVVTPTCDILSEKFHRGQALEALGQAQPVCYTLLDQRYFSGLGNIIKNEALYR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 QLSWSSSPVVTPTCDILSEKFHRGQALEALGQAQPVCYTLLDQRYFSGLGNIIKNEALYR
           70        80        90       100       110       120

              250       260       270       280       290       300
pF1KB8 AGIHPLSLGSVLSASRREVLVDHVVEFSTAWLQGKFQGRPQHTQVYQKEQCPAGHQVMKE
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 AGIHPLSLGSVLSASRREVLVDHVVEFSTAWLQGKFQGRPQHTQVYQKEQCPAGHQVMKE
          130       140       150       160       170       180

              310       320       330
pF1KB8 AFGPEDGLQRLTWWCPQCQPQLSEEPEQCQFS
       ::::::::::::::::::::::::::::::::
CCDS47 AFGPEDGLQRLTWWCPQCQPQLSEEPEQCQFS
          190       200       210

332 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 10:11:39 2016 done: Fri Nov 4 10:11:39 2016
 Total Scan time: 2.660 Total Display time: 0.000

Function used was FASTA [36.3.4 Apr, 2011]