FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB7525, 150 aa
  1>>>pF1KB7525 150 - 150 aa - 150 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 5.1832+/-0.000805; mu= 11.4050+/- 0.048
 mean_var=48.9573+/- 9.806, 0's: 0 Z-trim(105.2): 10 B-trim: 0 in 0/52
 Lambda= 0.183301
 statistics sampled from 8271 (8276) to 8271 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.65), E-opt: 0.2 (0.254), width: 16
 Scan time: 1.670

The best scores are:                                      opt  bits E(32554)
CCDS3264.1 POLR2H gene_id:5437|Hs108|chr3          ( 150)  987 268.4 1.2e-72
CCDS63861.1 POLR2H gene_id:5437|Hs108|chr3         ( 114)  742 203.6 2.9e-53
CCDS63859.1 POLR2H gene_id:5437|Hs108|chr3         ( 175)  742 203.6 4.3e-53
CCDS63860.1 POLR2H gene_id:5437|Hs108|chr3         ( 122)  582 161.3 1.7e-40
CCDS63862.1 POLR2H gene_id:5437|Hs108|chr3         (  86)  337  96.5 3.9e-21

>>CCDS3264.1 POLR2H gene_id:5437|Hs108|chr3             (150 aa)
 initn: 987 init1: 987 opt: 987 Z-score: 1417.9 bits: 268.4 E(32554): 1.2e-72
Smith-Waterman score: 987; 100.0% identity (100.0% similar) in 150 aa overlap (1-150:1-150)

               10        20        30        40        50        60
pF1KB7 MAGILFEDIFDVKDIDPEGKKFDRVSRLHCESESFKMDLILDVNIQIYPVDLGDKFRLVI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 MAGILFEDIFDVKDIDPEGKKFDRVSRLHCESESFKMDLILDVNIQIYPVDLGDKFRLVI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 ASTLYEDGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGDETSTEAATRLSAYVSYGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 ASTLYEDGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGDETSTEAATRLSAYVSYGG
               70        80        90       100       110       120

              130       140       150
pF1KB7 LLMRLQGDANNLHGFEVDSRVYLLMKKLAF
       ::::::::::::::::::::::::::::::
CCDS32 LLMRLQGDANNLHGFEVDSRVYLLMKKLAF
              130       140       150

>>CCDS63861.1 POLR2H gene_id:5437|Hs108|chr3            (114 aa)
 initn: 742 init1: 742 opt: 742 Z-score: 1069.8 bits: 203.6 E(32554): 2.9e-53
Smith-Waterman score: 742; 100.0% identity (100.0% similar) in 114 aa overlap (37-150:1-114)

         10        20        30        40        50        60
pF1KB7 EDIFDVKDIDPEGKKFDRVSRLHCESESFKMDLILDVNIQIYPVDLGDKFRLVIASTLYE
                                     ::::::::::::::::::::::::::::::
CCDS63                               MDLILDVNIQIYPVDLGDKFRLVIASTLYE
                                             10        20        30

         70        80        90       100       110       120
pF1KB7 DGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGDETSTEAATRLSAYVSYGGLLMRLQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 DGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGDETSTEAATRLSAYVSYGGLLMRLQ
               40        50        60        70        80        90

        130       140       150
pF1KB7 GDANNLHGFEVDSRVYLLMKKLAF
       ::::::::::::::::::::::::
CCDS63 GDANNLHGFEVDSRVYLLMKKLAF
              100       110

>>CCDS63859.1 POLR2H gene_id:5437|Hs108|chr3            (175 aa)
 initn: 755 init1: 742 opt: 742 Z-score: 1066.6 bits: 203.6 E(32554): 4.3e-53
Smith-Waterman score: 742; 100.0% identity (100.0% similar) in 112 aa overlap (1-112:1-112)

               10        20        30        40        50        60
pF1KB7 MAGILFEDIFDVKDIDPEGKKFDRVSRLHCESESFKMDLILDVNIQIYPVDLGDKFRLVI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 MAGILFEDIFDVKDIDPEGKKFDRVSRLHCESESFKMDLILDVNIQIYPVDLGDKFRLVI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 ASTLYEDGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGDETSTEAATRLSAYVSYGG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 ASTLYEDGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGDETSTEAATRLLRLRAAEW
               70        80        90       100       110       120

              130       140       150
pF1KB7 LLMRLQGDANNLHGFEVDSRVYLLMKKLAF

CCDS63 QCSRITGWGLLFQLCVRVLWGPAHEAAGGCQQPAWIRGGLQSLSPDEEASLLNLA
              130       140       150       160       170

>>CCDS63860.1 POLR2H gene_id:5437|Hs108|chr3            (122 aa)
 initn: 789 init1: 558 opt: 582 Z-score: 840.6 bits: 161.3 E(32554): 1.7e-40
Smith-Waterman score: 736; 80.7% identity (80.7% similar) in 150 aa overlap (1-150:1-122)

               10        20        30        40        50        60
pF1KB7 MAGILFEDIFDVKDIDPEGKKFDRVSRLHCESESFKMDLILDVNIQIYPVDLGDKFRLVI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 MAGILFEDIFDVKDIDPEGKKFDRVSRLHCESESFKMDLILDVNIQIYPVDLGDKFRLVI
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB7 ASTLYEDGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGDETSTEAATRLSAYVSYGG
       :::::::::::::::::::::::                             ::::::::
CCDS63 ASTLYEDGTLDDGEYNPTDDRPSS----------------------------SAYVSYGG
               70        80                                    90

              130       140       150
pF1KB7 LLMRLQGDANNLHGFEVDSRVYLLMKKLAF
       ::::::::::::::::::::::::::::::
CCDS63 LLMRLQGDANNLHGFEVDSRVYLLMKKLAF
            100       110       120

>>CCDS63862.1 POLR2H gene_id:5437|Hs108|chr3            (86 aa)
 initn: 544 init1: 313 opt: 337 Z-score: 493.0 bits: 96.5 E(32554): 3.9e-21
Smith-Waterman score: 491; 74.6% identity (74.6% similar) in 114 aa overlap (37-150:1-86)

         10        20        30        40        50        60
pF1KB7 EDIFDVKDIDPEGKKFDRVSRLHCESESFKMDLILDVNIQIYPVDLGDKFRLVIASTLYE
                                     ::::::::::::::::::::::::::::::
CCDS63                               MDLILDVNIQIYPVDLGDKFRLVIASTLYE
                                             10        20        30

         70        80        90       100       110       120
pF1KB7 DGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGDETSTEAATRLSAYVSYGGLLMRLQ
       :::::::::::::::::                             ::::::::::::::
CCDS63 DGTLDDGEYNPTDDRPSS----------------------------SAYVSYGGLLMRLQ
               40                                    50        60

        130       140       150
pF1KB7 GDANNLHGFEVDSRVYLLMKKLAF
       ::::::::::::::::::::::::
CCDS63 GDANNLHGFEVDSRVYLLMKKLAF
             70        80


150 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Fri Nov 4 08:26:29 2016 done: Fri Nov 4 08:26:29 2016
 Total Scan time: 1.670 Total Display time: -0.030

Function used was FASTA [36.3.4 Apr, 2011]