FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7525, 150 aa
1>>>pF1KB7525 150 - 150 aa - 150 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.1832+/-0.000805; mu= 11.4050+/- 0.048
mean_var=48.9573+/- 9.806, 0's: 0 Z-trim(105.2): 10 B-trim: 0 in 0/52
Lambda= 0.183301
statistics sampled from 8271 (8276) to 8271 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.65), E-opt: 0.2 (0.254), width: 16
Scan time: 1.670
The best scores are: opt bits E(32554)
CCDS3264.1 POLR2H gene_id:5437|Hs108|chr3 ( 150) 987 268.4 1.2e-72
CCDS63861.1 POLR2H gene_id:5437|Hs108|chr3 ( 114) 742 203.6 2.9e-53
CCDS63859.1 POLR2H gene_id:5437|Hs108|chr3 ( 175) 742 203.6 4.3e-53
CCDS63860.1 POLR2H gene_id:5437|Hs108|chr3 ( 122) 582 161.3 1.7e-40
CCDS63862.1 POLR2H gene_id:5437|Hs108|chr3 ( 86) 337 96.5 3.9e-21
>>CCDS3264.1 POLR2H gene_id:5437|Hs108|chr3 (150 aa)
initn: 987 init1: 987 opt: 987 Z-score: 1417.9 bits: 268.4 E(32554): 1.2e-72
Smith-Waterman score: 987; 100.0% identity (100.0% similar) in 150 aa overlap (1-150:1-150)
10 20 30 40 50 60
pF1KB7 MAGILFEDIFDVKDIDPEGKKFDRVSRLHCESESFKMDLILDVNIQIYPVDLGDKFRLVI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 MAGILFEDIFDVKDIDPEGKKFDRVSRLHCESESFKMDLILDVNIQIYPVDLGDKFRLVI
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 ASTLYEDGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGDETSTEAATRLSAYVSYGG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 ASTLYEDGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGDETSTEAATRLSAYVSYGG
70 80 90 100 110 120
130 140 150
pF1KB7 LLMRLQGDANNLHGFEVDSRVYLLMKKLAF
::::::::::::::::::::::::::::::
CCDS32 LLMRLQGDANNLHGFEVDSRVYLLMKKLAF
130 140 150
>>CCDS63861.1 POLR2H gene_id:5437|Hs108|chr3 (114 aa)
initn: 742 init1: 742 opt: 742 Z-score: 1069.8 bits: 203.6 E(32554): 2.9e-53
Smith-Waterman score: 742; 100.0% identity (100.0% similar) in 114 aa overlap (37-150:1-114)
10 20 30 40 50 60
pF1KB7 EDIFDVKDIDPEGKKFDRVSRLHCESESFKMDLILDVNIQIYPVDLGDKFRLVIASTLYE
::::::::::::::::::::::::::::::
CCDS63 MDLILDVNIQIYPVDLGDKFRLVIASTLYE
10 20 30
70 80 90 100 110 120
pF1KB7 DGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGDETSTEAATRLSAYVSYGGLLMRLQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 DGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGDETSTEAATRLSAYVSYGGLLMRLQ
40 50 60 70 80 90
130 140 150
pF1KB7 GDANNLHGFEVDSRVYLLMKKLAF
::::::::::::::::::::::::
CCDS63 GDANNLHGFEVDSRVYLLMKKLAF
100 110
>>CCDS63859.1 POLR2H gene_id:5437|Hs108|chr3 (175 aa)
initn: 755 init1: 742 opt: 742 Z-score: 1066.6 bits: 203.6 E(32554): 4.3e-53
Smith-Waterman score: 742; 100.0% identity (100.0% similar) in 112 aa overlap (1-112:1-112)
10 20 30 40 50 60
pF1KB7 MAGILFEDIFDVKDIDPEGKKFDRVSRLHCESESFKMDLILDVNIQIYPVDLGDKFRLVI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 MAGILFEDIFDVKDIDPEGKKFDRVSRLHCESESFKMDLILDVNIQIYPVDLGDKFRLVI
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 ASTLYEDGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGDETSTEAATRLSAYVSYGG
::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 ASTLYEDGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGDETSTEAATRLLRLRAAEW
70 80 90 100 110 120
130 140 150
pF1KB7 LLMRLQGDANNLHGFEVDSRVYLLMKKLAF
CCDS63 QCSRITGWGLLFQLCVRVLWGPAHEAAGGCQQPAWIRGGLQSLSPDEEASLLNLA
130 140 150 160 170
>>CCDS63860.1 POLR2H gene_id:5437|Hs108|chr3 (122 aa)
initn: 789 init1: 558 opt: 582 Z-score: 840.6 bits: 161.3 E(32554): 1.7e-40
Smith-Waterman score: 736; 80.7% identity (80.7% similar) in 150 aa overlap (1-150:1-122)
10 20 30 40 50 60
pF1KB7 MAGILFEDIFDVKDIDPEGKKFDRVSRLHCESESFKMDLILDVNIQIYPVDLGDKFRLVI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS63 MAGILFEDIFDVKDIDPEGKKFDRVSRLHCESESFKMDLILDVNIQIYPVDLGDKFRLVI
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 ASTLYEDGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGDETSTEAATRLSAYVSYGG
::::::::::::::::::::::: ::::::::
CCDS63 ASTLYEDGTLDDGEYNPTDDRPSS----------------------------SAYVSYGG
70 80 90
130 140 150
pF1KB7 LLMRLQGDANNLHGFEVDSRVYLLMKKLAF
::::::::::::::::::::::::::::::
CCDS63 LLMRLQGDANNLHGFEVDSRVYLLMKKLAF
100 110 120
>>CCDS63862.1 POLR2H gene_id:5437|Hs108|chr3 (86 aa)
initn: 544 init1: 313 opt: 337 Z-score: 493.0 bits: 96.5 E(32554): 3.9e-21
Smith-Waterman score: 491; 74.6% identity (74.6% similar) in 114 aa overlap (37-150:1-86)
10 20 30 40 50 60
pF1KB7 EDIFDVKDIDPEGKKFDRVSRLHCESESFKMDLILDVNIQIYPVDLGDKFRLVIASTLYE
::::::::::::::::::::::::::::::
CCDS63 MDLILDVNIQIYPVDLGDKFRLVIASTLYE
10 20 30
70 80 90 100 110 120
pF1KB7 DGTLDDGEYNPTDDRPSRADQFEYVMYGKVYRIEGDETSTEAATRLSAYVSYGGLLMRLQ
::::::::::::::::: ::::::::::::::
CCDS63 DGTLDDGEYNPTDDRPSS----------------------------SAYVSYGGLLMRLQ
40 50 60
130 140 150
pF1KB7 GDANNLHGFEVDSRVYLLMKKLAF
::::::::::::::::::::::::
CCDS63 GDANNLHGFEVDSRVYLLMKKLAF
70 80
150 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 08:26:29 2016 done: Fri Nov 4 08:26:29 2016
Total Scan time: 1.670 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]