FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB4371, 158 aa
1>>>pF1KB4371 158 - 158 aa - 158 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 5.2495+/-0.000584; mu= 12.9905+/- 0.035
mean_var=69.5693+/-13.763, 0's: 0 Z-trim(113.2): 6 B-trim: 0 in 0/50
Lambda= 0.153768
statistics sampled from 13860 (13866) to 13860 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.796), E-opt: 0.2 (0.426), width: 16
Scan time: 1.810
The best scores are: opt bits E(32554)
CCDS47673.1 POLR2J3 gene_id:548644|Hs108|chr7 ( 115) 678 158.2 1.4e-39
CCDS43627.1 POLR2J2 gene_id:246721|Hs108|chr7 ( 115) 671 156.7 4.1e-39
CCDS5724.1 POLR2J gene_id:5439|Hs108|chr7 ( 117) 671 156.7 4.2e-39
>>CCDS47673.1 POLR2J3 gene_id:548644|Hs108|chr7 (115 aa)
initn: 678 init1: 678 opt: 678 Z-score: 824.1 bits: 158.2 E(32554): 1.4e-39
Smith-Waterman score: 678; 99.0% identity (99.0% similar) in 105 aa overlap (1-105:1-105)
10 20 30 40 50 60
pF1KB4 MNAPPAFESFLLFEGEKITINKDTKVPNACLFTINKEDHTLGNIIKSQLLKDPQVLFAGY
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS47 MNAPPAFESFLLFEGEKITINKDTKVPNACLFTINKEDHTLGNIIKSQLLKDPQVLFAGY
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB4 KVPHPLEHKIIIRVQTTPDYSPQEAFTNAITDLIRELSLLEERFRVRAGPGGADGVGWTL
:::::::::::::::::::::::::::::::::: ::::::::::
CCDS47 KVPHPLEHKIIIRVQTTPDYSPQEAFTNAITDLISELSLLEERFRTCLLPLRLLP
70 80 90 100 110
130 140 150
pF1KB4 ARVPRPGTALACFFGGPQGEAAVMEEQGLPPQAPGHVD
>>CCDS43627.1 POLR2J2 gene_id:246721|Hs108|chr7 (115 aa)
initn: 671 init1: 671 opt: 671 Z-score: 815.7 bits: 156.7 E(32554): 4.1e-39
Smith-Waterman score: 671; 98.1% identity (99.0% similar) in 105 aa overlap (1-105:1-105)
10 20 30 40 50 60
pF1KB4 MNAPPAFESFLLFEGEKITINKDTKVPNACLFTINKEDHTLGNIIKSQLLKDPQVLFAGY
:::::::::::::::::::::::::::.::::::::::::::::::::::::::::::::
CCDS43 MNAPPAFESFLLFEGEKITINKDTKVPKACLFTINKEDHTLGNIIKSQLLKDPQVLFAGY
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB4 KVPHPLEHKIIIRVQTTPDYSPQEAFTNAITDLIRELSLLEERFRVRAGPGGADGVGWTL
:::::::::::::::::::::::::::::::::: ::::::::::
CCDS43 KVPHPLEHKIIIRVQTTPDYSPQEAFTNAITDLISELSLLEERFRTCLLPLRLLP
70 80 90 100 110
130 140 150
pF1KB4 ARVPRPGTALACFFGGPQGEAAVMEEQGLPPQAPGHVD
>>CCDS5724.1 POLR2J gene_id:5439|Hs108|chr7 (117 aa)
initn: 575 init1: 575 opt: 671 Z-score: 815.6 bits: 156.7 E(32554): 4.2e-39
Smith-Waterman score: 671; 98.1% identity (98.1% similar) in 107 aa overlap (1-106:1-107)
10 20 30 40 50
pF1KB4 MNAPPAFESFLLFEGEK-ITINKDTKVPNACLFTINKEDHTLGNIIKSQLLKDPQVLFAG
::::::::::::::::: ::::::::::::::::::::::::::::::::::::::::::
CCDS57 MNAPPAFESFLLFEGEKKITINKDTKVPNACLFTINKEDHTLGNIIKSQLLKDPQVLFAG
10 20 30 40 50 60
60 70 80 90 100 110
pF1KB4 YKVPHPLEHKIIIRVQTTPDYSPQEAFTNAITDLIRELSLLEERFRVRAGPGGADGVGWT
::::::::::::::::::::::::::::::::::: :::::::::::
CCDS57 YKVPHPLEHKIIIRVQTTPDYSPQEAFTNAITDLISELSLLEERFRVAIKDKQEGIE
70 80 90 100 110
120 130 140 150
pF1KB4 LARVPRPGTALACFFGGPQGEAAVMEEQGLPPQAPGHVD
158 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 14:51:51 2016 done: Thu Nov 3 14:51:52 2016
Total Scan time: 1.810 Total Display time: -0.030
Function used was FASTA [36.3.4 Apr, 2011]