FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB7937, 376 aa
1>>>pF1KB7937 376 - 376 aa - 376 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 10.3570+/-0.00111; mu= -4.5821+/- 0.067
mean_var=294.5648+/-60.817, 0's: 0 Z-trim(113.2): 40 B-trim: 366 in 1/52
Lambda= 0.074728
statistics sampled from 13843 (13875) to 13843 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.741), E-opt: 0.2 (0.426), width: 16
Scan time: 3.130
The best scores are: opt bits E(32554)
CCDS9873.1 GTF2A1 gene_id:2957|Hs108|chr14 ( 376) 2492 281.7 7.3e-76
CCDS9874.1 GTF2A1 gene_id:2957|Hs108|chr14 ( 337) 2246 255.2 6.4e-68
>>CCDS9873.1 GTF2A1 gene_id:2957|Hs108|chr14 (376 aa)
initn: 2492 init1: 2492 opt: 2492 Z-score: 1475.5 bits: 281.7 E(32554): 7.3e-76
Smith-Waterman score: 2492; 100.0% identity (100.0% similar) in 376 aa overlap (1-376:1-376)
10 20 30 40 50 60
pF1KB7 MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFH
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 MANSANTNTVPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFH
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB7 SEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 SEEQQLLLQVQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIV
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB7 PDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 PDSKLIQHMNASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQ
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB7 SVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 SVVLQQQVIPQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTV
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB7 AAPTPAQAQITATGQQQPQAQPAQTQAPLVLQVDGTGDTSSEEDEDEEEDYDDDEEEDKE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 AAPTPAQAQITATGQQQPQAQPAQTQAPLVLQVDGTGDTSSEEDEDEEEDYDDDEEEDKE
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB7 KDGAEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 KDGAEDGQVEEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNL
310 320 330 340 350 360
370
pF1KB7 NGRDYIFSKAIGDAEW
::::::::::::::::
CCDS98 NGRDYIFSKAIGDAEW
370
>>CCDS9874.1 GTF2A1 gene_id:2957|Hs108|chr14 (337 aa)
initn: 2246 init1: 2246 opt: 2246 Z-score: 1332.8 bits: 255.2 E(32554): 6.4e-68
Smith-Waterman score: 2246; 100.0% identity (100.0% similar) in 337 aa overlap (40-376:1-337)
10 20 30 40 50 60
pF1KB7 VPKLYRSVIEDVINDVRDIFLDDGVDEQVLMELKTLWENKLMQSRAVDGFHSEEQQLLLQ
::::::::::::::::::::::::::::::
CCDS98 MELKTLWENKLMQSRAVDGFHSEEQQLLLQ
10 20 30
70 80 90 100 110 120
pF1KB7 VQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 VQQQHQPQQQQHHHHHHHQQAQPQQTVPQQAQTQQVLIPASQQATAPQVIVPDSKLIQHM
40 50 60 70 80 90
130 140 150 160 170 180
pF1KB7 NASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 NASNMSAAATAATLALPAGVTPVQQILTNSGQLLQVVRAANGAQYIFQPQQSVVLQQQVI
100 110 120 130 140 150
190 200 210 220 230 240
pF1KB7 PQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPAQAQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 PQMQPGGVQAPVIQQVLAPLPGGISPQTGVIIQPQQILFTGNKTQVIPTTVAAPTPAQAQ
160 170 180 190 200 210
250 260 270 280 290 300
pF1KB7 ITATGQQQPQAQPAQTQAPLVLQVDGTGDTSSEEDEDEEEDYDDDEEEDKEKDGAEDGQV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 ITATGQQQPQAQPAQTQAPLVLQVDGTGDTSSEEDEDEEEDYDDDEEEDKEKDGAEDGQV
220 230 240 250 260 270
310 320 330 340 350 360
pF1KB7 EEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS98 EEEPLNSEDDVSDEEGQELFDTENVVVCQYDKIHRSKNKWKFHLKDGIMNLNGRDYIFSK
280 290 300 310 320 330
370
pF1KB7 AIGDAEW
:::::::
CCDS98 AIGDAEW
376 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 14:40:47 2016 done: Sat Nov 5 14:40:47 2016
Total Scan time: 3.130 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]