FASTA searches a protein or DNA sequence data bank, version 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE0463, 416 aa
1>>>pF1KE0463 416 - 416 aa - 416 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 6.3815+/-0.000918; mu= 12.1487+/- 0.056
mean_var=81.9823+/-16.031, 0's: 0 Z-trim(106.6): 14 B-trim: 3 in 1/51
Lambda= 0.141649
statistics sampled from 9057 (9060) to 9057 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.652), E-opt: 0.2 (0.278), width: 16
Scan time: 2.960
The best scores are:                                  opt  bits E(32554)
CCDS4069.1 FAM172A gene_id:83989|Hs108|chr5   ( 416) 2756 572.9 1.9e-163
CCDS54880.1 FAM172A gene_id:83989|Hs108|chr5  ( 370) 2468 514.1 8.9e-146
CCDS54879.1 FAM172A gene_id:83989|Hs108|chr5  ( 306) 1517 319.7 2.4e-87
>>CCDS4069.1 FAM172A gene_id:83989|Hs108|chr5 (416 aa)
initn: 2756 init1: 2756 opt: 2756 Z-score: 3047.9 bits: 572.9 E(32554): 1.9e-163
Smith-Waterman score: 2756; 99.5% identity (99.8% similar) in 416 aa overlap (1-416:1-416)
               10        20        30        40        50        60
pF1KE0 MSISLSSLILLPIWINMAQIQQGGPDEKEKTTALKDLLSRIDLDELMKKDEPPLDFPDTL
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 MSISLSSLILLPIWINMAQIQQGGPDEKEKTTALKDLLSRIDLDELMKKDEPPLDFPDTL
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE0 EGFEYAFNEKGQLRHIKTGEPFVFNYREDLHRWNQKRYEALGEIITKYVYELLEKDCNLK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 EGFEYAFNEKGQLRHIKTGEPFVFNYREDLHRWNQKRYEALGEIITKYVYELLEKDCNLK
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE0 KVSIPVDATESEPKSFIFMSEDALTNPQKLMVLIHGSGVVRAGQWARRLIINEDLDSGTQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 KVSIPVDATESEPKSFIFMSEDALTNPQKLMVLIHGSGVVRAGQWARRLIINEDLDSGTQ
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE0 IPFIKRAVAEGYGVIVLNPNENYIEVEKPKIHVQSSSDSSDEPAEKRERKDKVSKETKKR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 IPFIKRAVAEGYGVIVLNPNENYIEVEKPKIHVQSSSDSSDEPAEKRERKDKVSKETKKR
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE0 RDFYEKYRNPQREKEMMQLYIRENGSPEEHAIYVWDHFIAQAAAENVFFVAHSYGGLAFV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 RDFYEKYRNPQREKEMMQLYIRENGSPEEHAIYVWDHFIAQAAAENVFFVAHSYGGLAFV
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KE0 ELMIQREADVKNKVTAVALTDSVHNVWHQEAGKTIREWMRENCCNWVSSSEPLDTSVESM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS40 ELMIQREADVKNKVTAVALTDSVHNVWHQEAGKTIREWMRENCCNWVSSSEPLDTSVESM
              310       320       330       340       350       360

              370       380       390       400       410
pF1KE0 LPDCPRVPAGTDRHELTSWKSFPSIFKFFTEASEAKTNSLKPAVTRRSHRIKHEEL
       ::::::: :::::::::::::::::::::::::::::.::::::::::::::::::
CCDS40 LPDCPRVSAGTDRHELTSWKSFPSIFKFFTEASEAKTSSLKPAVTRRSHRIKHEEL
              370       380       390       400       410
>>CCDS54880.1 FAM172A gene_id:83989|Hs108|chr5 (370 aa)
initn: 2468 init1: 2468 opt: 2468 Z-score: 2730.6 bits: 514.1 E(32554): 8.9e-146
Smith-Waterman score: 2468; 99.5% identity (99.7% similar) in 370 aa overlap (47-416:1-370)
         20        30        40        50        60        70
pF1KE0 MAQIQQGGPDEKEKTTALKDLLSRIDLDELMKKDEPPLDFPDTLEGFEYAFNEKGQLRHI
                                     ::::::::::::::::::::::::::::::
CCDS54                               MKKDEPPLDFPDTLEGFEYAFNEKGQLRHI
                                             10        20        30

         80        90       100       110       120       130
pF1KE0 KTGEPFVFNYREDLHRWNQKRYEALGEIITKYVYELLEKDCNLKKVSIPVDATESEPKSF
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 KTGEPFVFNYREDLHRWNQKRYEALGEIITKYVYELLEKDCNLKKVSIPVDATESEPKSF
               40        50        60        70        80        90

        140       150       160       170       180       190
pF1KE0 IFMSEDALTNPQKLMVLIHGSGVVRAGQWARRLIINEDLDSGTQIPFIKRAVAEGYGVIV
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 IFMSEDALTNPQKLMVLIHGSGVVRAGQWARRLIINEDLDSGTQIPFIKRAVAEGYGVIV
              100       110       120       130       140       150

        200       210       220       230       240       250
pF1KE0 LNPNENYIEVEKPKIHVQSSSDSSDEPAEKRERKDKVSKETKKRRDFYEKYRNPQREKEM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 LNPNENYIEVEKPKIHVQSSSDSSDEPAEKRERKDKVSKETKKRRDFYEKYRNPQREKEM
              160       170       180       190       200       210

        260       270       280       290       300       310
pF1KE0 MQLYIRENGSPEEHAIYVWDHFIAQAAAENVFFVAHSYGGLAFVELMIQREADVKNKVTA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MQLYIRENGSPEEHAIYVWDHFIAQAAAENVFFVAHSYGGLAFVELMIQREADVKNKVTA
              220       230       240       250       260       270

        320       330       340       350       360       370
pF1KE0 VALTDSVHNVWHQEAGKTIREWMRENCCNWVSSSEPLDTSVESMLPDCPRVPAGTDRHEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::: ::::::::
CCDS54 VALTDSVHNVWHQEAGKTIREWMRENCCNWVSSSEPLDTSVESMLPDCPRVSAGTDRHEL
              280       290       300       310       320       330

        380       390       400       410
pF1KE0 TSWKSFPSIFKFFTEASEAKTNSLKPAVTRRSHRIKHEEL
       :::::::::::::::::::::.::::::::::::::::::
CCDS54 TSWKSFPSIFKFFTEASEAKTSSLKPAVTRRSHRIKHEEL
              340       350       360       370
>>CCDS54879.1 FAM172A gene_id:83989|Hs108|chr5 (306 aa)
initn: 1517 init1: 1517 opt: 1517 Z-score: 1681.6 bits: 319.7 E(32554): 2.4e-87
Smith-Waterman score: 1918; 82.2% identity (82.4% similar) in 370 aa overlap (47-416:1-306)
         20        30        40        50        60        70
pF1KE0 MAQIQQGGPDEKEKTTALKDLLSRIDLDELMKKDEPPLDFPDTLEGFEYAFNEKGQLRHI
                                     ::::::::::::::::::::::::::::::
CCDS54                               MKKDEPPLDFPDTLEGFEYAFNEKGQLRHI
                                             10        20        30

         80        90       100       110       120       130
pF1KE0 KTGEPFVFNYREDLHRWNQKRYEALGEIITKYVYELLEKDCNLKKVSIPVDATESEPKSF
       :::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 KTGEPFVFNYREDLHRWNQKRYEALGEIITKYVYELLEKDCNLKKVSIP-----------
               40        50        60        70

        140       150       160       170       180       190
pF1KE0 IFMSEDALTNPQKLMVLIHGSGVVRAGQWARRLIINEDLDSGTQIPFIKRAVAEGYGVIV
                                                            :::::::
CCDS54 -----------------------------------------------------EGYGVIV
                                                           80

        200       210       220       230       240       250
pF1KE0 LNPNENYIEVEKPKIHVQSSSDSSDEPAEKRERKDKVSKETKKRRDFYEKYRNPQREKEM
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 LNPNENYIEVEKPKIHVQSSSDSSDEPAEKRERKDKVSKETKKRRDFYEKYRNPQREKEM
         90       100       110       120       130       140

        260       270       280       290       300       310
pF1KE0 MQLYIRENGSPEEHAIYVWDHFIAQAAAENVFFVAHSYGGLAFVELMIQREADVKNKVTA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MQLYIRENGSPEEHAIYVWDHFIAQAAAENVFFVAHSYGGLAFVELMIQREADVKNKVTA
        150       160       170       180       190       200

        320       330       340       350       360       370
pF1KE0 VALTDSVHNVWHQEAGKTIREWMRENCCNWVSSSEPLDTSVESMLPDCPRVPAGTDRHEL
       ::::::::::::::::::::::::::::::::::::::::::::::::::: ::::::::
CCDS54 VALTDSVHNVWHQEAGKTIREWMRENCCNWVSSSEPLDTSVESMLPDCPRVSAGTDRHEL
        210       220       230       240       250       260

        380       390       400       410
pF1KE0 TSWKSFPSIFKFFTEASEAKTNSLKPAVTRRSHRIKHEEL
       :::::::::::::::::::::.::::::::::::::::::
CCDS54 TSWKSFPSIFKFFTEASEAKTSSLKPAVTRRSHRIKHEEL
        270       280       290       300
416 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Thu Nov 3 07:06:06 2016 done: Thu Nov 3 07:06:06 2016
Total Scan time: 2.960 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]