FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE2349, 314 aa
1>>>pF1KE2349 314 - 314 aa - 314 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 7.2458+/-0.00067; mu= 8.4600+/- 0.041
mean_var=124.8578+/-24.647, 0's: 0 Z-trim(115.1): 5 B-trim: 0 in 0/55
Lambda= 0.114780
statistics sampled from 15686 (15691) to 15686 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.792), E-opt: 0.2 (0.482), width: 16
Scan time: 3.180
The best scores are:                                opt  bits E(32554)
CCDS2769.1 TREX1 gene_id:11277|Hs108|chr3   ( 314) 2114 360.2 1.2e-99
CCDS43086.1 TREX1 gene_id:11277|Hs108|chr3  ( 369) 2114 360.3 1.3e-99
CCDS59451.1 TREX1 gene_id:11277|Hs108|chr3  ( 304) 2039 347.8 6.2e-96
CCDS35437.1 TREX2 gene_id:11219|Hs108|chrX  ( 236)  605 110.3 1.5e-24
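The rows of the best-scores table above can be read mechanically. The sketch below is a minimal parser for one row; it assumes the layout "<accession> <description> ( <length>) <opt> <bits> <E>" seen in this report. The names HIT_RE, parse_hit, and expected_evalue are hypothetical helpers, not part of FASTA. The relation E ~ m * n * D * 2^(-bits), with m the query length, n the library sequence length, and D the number of library sequences, is an assumption; it reproduces the four E() values listed here to their displayed precision, but it is not claimed to be the program's exact calculation.

```python
import re

# Assumed row layout: "<acc> <desc> ( <len>) <opt> <bits> <E>"
HIT_RE = re.compile(
    r"^(?P<acc>\S+)\s+(?P<desc>.*?)\s*\(\s*(?P<length>\d+)\)\s+"
    r"(?P<opt>\d+)\s+(?P<bits>[\d.]+)\s+(?P<evalue>\S+)\s*$"
)


def parse_hit(line):
    """Parse one best-scores row into a dict, or return None if it does not match."""
    m = HIT_RE.match(line)
    if m is None:
        return None
    return {
        "acc": m["acc"],
        "desc": m["desc"],
        "length": int(m["length"]),
        "opt": int(m["opt"]),
        "bits": float(m["bits"]),
        "evalue": float(m["evalue"]),
    }


def expected_evalue(bits, query_len, subject_len, n_seqs):
    """Assumed relation: E ~ m * n * D * 2**(-bits)."""
    return query_len * subject_len * n_seqs * 2.0 ** (-bits)


if __name__ == "__main__":
    row = "CCDS35437.1 TREX2 gene_id:11219|Hs108|chrX ( 236) 605 110.3 1.5e-24"
    hit = parse_hit(row)
    # Query length (314) and library size (32554) are taken from the report header.
    print(hit["acc"], hit["evalue"], expected_evalue(hit["bits"], 314, hit["length"], 32554))
```

Because the bit scores are printed to one decimal place, the recomputed value can only be expected to agree with the reported E() to within a few percent.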
>>CCDS2769.1 TREX1 gene_id:11277|Hs108|chr3 (314 aa)
initn: 2114 init1: 2114 opt: 2114 Z-score: 1902.7 bits: 360.2 E(32554): 1.2e-99
Smith-Waterman score: 2114; 100.0% identity (100.0% similar) in 314 aa overlap (1-314:1-314)
10 20 30 40 50 60
pF1KE2 MGSQALPPGPMQTLIFFDMEATGLPFSQPKVTELCLLAVHRCALESPPTSQGPPPTVPPP
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 MGSQALPPGPMQTLIFFDMEATGLPFSQPKVTELCLLAVHRCALESPPTSQGPPPTVPPP
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE2 PRVVDKLSLCVAPGKACSPAASEITGLSTAVLAAHGRQCFDDNLANLLLAFLRRQPQPWC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 PRVVDKLSLCVAPGKACSPAASEITGLSTAVLAAHGRQCFDDNLANLLLAFLRRQPQPWC
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE2 LVAHNGDRYDFPLLQAELAMLGLTSALDGAFCVDSITALKALERASSPSEHGPRKSYSLG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 LVAHNGDRYDFPLLQAELAMLGLTSALDGAFCVDSITALKALERASSPSEHGPRKSYSLG
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE2 SIYTRLYGQSPPDSHTAEGDVLALLSICQWRPQALLRWVDAHARPFGTIRPMYGVTASAR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 SIYTRLYGQSPPDSHTAEGDVLALLSICQWRPQALLRWVDAHARPFGTIRPMYGVTASAR
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE2 TKPRPSAVTTTAHLATTRNTSPSLGESRGTKDLPPVKDPGALSREGLLAPLGLLAILTLA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 TKPRPSAVTTTAHLATTRNTSPSLGESRGTKDLPPVKDPGALSREGLLAPLGLLAILTLA
250 260 270 280 290 300
310
pF1KE2 VATLYGLSLATPGE
::::::::::::::
CCDS27 VATLYGLSLATPGE
310
>>CCDS43086.1 TREX1 gene_id:11277|Hs108|chr3 (369 aa)
initn: 2114 init1: 2114 opt: 2114 Z-score: 1901.7 bits: 360.3 E(32554): 1.3e-99
Smith-Waterman score: 2114; 100.0% identity (100.0% similar) in 314 aa overlap (1-314:56-369)
                                             10        20        30
pF1KE2                               MGSQALPPGPMQTLIFFDMEATGLPFSQPK
                                     ::::::::::::::::::::::::::::::
CCDS43 PLPPLRILTLGTHTPTPCSSPGSAAGTYPTMGSQALPPGPMQTLIFFDMEATGLPFSQPK
          30        40        50        60        70        80
40 50 60 70 80 90
pF1KE2 VTELCLLAVHRCALESPPTSQGPPPTVPPPPRVVDKLSLCVAPGKACSPAASEITGLSTA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 VTELCLLAVHRCALESPPTSQGPPPTVPPPPRVVDKLSLCVAPGKACSPAASEITGLSTA
90 100 110 120 130 140
100 110 120 130 140 150
pF1KE2 VLAAHGRQCFDDNLANLLLAFLRRQPQPWCLVAHNGDRYDFPLLQAELAMLGLTSALDGA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 VLAAHGRQCFDDNLANLLLAFLRRQPQPWCLVAHNGDRYDFPLLQAELAMLGLTSALDGA
150 160 170 180 190 200
160 170 180 190 200 210
pF1KE2 FCVDSITALKALERASSPSEHGPRKSYSLGSIYTRLYGQSPPDSHTAEGDVLALLSICQW
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 FCVDSITALKALERASSPSEHGPRKSYSLGSIYTRLYGQSPPDSHTAEGDVLALLSICQW
210 220 230 240 250 260
220 230 240 250 260 270
pF1KE2 RPQALLRWVDAHARPFGTIRPMYGVTASARTKPRPSAVTTTAHLATTRNTSPSLGESRGT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 RPQALLRWVDAHARPFGTIRPMYGVTASARTKPRPSAVTTTAHLATTRNTSPSLGESRGT
270 280 290 300 310 320
280 290 300 310
pF1KE2 KDLPPVKDPGALSREGLLAPLGLLAILTLAVATLYGLSLATPGE
::::::::::::::::::::::::::::::::::::::::::::
CCDS43 KDLPPVKDPGALSREGLLAPLGLLAILTLAVATLYGLSLATPGE
330 340 350 360
>>CCDS59451.1 TREX1 gene_id:11277|Hs108|chr3 (304 aa)
initn: 2039 init1: 2039 opt: 2039 Z-score: 1835.8 bits: 347.8 E(32554): 6.2e-96
Smith-Waterman score: 2039; 100.0% identity (100.0% similar) in 304 aa overlap (11-314:1-304)
               10        20        30        40        50        60
pF1KE2 MGSQALPPGPMQTLIFFDMEATGLPFSQPKVTELCLLAVHRCALESPPTSQGPPPTVPPP
                 ::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59           MQTLIFFDMEATGLPFSQPKVTELCLLAVHRCALESPPTSQGPPPTVPPP
                         10        20        30        40        50
70 80 90 100 110 120
pF1KE2 PRVVDKLSLCVAPGKACSPAASEITGLSTAVLAAHGRQCFDDNLANLLLAFLRRQPQPWC
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 PRVVDKLSLCVAPGKACSPAASEITGLSTAVLAAHGRQCFDDNLANLLLAFLRRQPQPWC
60 70 80 90 100 110
130 140 150 160 170 180
pF1KE2 LVAHNGDRYDFPLLQAELAMLGLTSALDGAFCVDSITALKALERASSPSEHGPRKSYSLG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 LVAHNGDRYDFPLLQAELAMLGLTSALDGAFCVDSITALKALERASSPSEHGPRKSYSLG
120 130 140 150 160 170
190 200 210 220 230 240
pF1KE2 SIYTRLYGQSPPDSHTAEGDVLALLSICQWRPQALLRWVDAHARPFGTIRPMYGVTASAR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 SIYTRLYGQSPPDSHTAEGDVLALLSICQWRPQALLRWVDAHARPFGTIRPMYGVTASAR
180 190 200 210 220 230
250 260 270 280 290 300
pF1KE2 TKPRPSAVTTTAHLATTRNTSPSLGESRGTKDLPPVKDPGALSREGLLAPLGLLAILTLA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 TKPRPSAVTTTAHLATTRNTSPSLGESRGTKDLPPVKDPGALSREGLLAPLGLLAILTLA
240 250 260 270 280 290
310
pF1KE2 VATLYGLSLATPGE
::::::::::::::
CCDS59 VATLYGLSLATPGE
300
>>CCDS35437.1 TREX2 gene_id:11219|Hs108|chrX (236 aa)
initn: 412 init1: 412 opt: 605 Z-score: 554.1 bits: 110.3 E(32554): 1.5e-24
Smith-Waterman score: 605; 46.5% identity (66.8% similar) in 226 aa overlap (12-233:8-226)
10 20 30 40 50 60
pF1KE2 MGSQALPPGPMQTLIFFDMEATGLPFSQPKVTELCLLAVHRCALESPPTSQGPPPTVPPP
.:..:.:.:::::: .:...:: :.:::: .::.: ... ..:
CCDS35 MSEAPRAETFVFLDLEATGLPSVEPEIAELSLFAVHRSSLENPEHDESGALVLP--
10 20 30 40 50
70 80 90 100 110 120
pF1KE2 PRVVDKLSLCVAPGKACSPAASEITGLSTAVLAAHGRQCFDDNLANLLLAFLRRQPQPWC
::.:::.::. : . . ::::::::. :: . :: .. : ::: :: : :
CCDS35 -RVLDKLTLCMCPERPFTAKASEITGLSSEGLARCRKAGFDGAVVRTLQAFLSRQAGPIC
60 70 80 90 100 110
130 140 150 160 170
pF1KE2 LVAHNGDRYDFPLLQAELAMLGLTSALDGAFCVDSITALKALERASSPSEHGPR----KS
:::::: :::::: ::: :: : . :.:.. ::..:.:: : :: : ..
CCDS35 LVAHNGFDYDFPLLCAELRRLGARLPRD-TVCLDTLPALRGLDRAHS---HGTRARGRQG
120 130 140 150 160
180 190 200 210 220 230
pF1KE2 YSLGSIYTRLYGQSPPDSHTAEGDVLALLSICQWRPQALLRWVDAHARPFGTIRPMYGVT
:::::.. : . : .:.::::: .:: : : :: :.: .:: .. :.:::
CCDS35 YSLGSLFHRYFRAEPSAAHSAEGDVHTLLLIFLHRAAELLAWADEQARGWAHIEPMYLPP
170 180 190 200 210 220
240 250 260 270 280 290
pF1KE2 ASARTKPRPSAVTTTAHLATTRNTSPSLGESRGTKDLPPVKDPGALSREGLLAPLGLLAI
CCDS35 DDPSLEA
230
314 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sun Nov 6 17:48:53 2016 done: Sun Nov 6 17:48:54 2016
Total Scan time: 3.180 Total Display time: -0.010
Function used was FASTA [36.3.4 Apr, 2011]
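The "Smith-Waterman score" line in each alignment block above carries the identity, similarity, overlap length, and the aligned coordinate ranges. The sketch below parses that line and derives how many alignment columns are gaps on each side. SW_RE, parse_sw_line, and gap_columns are hypothetical names. It assumes that the reported overlap counts alignment columns including gaps, which is consistent with the TREX2 alignment in this report (226 columns, query 12-233, library 8-226).

```python
import re

# Assumed layout of the per-alignment summary line in this report.
SW_RE = re.compile(
    r"Smith-Waterman score:\s*(?P<score>\d+);\s*"
    r"(?P<ident>[\d.]+)% identity \((?P<sim>[\d.]+)% similar\) in "
    r"(?P<overlap>\d+) aa overlap "
    r"\((?P<q0>\d+)-(?P<q1>\d+):(?P<s0>\d+)-(?P<s1>\d+)\)"
)


def parse_sw_line(line):
    """Return the fields of a Smith-Waterman summary line, or None if no match."""
    m = SW_RE.search(line)
    if m is None:
        return None
    rec = {k: int(m[k]) for k in ("score", "overlap", "q0", "q1", "s0", "s1")}
    rec["ident"] = float(m["ident"])
    rec["sim"] = float(m["sim"])
    return rec


def gap_columns(rec):
    """Gap columns in (query, library), assuming overlap counts gapped columns."""
    q_span = rec["q1"] - rec["q0"] + 1
    s_span = rec["s1"] - rec["s0"] + 1
    return rec["overlap"] - q_span, rec["overlap"] - s_span


if __name__ == "__main__":
    line = ("Smith-Waterman score: 605; 46.5% identity (66.8% similar) "
            "in 226 aa overlap (12-233:8-226)")
    rec = parse_sw_line(line)
    print(rec["ident"], gap_columns(rec))
```

For the TREX2 hit this gives 4 gap columns in the query and 7 in the library sequence, matching the "-" characters visible in that alignment; the three TREX1 hits give no gaps on either side.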