FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KE2349, 314 aa
  1>>>pF1KE2349 314 - 314 aa - 314 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 7.2458+/-0.00067; mu= 8.4600+/- 0.041
 mean_var=124.8578+/-24.647, 0's: 0 Z-trim(115.1): 5 B-trim: 0 in 0/55
 Lambda= 0.114780
 statistics sampled from 15686 (15691) to 15686 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.792), E-opt: 0.2 (0.482), width: 16
 Scan time: 3.180

The best scores are:                                       opt  bits E(32554)
CCDS2769.1 TREX1 gene_id:11277|Hs108|chr3          ( 314) 2114 360.2 1.2e-99
CCDS43086.1 TREX1 gene_id:11277|Hs108|chr3         ( 369) 2114 360.3 1.3e-99
CCDS59451.1 TREX1 gene_id:11277|Hs108|chr3         ( 304) 2039 347.8 6.2e-96
CCDS35437.1 TREX2 gene_id:11219|Hs108|chrX         ( 236)  605 110.3 1.5e-24

>>CCDS2769.1 TREX1 gene_id:11277|Hs108|chr3 (314 aa)
 initn: 2114 init1: 2114 opt: 2114 Z-score: 1902.7 bits: 360.2 E(32554): 1.2e-99
Smith-Waterman score: 2114; 100.0% identity (100.0% similar) in 314 aa overlap (1-314:1-314)

               10        20        30        40        50        60
pF1KE2 MGSQALPPGPMQTLIFFDMEATGLPFSQPKVTELCLLAVHRCALESPPTSQGPPPTVPPP
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 MGSQALPPGPMQTLIFFDMEATGLPFSQPKVTELCLLAVHRCALESPPTSQGPPPTVPPP
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KE2 PRVVDKLSLCVAPGKACSPAASEITGLSTAVLAAHGRQCFDDNLANLLLAFLRRQPQPWC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 PRVVDKLSLCVAPGKACSPAASEITGLSTAVLAAHGRQCFDDNLANLLLAFLRRQPQPWC
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KE2 LVAHNGDRYDFPLLQAELAMLGLTSALDGAFCVDSITALKALERASSPSEHGPRKSYSLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 LVAHNGDRYDFPLLQAELAMLGLTSALDGAFCVDSITALKALERASSPSEHGPRKSYSLG
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KE2 SIYTRLYGQSPPDSHTAEGDVLALLSICQWRPQALLRWVDAHARPFGTIRPMYGVTASAR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 SIYTRLYGQSPPDSHTAEGDVLALLSICQWRPQALLRWVDAHARPFGTIRPMYGVTASAR
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KE2 TKPRPSAVTTTAHLATTRNTSPSLGESRGTKDLPPVKDPGALSREGLLAPLGLLAILTLA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS27 TKPRPSAVTTTAHLATTRNTSPSLGESRGTKDLPPVKDPGALSREGLLAPLGLLAILTLA
              250       260       270       280       290       300

              310
pF1KE2 VATLYGLSLATPGE
       ::::::::::::::
CCDS27 VATLYGLSLATPGE
              310

>>CCDS43086.1 TREX1 gene_id:11277|Hs108|chr3 (369 aa)
 initn: 2114 init1: 2114 opt: 2114 Z-score: 1901.7 bits: 360.3 E(32554): 1.3e-99
Smith-Waterman score: 2114; 100.0% identity (100.0% similar) in 314 aa overlap (1-314:56-369)

                                             10        20        30
pF1KE2                               MGSQALPPGPMQTLIFFDMEATGLPFSQPK
                                     ::::::::::::::::::::::::::::::
CCDS43 PLPPLRILTLGTHTPTPCSSPGSAAGTYPTMGSQALPPGPMQTLIFFDMEATGLPFSQPK
          30        40        50        60        70        80

               40        50        60        70        80        90
pF1KE2 VTELCLLAVHRCALESPPTSQGPPPTVPPPPRVVDKLSLCVAPGKACSPAASEITGLSTA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 VTELCLLAVHRCALESPPTSQGPPPTVPPPPRVVDKLSLCVAPGKACSPAASEITGLSTA
          90       100       110       120       130       140

              100       110       120       130       140       150
pF1KE2 VLAAHGRQCFDDNLANLLLAFLRRQPQPWCLVAHNGDRYDFPLLQAELAMLGLTSALDGA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 VLAAHGRQCFDDNLANLLLAFLRRQPQPWCLVAHNGDRYDFPLLQAELAMLGLTSALDGA
         150       160       170       180       190       200

              160       170       180       190       200       210
pF1KE2 FCVDSITALKALERASSPSEHGPRKSYSLGSIYTRLYGQSPPDSHTAEGDVLALLSICQW
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 FCVDSITALKALERASSPSEHGPRKSYSLGSIYTRLYGQSPPDSHTAEGDVLALLSICQW
         210       220       230       240       250       260

              220       230       240       250       260       270
pF1KE2 RPQALLRWVDAHARPFGTIRPMYGVTASARTKPRPSAVTTTAHLATTRNTSPSLGESRGT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS43 RPQALLRWVDAHARPFGTIRPMYGVTASARTKPRPSAVTTTAHLATTRNTSPSLGESRGT
         270       280       290       300       310       320

              280       290       300       310
pF1KE2 KDLPPVKDPGALSREGLLAPLGLLAILTLAVATLYGLSLATPGE
       ::::::::::::::::::::::::::::::::::::::::::::
CCDS43 KDLPPVKDPGALSREGLLAPLGLLAILTLAVATLYGLSLATPGE
         330       340       350       360

>>CCDS59451.1 TREX1 gene_id:11277|Hs108|chr3 (304 aa)
 initn: 2039 init1: 2039 opt: 2039 Z-score: 1835.8 bits: 347.8 E(32554): 6.2e-96
Smith-Waterman score: 2039; 100.0% identity (100.0% similar) in 304 aa overlap (11-314:1-304)

               10        20        30        40        50        60
pF1KE2 MGSQALPPGPMQTLIFFDMEATGLPFSQPKVTELCLLAVHRCALESPPTSQGPPPTVPPP
                 ::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59           MQTLIFFDMEATGLPFSQPKVTELCLLAVHRCALESPPTSQGPPPTVPPP
                         10        20        30        40        50

               70        80        90       100       110       120
pF1KE2 PRVVDKLSLCVAPGKACSPAASEITGLSTAVLAAHGRQCFDDNLANLLLAFLRRQPQPWC
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 PRVVDKLSLCVAPGKACSPAASEITGLSTAVLAAHGRQCFDDNLANLLLAFLRRQPQPWC
               60        70        80        90       100       110

              130       140       150       160       170       180
pF1KE2 LVAHNGDRYDFPLLQAELAMLGLTSALDGAFCVDSITALKALERASSPSEHGPRKSYSLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 LVAHNGDRYDFPLLQAELAMLGLTSALDGAFCVDSITALKALERASSPSEHGPRKSYSLG
              120       130       140       150       160       170

              190       200       210       220       230       240
pF1KE2 SIYTRLYGQSPPDSHTAEGDVLALLSICQWRPQALLRWVDAHARPFGTIRPMYGVTASAR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 SIYTRLYGQSPPDSHTAEGDVLALLSICQWRPQALLRWVDAHARPFGTIRPMYGVTASAR
              180       190       200       210       220       230

              250       260       270       280       290       300
pF1KE2 TKPRPSAVTTTAHLATTRNTSPSLGESRGTKDLPPVKDPGALSREGLLAPLGLLAILTLA
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS59 TKPRPSAVTTTAHLATTRNTSPSLGESRGTKDLPPVKDPGALSREGLLAPLGLLAILTLA
              240       250       260       270       280       290

              310
pF1KE2 VATLYGLSLATPGE
       ::::::::::::::
CCDS59 VATLYGLSLATPGE
              300

>>CCDS35437.1 TREX2 gene_id:11219|Hs108|chrX (236 aa)
 initn: 412 init1: 412 opt: 605 Z-score: 554.1 bits: 110.3 E(32554): 1.5e-24
Smith-Waterman score: 605; 46.5% identity (66.8% similar) in 226 aa overlap (12-233:8-226)

               10        20        30        40        50        60
pF1KE2 MGSQALPPGPMQTLIFFDMEATGLPFSQPKVTELCLLAVHRCALESPPTSQGPPPTVPPP
                  .:..:.:.::::::  .:...:: :.:::: .::.:  ...   ..:
CCDS35     MSEAPRAETFVFLDLEATGLPSVEPEIAELSLFAVHRSSLENPEHDESGALVLP--
                   10        20        30        40        50

               70        80        90       100       110       120
pF1KE2 PRVVDKLSLCVAPGKACSPAASEITGLSTAVLAAHGRQCFDDNLANLLLAFLRRQPQPWC
        ::.:::.::. : .  .  ::::::::.  ::   .  ::  ..  : ::: ::  : :
CCDS35 -RVLDKLTLCMCPERPFTAKASEITGLSSEGLARCRKAGFDGAVVRTLQAFLSRQAGPIC
            60        70        80        90       100       110

              130       140       150       160       170
pF1KE2 LVAHNGDRYDFPLLQAELAMLGLTSALDGAFCVDSITALKALERASSPSEHGPR----KS
       ::::::  :::::: :::  ::     : . :.:.. ::..:.:: :   :: :    ..
CCDS35 LVAHNGFDYDFPLLCAELRRLGARLPRD-TVCLDTLPALRGLDRAHS---HGTRARGRQG
           120       130       140        150          160

        180       190       200       210       220       230
pF1KE2 YSLGSIYTRLYGQSPPDSHTAEGDVLALLSICQWRPQALLRWVDAHARPFGTIRPMYGVT
       :::::.. : .   :  .:.::::: .:: :   :   :: :.: .:: .. :.:::
CCDS35 YSLGSLFHRYFRAEPSAAHSAEGDVHTLLLIFLHRAAELLAWADEQARGWAHIEPMYLPP
     170       180       190       200       210       220

        240       250       260       270       280       290
pF1KE2 ASARTKPRPSAVTTTAHLATTRNTSPSLGESRGTKDLPPVKDPGALSREGLLAPLGLLAI

CCDS35 DDPSLEA
     230

314 residues in 1 query sequences
18511270 residues in 32554 library sequences
 Tcomplib [36.3.4 Apr, 2011] (8 proc)
 start: Sun Nov 6 17:48:53 2016 done: Sun Nov 6 17:48:54 2016
 Total Scan time: 3.180 Total Display time: -0.010

Function used was FASTA [36.3.4 Apr, 2011]