FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KE1083, 370 aa
1>>>pF1KE1083 370 - 370 aa - 370 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 9.8230+/-0.00116; mu= -3.7565+/- 0.069
mean_var=240.9719+/-47.401, 0's: 0 Z-trim(110.2): 57 B-trim: 0 in 0/53
Lambda= 0.082621
statistics sampled from 11362 (11401) to 11362 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.692), E-opt: 0.2 (0.35), width: 16
Scan time: 3.110
The best scores are:                                       opt bits E(32554)
CCDS9900.1 ATXN3 gene_id:4287|Hs108|chr14          ( 361) 2014 253.0   3e-67
CCDS32143.1 ATXN3 gene_id:4287|Hs108|chr14         ( 306) 1595 203.0 2.9e-52
CCDS45154.1 ATXN3 gene_id:4287|Hs108|chr14         ( 346) 1500 191.7 8.1e-49
CCDS73680.1 ATXN3 gene_id:4287|Hs108|chr14         ( 291) 1492 190.7 1.4e-48
CCDS48080.1 ATXN3L gene_id:92552|Hs108|chrX        ( 355) 1475 188.7 6.5e-48
CCDS53908.1 ATXN3 gene_id:4287|Hs108|chr14         ( 182)  805 108.7 4.2e-24
>>CCDS9900.1 ATXN3 gene_id:4287|Hs108|chr14 (361 aa)
initn: 2109 init1: 2012 opt: 2014 Z-score: 1320.7 bits: 253.0 E(32554): 3e-67
Smith-Waterman score: 2344; 97.6% identity (97.6% similar) in 370 aa overlap (1-370:1-361)
10 20 30 40 50 60
pF1KE1 MESIFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTSEDYRT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 MESIFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTSEDYRT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 FLQQPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHW
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 FLQQPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHW
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 FTVRKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEADQLLQM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 FTVRKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEADQLLQM
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 IRVQQMHRPKLIGEELAQLKEQRVHKTDLERVLEANDGSGMLDEDEEDLQRALALSRQEI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 IRVQQMHRPKLIGEELAQLKEQRVHKTDLERVLEANDGSGMLDEDEEDLQRALALSRQEI
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE1 DMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKRREAYFEKQQQKQQQQQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 DMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKRREAYFEKQQQKQQQQQ
250 260 270 280 290 300
310 320 330 340 350 360
pF1KE1 QQQQQQQQQQQQQQGDLSGQSSHPCERPATSSGALGSDLGDAMSEEDMLQAAVTMSLETV
:: :::::::::::::::::::::::::::::::::::::::::::::::::
CCDS99 QQ---------QQQGDLSGQSSHPCERPATSSGALGSDLGDAMSEEDMLQAAVTMSLETV
310 320 330 340 350
370
pF1KE1 RNDLKTEGKK
::::::::::
CCDS99 RNDLKTEGKK
360
>>CCDS32143.1 ATXN3 gene_id:4287|Hs108|chr14 (306 aa)
initn: 1690 init1: 1593 opt: 1595 Z-score: 1051.9 bits: 203.0 E(32554): 2.9e-52
Smith-Waterman score: 1925; 96.8% identity (97.1% similar) in 308 aa overlap (63-370:8-306)
40 50 60 70 80 90
pF1KE1 LSSIAHQLDEEERMRMAEGGVTSEDYRTFLQQPSGNMDDSGFFSIQVISNALKVWGLELI
.:::::::::::::::::::::::::::::
CCDS32 MESIFHEKQPSGNMDDSGFFSIQVISNALKVWGLELI
10 20 30
100 110 120 130 140 150
pF1KE1 LFNSPEYQRLRIDPINERSFICNYKEHWFTVRKLGKQWFNLNSLLTGPELISDTYLALFL
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 LFNSPEYQRLRIDPINERSFICNYKEHWFTVRKLGKQWFNLNSLLTGPELISDTYLALFL
40 50 60 70 80 90
160 170 180 190 200 210
pF1KE1 AQLQQEGYSIFVVKGDLPDCEADQLLQMIRVQQMHRPKLIGEELAQLKEQRVHKTDLERV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 AQLQQEGYSIFVVKGDLPDCEADQLLQMIRVQQMHRPKLIGEELAQLKEQRVHKTDLERV
100 110 120 130 140 150
220 230 240 250 260 270
pF1KE1 LEANDGSGMLDEDEEDLQRALALSRQEIDMEDEEADLRRAIQLSMQGSSRNISQDMTQTS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 LEANDGSGMLDEDEEDLQRALALSRQEIDMEDEEADLRRAIQLSMQGSSRNISQDMTQTS
160 170 180 190 200 210
280 290 300 310 320 330
pF1KE1 GTNLTSEELRKRREAYFEKQQQKQQQQQQQQQQQQQQQQQQQGDLSGQSSHPCERPATSS
::::::::::::::::::::::::::::::: ::::::::::::::::::::
CCDS32 GTNLTSEELRKRREAYFEKQQQKQQQQQQQQ---------QQGDLSGQSSHPCERPATSS
220 230 240 250 260
340 350 360 370
pF1KE1 GALGSDLGDAMSEEDMLQAAVTMSLETVRNDLKTEGKK
::::::::::::::::::::::::::::::::::::::
CCDS32 GALGSDLGDAMSEEDMLQAAVTMSLETVRNDLKTEGKK
270 280 290 300
>>CCDS45154.1 ATXN3 gene_id:4287|Hs108|chr14 (346 aa)
initn: 1591 init1: 1494 opt: 1500 Z-score: 989.9 bits: 191.7 E(32554): 8.1e-49
Smith-Waterman score: 2198; 93.5% identity (93.5% similar) in 370 aa overlap (1-370:1-346)
10 20 30 40 50 60
pF1KE1 MESIFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTSEDYRT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 MESIFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTSEDYRT
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 FLQQPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHW
::: ::::::::::::::::::::::::::::::::::::::::::
CCDS45 FLQ---------------VISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHW
70 80 90 100
130 140 150 160 170 180
pF1KE1 FTVRKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEADQLLQM
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 FTVRKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEADQLLQM
110 120 130 140 150 160
190 200 210 220 230 240
pF1KE1 IRVQQMHRPKLIGEELAQLKEQRVHKTDLERVLEANDGSGMLDEDEEDLQRALALSRQEI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 IRVQQMHRPKLIGEELAQLKEQRVHKTDLERVLEANDGSGMLDEDEEDLQRALALSRQEI
170 180 190 200 210 220
250 260 270 280 290 300
pF1KE1 DMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKRREAYFEKQQQKQQQQQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 DMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKRREAYFEKQQQKQQQQQ
230 240 250 260 270 280
310 320 330 340 350 360
pF1KE1 QQQQQQQQQQQQQQGDLSGQSSHPCERPATSSGALGSDLGDAMSEEDMLQAAVTMSLETV
:: :::::::::::::::::::::::::::::::::::::::::::::::::
CCDS45 QQ---------QQQGDLSGQSSHPCERPATSSGALGSDLGDAMSEEDMLQAAVTMSLETV
290 300 310 320 330
370
pF1KE1 RNDLKTEGKK
::::::::::
CCDS45 RNDLKTEGKK
340
>>CCDS73680.1 ATXN3 gene_id:4287|Hs108|chr14 (291 aa)
initn: 1584 init1: 1487 opt: 1492 Z-score: 985.8 bits: 190.7 E(32554): 1.4e-48
Smith-Waterman score: 1822; 95.9% identity (96.3% similar) in 296 aa overlap (75-370:5-291)
50 60 70 80 90 100
pF1KE1 RMRMAEGGVTSEDYRTFLQQPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRI
: .::::::::::::::::::::::::::
CCDS73 MESIFHEKVISNALKVWGLELILFNSPEYQRLRI
10 20 30
110 120 130 140 150 160
pF1KE1 DPINERSFICNYKEHWFTVRKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFV
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 DPINERSFICNYKEHWFTVRKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFV
40 50 60 70 80 90
170 180 190 200 210 220
pF1KE1 VKGDLPDCEADQLLQMIRVQQMHRPKLIGEELAQLKEQRVHKTDLERVLEANDGSGMLDE
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 VKGDLPDCEADQLLQMIRVQQMHRPKLIGEELAQLKEQRVHKTDLERVLEANDGSGMLDE
100 110 120 130 140 150
230 240 250 260 270 280
pF1KE1 DEEDLQRALALSRQEIDMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS73 DEEDLQRALALSRQEIDMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKR
160 170 180 190 200 210
290 300 310 320 330 340
pF1KE1 REAYFEKQQQKQQQQQQQQQQQQQQQQQQQGDLSGQSSHPCERPATSSGALGSDLGDAMS
::::::::::::::::::::: ::::::::::::::::::::::::::::::
CCDS73 REAYFEKQQQKQQQQQQQQQQ---------GDLSGQSSHPCERPATSSGALGSDLGDAMS
220 230 240 250 260
350 360 370
pF1KE1 EEDMLQAAVTMSLETVRNDLKTEGKK
::::::::::::::::::::::::::
CCDS73 EEDMLQAAVTMSLETVRNDLKTEGKK
270 280 290
>>CCDS48080.1 ATXN3L gene_id:92552|Hs108|chrX (355 aa)
initn: 1436 init1: 1311 opt: 1475 Z-score: 973.6 bits: 188.7 E(32554): 6.5e-48
Smith-Waterman score: 1594; 68.1% identity (83.8% similar) in 370 aa overlap (1-370:1-355)
10 20 30 40 50 60
pF1KE1 MESIFHEKQEGSLCAQHCLNNLLQGEYFSPVELSSIAHQLDEEERMRMAEGGVTSEDYRT
:. :::::::: :::::::::::::::::::::.::::::::::::::::::::::.: .
CCDS48 MDFIFHEKQEGFLCAQHCLNNLLQGEYFSPVELASIAHQLDEEERMRMAEGGVTSEEYLA
10 20 30 40 50 60
70 80 90 100 110 120
pF1KE1 FLQQPSGNMDDSGFFSIQVISNALKVWGLELILFNSPEYQRLRIDPINERSFICNYKEHW
:::::: ::::.::::::::::::: ::::.: ::.::::.: ::::::::::::::.::
CCDS48 FLQQPSENMDDTGFFSIQVISNALKFWGLEIIHFNNPEYQKLGIDPINERSFICNYKQHW
70 80 90 100 110 120
130 140 150 160 170 180
pF1KE1 FTVRKLGKQWFNLNSLLTGPELISDTYLALFLAQLQQEGYSIFVVKGDLPDCEADQLLQM
::.::.::.::::::::.:::::::: :: :::.:::..::.:::::::::::::::::.
CCDS48 FTIRKFGKHWFNLNSLLAGPELISDTCLANFLARLQQQAYSVFVVKGDLPDCEADQLLQI
130 140 150 160 170 180
190 200 210 220 230 240
pF1KE1 IRVQQMHRPKLIGEELAQLKEQRVHKTDLERVLEANDGSGMLDEDEEDLQRALALSRQEI
: :..: ::: :..:.. ::.::.:: ::.: : .: :: :.::::.:::: :::::
CCDS48 ISVEEMDTPKLNGKKLVKQKEHRVYKTVLEKVSEESDESGTSDQDEEDFQRALELSRQET
190 200 210 220 230 240
250 260 270 280 290 300
pF1KE1 DMEDEEADLRRAIQLSMQGSSRNISQDMTQTSGTNLTSEELRKRREAYFEKQQQKQQQQQ
. :::. :: .:.::::::: : :::. .:: .. .::. .: .: ::::.::
CCDS48 NREDEH--LRSTIELSMQGSSGNTSQDLPKTSCVTPASEQPKKIKEDYFEKHQQ------
250 260 270 280 290
310 320 330 340 350 360
pF1KE1 QQQQQQQQQQQQQQGDLSGQSSHPCERPATSSGALGSDLGDAMSEEDMLQAAVTMSLETV
.:.:::::.:: :.::. :::.::: :. :::.: .:: .:::: :: .
CCDS48 ------EQKQQQQQSDLPGHSSYLHERPTTSSRAIESDLSDDISE-GTVQAAVDTILEIM
300 310 320 330 340
370
pF1KE1 RNDLKTEGKK
:..:: .:.:
CCDS48 RKNLKIKGEK
350
>>CCDS53908.1 ATXN3 gene_id:4287|Hs108|chr14 (182 aa)
initn: 900 init1: 803 opt: 805 Z-score: 546.2 bits: 108.7 E(32554): 4.2e-24
Smith-Waterman score: 1135; 95.3% identity (95.3% similar) in 191 aa overlap (180-370:1-182)
150 160 170 180 190 200
pF1KE1 LFLAQLQQEGYSIFVVKGDLPDCEADQLLQMIRVQQMHRPKLIGEELAQLKEQRVHKTDL
::::::::::::::::::::::::::::::
CCDS53 MIRVQQMHRPKLIGEELAQLKEQRVHKTDL
10 20 30
210 220 230 240 250 260
pF1KE1 ERVLEANDGSGMLDEDEEDLQRALALSRQEIDMEDEEADLRRAIQLSMQGSSRNISQDMT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS53 ERVLEANDGSGMLDEDEEDLQRALALSRQEIDMEDEEADLRRAIQLSMQGSSRNISQDMT
40 50 60 70 80 90
270 280 290 300 310 320
pF1KE1 QTSGTNLTSEELRKRREAYFEKQQQKQQQQQQQQQQQQQQQQQQQGDLSGQSSHPCERPA
:::::::::::::::::::::::::::::::: :::::::::::::::::::
CCDS53 QTSGTNLTSEELRKRREAYFEKQQQKQQQQQQ---------QQQQGDLSGQSSHPCERPA
100 110 120 130 140
330 340 350 360 370
pF1KE1 TSSGALGSDLGDAMSEEDMLQAAVTMSLETVRNDLKTEGKK
:::::::::::::::::::::::::::::::::::::::::
CCDS53 TSSGALGSDLGDAMSEEDMLQAAVTMSLETVRNDLKTEGKK
150 160 170 180
370 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Sat Nov 5 09:16:34 2016 done: Sat Nov 5 09:16:34 2016
Total Scan time: 3.110 Total Display time: 0.020
Function used was FASTA [36.3.4 Apr, 2011]