FASTA searches a protein or DNA sequence data bank 36.3.4 Apr, 2011
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448
Query: pF1KB8732, 576 aa
1>>>pF1KB8732 576 - 576 aa - 576 aa
Library: human.CCDS.faa
18511270 residues in 32554 sequences
Statistics: Expectation_n fit: rho(ln(x))= 11.4114+/-0.00107; mu= -7.2284+/- 0.065
mean_var=353.1654+/-71.111, 0's: 0 Z-trim(114.6): 102 B-trim: 16 in 1/53
Lambda= 0.068247
statistics sampled from 15057 (15152) to 15057 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
ktup: 2, E-join: 1 (0.757), E-opt: 0.2 (0.465), width: 16
Scan time: 4.240
The best scores are:                                opt  bits E(32554)
CCDS54009.1 TOX3 gene_id:27324|Hs108|chr16  ( 576) 3798 387.8   2e-107
CCDS54008.1 TOX3 gene_id:27324|Hs108|chr16  ( 571) 3584 366.7 4.3e-101
CCDS34897.1 TOX gene_id:9760|Hs108|chr8     ( 526) 1320 143.8  5.1e-34
CCDS32043.1 TOX4 gene_id:9878|Hs108|chr14   ( 621) 1169 129.0  1.7e-29
CCDS13324.1 TOX2 gene_id:84969|Hs108|chr20  ( 464)  829  95.4  1.7e-19
CCDS46603.1 TOX2 gene_id:84969|Hs108|chr20  ( 506)  829  95.4  1.8e-19
CCDS42875.1 TOX2 gene_id:84969|Hs108|chr20  ( 488)  658  78.6    2e-14
>>CCDS54009.1 TOX3 gene_id:27324|Hs108|chr16 (576 aa)
initn: 3798 init1: 3798 opt: 3798 Z-score: 2042.2 bits: 387.8 E(32554): 2e-107
Smith-Waterman score: 3798; 100.0% identity (100.0% similar) in 576 aa overlap (1-576:1-576)
10 20 30 40 50 60
pF1KB8 MDVRFYPAAAGDPASLDFAQCLGYYGYSKFGNNNNYMNMAEANNAFFAASEQTFHTPSLG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MDVRFYPAAAGDPASLDFAQCLGYYGYSKFGNNNNYMNMAEANNAFFAASEQTFHTPSLG
10 20 30 40 50 60
70 80 90 100 110 120
pF1KB8 DEEFEIPPITPPPESDPALGMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLDLPSITISR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DEEFEIPPITPPPESDPALGMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLDLPSITISR
70 80 90 100 110 120
130 140 150 160 170 180
pF1KB8 NLVEQDGVLHSSGLHMDQSHTQVSQYRQDPSLIMRSIVHMTDAARSGVMPPAQLTTINQS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 NLVEQDGVLHSSGLHMDQSHTQVSQYRQDPSLIMRSIVHMTDAARSGVMPPAQLTTINQS
130 140 150 160 170 180
190 200 210 220 230 240
pF1KB8 QLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRAIGEKRAAPDSGKK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 QLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRAIGEKRAAPDSGKK
190 200 210 220 230 240
250 260 270 280 290 300
pF1KB8 PKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDSLGEEQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDSLGEEQ
250 260 270 280 290 300
310 320 330 340 350 360
pF1KB8 KQVYKRKTEAAKKEYLKALAAYRASLVSKAAAESAEAQTIRSVQQTLASTNLTSSLLLNT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 KQVYKRKTEAAKKEYLKALAAYRASLVSKAAAESAEAQTIRSVQQTLASTNLTSSLLLNT
310 320 330 340 350 360
370 380 390 400 410 420
pF1KB8 PLSQHGTVSASPQTLQQSLPRSIAPKPLTMRLPMNQIVTSVTIAANMPSNIGAPLISSMG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PLSQHGTVSASPQTLQQSLPRSIAPKPLTMRLPMNQIVTSVTIAANMPSNIGAPLISSMG
370 380 390 400 410 420
430 440 450 460 470 480
pF1KB8 TTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQQQQMQQMQQQQLQQHQMHQQIQQQMQQQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 TTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQQQQMQQMQQQQLQQHQMHQQIQQQMQQQ
430 440 450 460 470 480
490 500 510 520 530 540
pF1KB8 HFQHHMQQHLQQQQQHLQQQINQQQLQQQLQQRLQLQQLQHMQHQSQPSPRQHSPVASQI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 HFQHHMQQHLQQQQQHLQQQINQQQLQQQLQQRLQLQQLQHMQHQSQPSPRQHSPVASQI
490 500 510 520 530 540
550 560 570
pF1KB8 TSPIPAIGSPQPASQQHQSQIQSQTQTQVLSQVSIF
::::::::::::::::::::::::::::::::::::
CCDS54 TSPIPAIGSPQPASQQHQSQIQSQTQTQVLSQVSIF
550 560 570
>>CCDS54008.1 TOX3 gene_id:27324|Hs108|chr16 (571 aa)
initn: 3466 init1: 3466 opt: 3584 Z-score: 1928.4 bits: 366.7 E(32554): 4.3e-101
Smith-Waterman score: 3584; 99.6% identity (99.6% similar) in 550 aa overlap (27-576:23-571)
10 20 30 40 50 60
pF1KB8 MDVRFYPAAAGDPASLDFAQCLGYYGYSKFGNNNNYMNMAEANNAFFAASEQTFHTPSLG
: ::::::::::::::::::::::: ::::::::
CCDS54 MKCQPRSGARRIEERLHYLITTYLKFGNNNNYMNMAEANNAFFAASE-TFHTPSLG
10 20 30 40 50
70 80 90 100 110 120
pF1KB8 DEEFEIPPITPPPESDPALGMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLDLPSITISR
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DEEFEIPPITPPPESDPALGMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLDLPSITISR
60 70 80 90 100 110
130 140 150 160 170 180
pF1KB8 NLVEQDGVLHSSGLHMDQSHTQVSQYRQDPSLIMRSIVHMTDAARSGVMPPAQLTTINQS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 NLVEQDGVLHSSGLHMDQSHTQVSQYRQDPSLIMRSIVHMTDAARSGVMPPAQLTTINQS
120 130 140 150 160 170
190 200 210 220 230 240
pF1KB8 QLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRAIGEKRAAPDSGKK
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 QLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRAIGEKRAAPDSGKK
180 190 200 210 220 230
250 260 270 280 290 300
pF1KB8 PKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDSLGEEQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDSLGEEQ
240 250 260 270 280 290
310 320 330 340 350 360
pF1KB8 KQVYKRKTEAAKKEYLKALAAYRASLVSKAAAESAEAQTIRSVQQTLASTNLTSSLLLNT
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 KQVYKRKTEAAKKEYLKALAAYRASLVSKAAAESAEAQTIRSVQQTLASTNLTSSLLLNT
300 310 320 330 340 350
370 380 390 400 410 420
pF1KB8 PLSQHGTVSASPQTLQQSLPRSIAPKPLTMRLPMNQIVTSVTIAANMPSNIGAPLISSMG
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PLSQHGTVSASPQTLQQSLPRSIAPKPLTMRLPMNQIVTSVTIAANMPSNIGAPLISSMG
360 370 380 390 400 410
430 440 450 460 470 480
pF1KB8 TTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQQQQMQQMQQQQLQQHQMHQQIQQQMQQQ
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 TTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQQQQMQQMQQQQLQQHQMHQQIQQQMQQQ
420 430 440 450 460 470
490 500 510 520 530 540
pF1KB8 HFQHHMQQHLQQQQQHLQQQINQQQLQQQLQQRLQLQQLQHMQHQSQPSPRQHSPVASQI
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 HFQHHMQQHLQQQQQHLQQQINQQQLQQQLQQRLQLQQLQHMQHQSQPSPRQHSPVASQI
480 490 500 510 520 530
550 560 570
pF1KB8 TSPIPAIGSPQPASQQHQSQIQSQTQTQVLSQVSIF
::::::::::::::::::::::::::::::::::::
CCDS54 TSPIPAIGSPQPASQQHQSQIQSQTQTQVLSQVSIF
540 550 560 570
>>CCDS34897.1 TOX gene_id:9760|Hs108|chr8 (526 aa)
initn: 924 init1: 641 opt: 1320 Z-score: 724.2 bits: 143.8 E(32554): 5.1e-34
Smith-Waterman score: 1328; 49.2% identity (69.6% similar) in 494 aa overlap (1-476:1-464)
10 20 30 40 50
pF1KB8 MDVRFYP-----AAAGDPASLDFAQCLGYYGYSKFGNNNNYMNMAEANNAFFAASEQTFH
::::::: ::: : : . :: : .:: ..: ::.:.: .. . :: :..
CCDS34 MDVRFYPPPAQPAAAPDAPCLGPSPCLDPYYCNKFDGENMYMSMTEPSQDYVPAS-QSYP
10 20 30 40 50
60 70 80 90 100 110
pF1KB8 TPSLGDEEFEIPPITPPPESDPAL-GMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLDLP
::: .:.:.::::::: : .: . .: ...: :. .: . : : ::..:::
CCDS34 GPSLESEDFNIPPITPPSLPDHSLVHLNEVESGYHSLCHPMNHNG--LLP-FHPQNMDLP
60 70 80 90 100 110
120 130 140 150 160 170
pF1KB8 SITISRNLVEQDGVLHSSGLHM--DQSHTQVSQYRQDPSLI-MRSIVHMTDAARS-GVMP
::.: :.. :::.: :... . : . . .:: . :.. :: . .: .. :.::
CCDS34 EITVS-NMLGQDGTLLSNSISVMPDIRNPEGTQYSSHPQMAAMRPRGQPADIRQQPGMMP
120 130 140 150 160 170
180 190 200 210 220
pF1KB8 PAQLTTINQSQLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRAIG-
.:::::::::::::::::.::...::.:::::.:::::::::::..:...:.... :
CCDS34 HGQLTTINQSQLSAQLGLNMGGSNVPHNSPSPPGSKSATPSPSSSVHEDEGDDTSKINGG
180 190 200 210 220 230
230 240 250 260 270 280
pF1KB8 EKRAAPDSGKKPKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIV
::: : : ::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS34 EKRPASDMGKKPKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIV
240 250 260 270 280 290
290 300 310 320 330 340
pF1KB8 ASMWDSLGEEQKQVYKRKTEAAKKEYLKALAAYRASLVSKAAAESAEAQTIRSVQQTLAS
:::::.::::::::::.::::::::::: :::::::::::. .: ....: . : ..
CCDS34 ASMWDGLGEEQKQVYKKKTEAAKKEYLKQLAAYRASLVSKSYSEPVDVKTSQPPQLINSK
300 310 320 330 340 350
350 360 370 380 390 400
pF1KB8 TNLT-------SSLLLNTPLSQHGTVSASPQTLQQSLPRSIAPKPLTMRLPMNQIVTSVT
.. :.: :.. :. .. ... ::::.::::: ::. ..:.
CCDS34 PSVFHGPSQAHSALYLSSHYHQQPGMNPHLTAMHPSLPRNIAPKP------NNQMPVTVS
360 370 380 390 400
410 420 430 440 450 460
pF1KB8 IAANMPSNIGAPLISSMGTTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQQQQMQQMQQQ
:: :: .. : : :.:: . .:: ...::.: .:: :
CCDS34 IA-NM--AVSPP-------------PPLQISPPL--HQH-LNMQQHQPLTMQQPLGNQLP
410 420 430 440 450
470 480 490 500 510 520
pF1KB8 QLQQHQMHQQIQQQMQQQHFQHHMQQHLQQQQQHLQQQINQQQLQQQLQQRLQLQQLQHM
. : .:. .::
CCDS34 MQVQSALHSPTMQQGFTLQPDYQTIINPTSTAAQVVTQAMEYVRSGCRNPPPQPVDWNND
460 470 480 490 500 510
>>CCDS32043.1 TOX4 gene_id:9878|Hs108|chr14 (621 aa)
initn: 872 init1: 566 opt: 1169 Z-score: 642.8 bits: 129.0 E(32554): 1.7e-29
Smith-Waterman score: 1196; 42.6% identity (66.8% similar) in 549 aa overlap (31-554:5-536)
10 20 30 40 50 60
pF1KB8 MDVRFYPAAAGDPASLDFAQCLGYYGYSKFGNNNNYMNMAEANNAFFAASEQTFHTPSLG
:.:.::.... .. :....: ::::::::
CCDS32 MEFPGGNDNYLTITGPSHPFLSGAE-TFHTPSLG
10 20 30
70 80 90 100 110 120
pF1KB8 DEEFEIPPITPPPESDPALGMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLDLPSITISR
:::::::::. .:::.:.. ::. :. :.:: :: . :. :. :.::.: . ...
CCDS32 DEEFEIPPIS--LDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMP-VGMTH
40 50 60 70 80 90
130 140 150 160 170 180
pF1KB8 NLVEQDGVLHSSGLHMDQSHTQVSQYRQDPSLIMRSIVHMTDAARSGVMPPAQLTTINQS
.:.:: : : :.:: :: .:. .:: .: . . : ::: . ::.: .:::::.::
CCDS32 GLMEQGGGLLSGGLTMDLDHSIGTQYSANPPVTID--VPMTDMT-SGLMGHSQLTTIDQS
100 110 120 130 140
190 200 210 220 230
pF1KB8 QLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRAI-GEKRAAPDSGK
.::.::::.:::... . :: :.::::.::..:. ... : . ..: .. ..::
CCDS32 ELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLHEDGVEDFRRQLPSQKTVVVEAGK
150 160 170 180 190 200
240 250 260 270 280 290
pF1KB8 KPKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDSLGEE
: :.:::.::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS32 KQKAPKKRKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDSLGEE
210 220 230 240 250 260
300 310 320 330 340
pF1KB8 QKQVYKRKTEAAKKEYLKALAAYRASLVSKAAAESAE------AQT-----IRSVQQ---
:::::::::::::::::::::::. . .:..:..: .:: . .:.
CCDS32 QKQVYKRKTEAAKKEYLKALAAYKDNQECQATVETVELDPAPPSQTPSPPPMATVDPASP
270 280 290 300 310 320
350 360 370 380 390 400
pF1KB8 ---TLASTNLTSSLLLNTPLSQHGTVSASPQTL-QQSLPRSIAPKPLTMRLPMNQIVTSV
.. :. :...:. ::.. . .:: . : .. . : : . :: .:.
CCDS32 APASIEPPALSPSIVVNSTLSSYVANQASSGAGGQPNITKLIITKQM---LP-----SSI
330 340 350 360 370
410 420 430 440 450
pF1KB8 TIA-ANMPSNIGAPLISSMGTTMVGSAPSTQVSPSVQTQQ-HQMQLQQQQQQQQQQMQQM
:.. ..: . : : ...: : . :.. .. ..:: :.: . :: .:.
CCDS32 TMSQGGMVTVIPATVVTSRGLQL-GQTSTATIQPSQQAQIVTRSVLQAAAAAAAAASMQL
380 390 400 410 420 430
460 470 480 490 500 510
pF1KB8 QQQQLQQHQMHQQIQQQMQQQHFQHHMQQHLQQQQQHLQQQ--INQQQLQQQLQ-QRLQL
.:: ..:. : ::: .. :: .:: :. :: :: :: . . :
CCDS32 PPPRLQPPPLQQMPQPPTQQQVTILQQPPPLQAMQQPPPQKVRINLQQQPPPLQIKSVPL
440 450 460 470 480 490
520 530 540 550 560 570
pF1KB8 QQLQHMQHQSQPSPRQHSPVASQITSP-IPAIGSPQPASQQHQSQIQSQTQTQVLSQVSI
:. :: : . :: . .:: .. .:.: .
CCDS32 PTLK-MQTTLVPPTVESSPERPMNNSPEAHTVEAPSPETICEMITDVVPEVESPSQMDVE
500 510 520 530 540 550
pF1KB8 F
CCDS32 LVSGSPVALSPQPRCVRSGCENPPIVSKDWDNEYCSNECVVKHCRDVFLAWVASRNSNTV
560 570 580 590 600 610
>>CCDS13324.1 TOX2 gene_id:84969|Hs108|chr20 (464 aa)
initn: 817 init1: 738 opt: 829 Z-score: 463.7 bits: 95.4 E(32554): 1.7e-19
Smith-Waterman score: 959; 43.3% identity (66.7% similar) in 432 aa overlap (39-453:1-410)
10 20 30 40 50 60
pF1KB8 AAGDPASLDFAQCLGYYGYSKFGNNNNYMNMAEANNAFFAASEQTFHTPSLGDEEFEIPP
:...: ....: ::.. : ..:..::::
CCDS13 MSDGNPELLSTS-QTYNGQSENNEDYEIPP
10 20
70 80 90 100 110 120
pF1KB8 ITPPPESDPAL-GMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLDLPSITISRNLVEQDG
:::: .:.: . : ...: : .: . : . :..:::.: .: :.. ::.
CCDS13 ITPPNLPEPSLLHLGDHEASYHSLCHGLTPNG--LLPAYSYQAMDLPAIMVS-NMLAQDS
30 40 50 60 70 80
130 140 150 160 170 180
pF1KB8 VLHSSGLHMDQS--HTQVSQY---RQDPSLIMRSIVHMTDAARSGVMPPAQLTTINQSQL
: :. : : :..:. : : : :. : .: .......::::
CCDS13 HLLSGQLPTIQEMVHSEVAAYDSGRPGP-LLGRP-----------AMLASHMSALSQSQL
90 100 110 120 130
190 200 210 220 230 240
pF1KB8 SAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRAIGEKRAAPDSGKKPK
.:.:. .:. :.:::::.::::::::::: .::... . :::: . : ::: :
CCDS13 ISQMGIR---SSIAHSSPSPPGSKSATPSPSSSTQEEESEVHFKISGEKRPSADPGKKAK
140 150 160 170 180 190
250 260 270 280 290 300
pF1KB8 TPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDSLGEEQKQ
.:::::::::::::::::::::::::::::::::::.::::.::::::::::::::::::
CCDS13 NPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPSATFGDVSKIVASMWDSLGEEQKQ
200 210 220 230 240 250
310 320 330 340 350
pF1KB8 VYKRKTEAAKKEYLKALAAYRASLVSKAAAESAEAQTIRSV--------QQTLASTNLTS
.::::::::::::::::::::::::::.. ...:... .. .: . . .
CCDS13 AYKRKTEAAKKEYLKALAAYRASLVSKSSPDQGETKSTQANPPAKMLPPKQPMYAMPGLA
260 270 280 290 300 310
360 370 380 390 400 410
pF1KB8 SLLLNTPLSQHGTVSASPQTLQQSL-PRSIAPKPLTMRLPMNQIVTSVTIAA--NMPSNI
:.: . : : .::: .: ..: .:. : . : .. : :. ..: .
CCDS13 SFLTPSDL-QAFRSGASPASLARTLGSKSLLPGLSASPPPPPSFPLSPTLHQQLSLPPHA
320 330 340 350 360 370
420 430 440 450 460 470
pF1KB8 GAPLISSMGTTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQQQQMQQMQQQQLQQHQMHQ
. :.: . .. ::. : :. .. : :. .. . :
CCDS13 QGALLSP--PVSMSPAPQPPVLPTPMALQVQLAMSPSPPGPQDFPHISEFPSSSGSCSPG
380 390 400 410 420
480 490 500 510 520 530
pF1KB8 QIQQQMQQQHFQHHMQQHLQQQQQHLQQQINQQQLQQQLQQRLQLQQLQHMQHQSQPSPR
CCDS13 PSNPTSSGDWDSSYPSGECGISTCSLLPRDKSLYLT
430 440 450 460
>>CCDS46603.1 TOX2 gene_id:84969|Hs108|chr20 (506 aa)
initn: 855 init1: 738 opt: 829 Z-score: 463.1 bits: 95.4 E(32554): 1.8e-19
Smith-Waterman score: 1029; 42.6% identity (66.2% similar) in 477 aa overlap (1-453:1-452)
10 20 30 40 50
pF1KB8 MDVRFYPAA-------AGDPASLDFAQCLGYYGYSKFGNNNNYMNMAEANNAFFAASEQT
::::.::.: ...::.: : :: .:: ... :..:...: ....: ::
CCDS46 MDVRLYPSAPAVGARPGAEPAGLAH---LDYYHGGKFDGDSAYVGMSDGNPELLSTS-QT
10 20 30 40 50
60 70 80 90 100 110
pF1KB8 FHTPSLGDEEFEIPPITPPPESDPAL-GMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLD
.. : ..:..:::::::: .:.: . : ...: : .: . : . :..:
CCDS46 YNGQSENNEDYEIPPITPPNLPEPSLLHLGDHEASYHSLCHGLTPNG--LLPAYSYQAMD
60 70 80 90 100 110
120 130 140 150 160
pF1KB8 LPSITISRNLVEQDGVLHSSGLHMDQS--HTQVSQY---RQDPSLIMRSIVHMTDAARSG
::.: .: :.. ::. : :. : : :..:. : : : :. :
CCDS46 LPAIMVS-NMLAQDSHLLSGQLPTIQEMVHSEVAAYDSGRPGP-LLGRP-----------
120 130 140 150 160
170 180 190 200 210 220
pF1KB8 VMPPAQLTTINQSQLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRA
.: .......:::: .:.:. .:. :.:::::.::::::::::: .::... .
CCDS46 AMLASHMSALSQSQLISQMGIR---SSIAHSSPSPPGSKSATPSPSSSTQEEESEVHFKI
170 180 190 200 210
230 240 250 260 270 280
pF1KB8 IGEKRAAPDSGKKPKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSK
:::: . : ::: :.:::::::::::::::::::::::::::::::::::.::::.:::
CCDS46 SGEKRPSADPGKKAKNPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPSATFGDVSK
220 230 240 250 260 270
290 300 310 320 330 340
pF1KB8 IVASMWDSLGEEQKQVYKRKTEAAKKEYLKALAAYRASLVSKAAAESAEAQTIRSV----
:::::::::::::::.::::::::::::::::::::::::::.. ...:... ..
CCDS46 IVASMWDSLGEEQKQAYKRKTEAAKKEYLKALAAYRASLVSKSSPDQGETKSTQANPPAK
280 290 300 310 320 330
350 360 370 380 390
pF1KB8 ----QQTLASTNLTSSLLLNTPLSQHGTVSASPQTLQQSL-PRSIAPKPLTMRLPMNQIV
.: . . .:.: . : : .::: .: ..: .:. : . : ..
CCDS46 MLPPKQPMYAMPGLASFLTPSDL-QAFRSGASPASLARTLGSKSLLPGLSASPPPPPSFP
340 350 360 370 380 390
400 410 420 430 440 450
pF1KB8 TSVTIAA--NMPSNIGAPLISSMGTTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQQQQM
: :. ..: . . :.: . .. ::. : :. .. : :. .. . :
CCDS46 LSPTLHQQLSLPPHAQGALLSP--PVSMSPAPQPPVLPTPMALQVQLAMSPSPPGPQDFP
400 410 420 430 440 450
460 470 480 490 500 510
pF1KB8 QQMQQQQLQQHQMHQQIQQQMQQQHFQHHMQQHLQQQQQHLQQQINQQQLQQQLQQRLQL
CCDS46 HISEFPSSSGSCSPGPSNPTSSGDWDSSYPSGECGISTCSLLPRDKSLYLT
460 470 480 490 500
>>CCDS42875.1 TOX2 gene_id:84969|Hs108|chr20 (488 aa)
initn: 652 init1: 573 opt: 658 Z-score: 372.4 bits: 78.6 E(32554): 2e-14
Smith-Waterman score: 819; 39.9% identity (62.2% similar) in 439 aa overlap (21-428:34-451)
10 20 30 40 50
pF1KB8 MDVRFYPAAAGDPASLDFAQCLGYYGYSKFGNNNNYMNMAEANNAFFAAS
:.. .:: ... :..:...: ....:
CCDS42 TRTEAVAGAFSRCLGFCGMRLGLLLLARHWCIAGVFPQKFDGDSAYVGMSDGNPELLSTS
10 20 30 40 50 60
60 70 80 90 100
pF1KB8 EQTFHTPSLGDEEFEIPPITPPPESDPAL-GMPDVLLPFQALSDPLPSQGSEFTPQFPPQ
::.. : ..:..:::::::: .:.: . : ...: : .: . : . :
CCDS42 -QTYNGQSENNEDYEIPPITPPNLPEPSLLHLGDHEASYHSLCHGLTPNG--LLPAYSYQ
70 80 90 100 110 120
110 120 130 140 150 160
pF1KB8 SLDLPSITISRNLVEQDGVLHSSGLHMDQS--HTQVSQY---RQDPSLIMRSIVHMTDAA
..:::.: .: :.. ::. : :. : : :..:. : : : :. :
CCDS42 AMDLPAIMVS-NMLAQDSHLLSGQLPTIQEMVHSEVAAYDSGRPGP-LLGRP--------
130 140 150 160 170
170 180 190 200 210 220
pF1KB8 RSGVMPPAQLTTINQSQLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEA
.: .......:::: .:.:. .:. :.:::::.::::::::::: .::...
CCDS42 ---AMLASHMSALSQSQLISQMGIR---SSIAHSSPSPPGSKSATPSPSSSTQEEESEVH
180 190 200 210 220
230 240 250 260 270 280
pF1KB8 NRAIGEKRAAPDSGKKPKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGE
. :::: . : ::: :.:::::::::::::::::::::::::::::::::::.::::.
CCDS42 FKISGEKRPSADPGKKAKNPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPSATFGD
230 240 250 260 270 280
290 300 310 320 330
pF1KB8 VSKIVASMWDSLGEEQKQVYK-----RKTEA---AK----KEYLKALAAYRASLVSKAAA
:::::::::::::::::: ..:.: :: :. . :. . . :. .
CCDS42 VSKIVASMWDSLGEEQKQSSPDQGETKSTQANPPAKMLPPKQPMYAMPGLASFLTPSDLQ
290 300 310 320 330 340
340 350 360 370 380
pF1KB8 ESAEAQTIRSVQQTLASTNLTSSLLLNTP------LSQ--HGTVSASP--QTLQQSLPRS
. . :. .::.: .: .: . : :: : .: : : : : :
CCDS42 AFRSGASPASLARTLGSKSLLPGLSASPPPPPSFPLSPTLHQQLSLPPHAQGALLSPPVS
350 360 370 380 390 400
390 400 410 420 430
pF1KB8 IAPKPLTMRLPMNQIVTSVTIAANMPSNIGA---PLISSMGTTMVGSAPSTQVSPSVQTQ
..: : :: . .. .: .: . :: : : :: . .. . .:
CCDS42 MSPAPQPPVLP-TPMALQVQLAMS-PSPPGPQDFPHISEFPSSSGSCSPGPSNPTSSGDW
410 420 430 440 450 460
440 450 460 470 480 490
pF1KB8 QHQMQLQQQQQQQQQQMQQMQQQQLQQHQMHQQIQQQMQQQHFQHHMQQHLQQQQQHLQQ
CCDS42 DSSYPSGECGISTCSLLPRDKSLYLT
470 480
576 residues in 1 query sequences
18511270 residues in 32554 library sequences
Tcomplib [36.3.4 Apr, 2011] (8 proc)
start: Fri Nov 4 15:02:57 2016 done: Fri Nov 4 15:02:58 2016
Total Scan time: 4.240 Total Display time: 0.070
Function used was FASTA [36.3.4 Apr, 2011]