FASTA searches a protein or DNA sequence data bank
 36.3.4 Apr, 2011
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: pF1KB8732, 576 aa
  1>>>pF1KB8732 576 - 576 aa - 576 aa
Library: human.CCDS.faa
  18511270 residues in 32554 sequences

Statistics: Expectation_n fit: rho(ln(x))= 11.4114+/-0.00107; mu= -7.2284+/- 0.065
 mean_var=353.1654+/-71.111, 0's: 0 Z-trim(114.6): 102  B-trim: 16 in 1/53
 Lambda= 0.068247
 statistics sampled from 15057 (15152) to 15057 sequences
Algorithm: FASTA (3.7 Nov 2010) [optimized]
Parameters: BL50 matrix (15:-5), open/ext: -10/-2
 ktup: 2, E-join: 1 (0.757), E-opt: 0.2 (0.465), width:  16
 Scan time:  4.240

The best scores are:                                opt  bits E(32554)
CCDS54009.1 TOX3 gene_id:27324|Hs108|chr16  ( 576) 3798 387.8   2e-107
CCDS54008.1 TOX3 gene_id:27324|Hs108|chr16  ( 571) 3584 366.7 4.3e-101
CCDS34897.1 TOX gene_id:9760|Hs108|chr8     ( 526) 1320 143.8  5.1e-34
CCDS32043.1 TOX4 gene_id:9878|Hs108|chr14   ( 621) 1169 129.0  1.7e-29
CCDS13324.1 TOX2 gene_id:84969|Hs108|chr20  ( 464)  829  95.4  1.7e-19
CCDS46603.1 TOX2 gene_id:84969|Hs108|chr20  ( 506)  829  95.4  1.8e-19
CCDS42875.1 TOX2 gene_id:84969|Hs108|chr20  ( 488)  658  78.6    2e-14

>>CCDS54009.1 TOX3 gene_id:27324|Hs108|chr16            (576 aa)
 initn: 3798 init1: 3798 opt: 3798  Z-score: 2042.2  bits: 387.8 E(32554): 2e-107
Smith-Waterman score: 3798; 100.0% identity (100.0% similar) in 576 aa overlap (1-576:1-576)

               10        20        30        40        50        60
pF1KB8 MDVRFYPAAAGDPASLDFAQCLGYYGYSKFGNNNNYMNMAEANNAFFAASEQTFHTPSLG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 MDVRFYPAAAGDPASLDFAQCLGYYGYSKFGNNNNYMNMAEANNAFFAASEQTFHTPSLG
               10        20        30        40        50        60

               70        80        90       100       110       120
pF1KB8 DEEFEIPPITPPPESDPALGMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLDLPSITISR
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 DEEFEIPPITPPPESDPALGMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLDLPSITISR
               70        80        90       100       110       120

              130       140       150       160       170       180
pF1KB8 NLVEQDGVLHSSGLHMDQSHTQVSQYRQDPSLIMRSIVHMTDAARSGVMPPAQLTTINQS
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 NLVEQDGVLHSSGLHMDQSHTQVSQYRQDPSLIMRSIVHMTDAARSGVMPPAQLTTINQS
              130       140       150       160       170       180

              190       200       210       220       230       240
pF1KB8 QLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRAIGEKRAAPDSGKK
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 QLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRAIGEKRAAPDSGKK
              190       200       210       220       230       240

              250       260       270       280       290       300
pF1KB8 PKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDSLGEEQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDSLGEEQ
              250       260       270       280       290       300

              310       320       330       340       350       360
pF1KB8 KQVYKRKTEAAKKEYLKALAAYRASLVSKAAAESAEAQTIRSVQQTLASTNLTSSLLLNT
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 KQVYKRKTEAAKKEYLKALAAYRASLVSKAAAESAEAQTIRSVQQTLASTNLTSSLLLNT
              310       320       330       340       350       360

              370       380       390       400       410       420
pF1KB8 PLSQHGTVSASPQTLQQSLPRSIAPKPLTMRLPMNQIVTSVTIAANMPSNIGAPLISSMG
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 PLSQHGTVSASPQTLQQSLPRSIAPKPLTMRLPMNQIVTSVTIAANMPSNIGAPLISSMG
              370       380       390       400       410       420

              430       440       450       460       470       480
pF1KB8 TTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQQQQMQQMQQQQLQQHQMHQQIQQQMQQQ
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 TTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQQQQMQQMQQQQLQQHQMHQQIQQQMQQQ
              430       440       450       460       470       480

              490       500       510       520       530       540
pF1KB8 HFQHHMQQHLQQQQQHLQQQINQQQLQQQLQQRLQLQQLQHMQHQSQPSPRQHSPVASQI
       ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
CCDS54 HFQHHMQQHLQQQQQHLQQQINQQQLQQQLQQRLQLQQLQHMQHQSQPSPRQHSPVASQI
              490       500       510       520       530       540

              550       560       570
pF1KB8 TSPIPAIGSPQPASQQHQSQIQSQTQTQVLSQVSIF
       ::::::::::::::::::::::::::::::::::::
CCDS54 TSPIPAIGSPQPASQQHQSQIQSQTQTQVLSQVSIF
              550       560       570

>>CCDS54008.1 TOX3 gene_id:27324|Hs108|chr16            (571 aa)
 initn: 3466 init1: 3466 opt: 3584  Z-score: 1928.4  bits: 366.7 E(32554): 4.3e-101
Smith-Waterman score: 3584; 99.6% identity (99.6% similar) in 550 aa overlap (27-576:23-571)

10 20 30 40 50 60 pF1KB8 MDVRFYPAAAGDPASLDFAQCLGYYGYSKFGNNNNYMNMAEANNAFFAASEQTFHTPSLG : ::::::::::::::::::::::: :::::::: CCDS54 MKCQPRSGARRIEERLHYLITTYLKFGNNNNYMNMAEANNAFFAASE-TFHTPSLG 10 20 30 40 50 70 80 90 100 110 120 pF1KB8 DEEFEIPPITPPPESDPALGMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLDLPSITISR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 DEEFEIPPITPPPESDPALGMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLDLPSITISR 60 70 80 90 100 110 130 140 150 160 170 180 pF1KB8 NLVEQDGVLHSSGLHMDQSHTQVSQYRQDPSLIMRSIVHMTDAARSGVMPPAQLTTINQS :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 NLVEQDGVLHSSGLHMDQSHTQVSQYRQDPSLIMRSIVHMTDAARSGVMPPAQLTTINQS 120 130 140 150 160 170 190 200 210 220 230 240 pF1KB8 QLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRAIGEKRAAPDSGKK :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 QLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRAIGEKRAAPDSGKK 180 190 200 210 220 230 250 260 270 280 290 300 pF1KB8 PKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDSLGEEQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 PKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDSLGEEQ 240 250 260 270 280 290 310 320 330 340 350 360 pF1KB8 KQVYKRKTEAAKKEYLKALAAYRASLVSKAAAESAEAQTIRSVQQTLASTNLTSSLLLNT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 KQVYKRKTEAAKKEYLKALAAYRASLVSKAAAESAEAQTIRSVQQTLASTNLTSSLLLNT 300 310 320 330 340 350 370 380 390 400 410 420 pF1KB8 PLSQHGTVSASPQTLQQSLPRSIAPKPLTMRLPMNQIVTSVTIAANMPSNIGAPLISSMG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 PLSQHGTVSASPQTLQQSLPRSIAPKPLTMRLPMNQIVTSVTIAANMPSNIGAPLISSMG 360 370 380 390 400 410 430 440 450 460 470 480 pF1KB8 TTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQQQQMQQMQQQQLQQHQMHQQIQQQMQQQ :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 TTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQQQQMQQMQQQQLQQHQMHQQIQQQMQQQ 420 430 440 450 460 470 490 500 510 520 530 540 pF1KB8 HFQHHMQQHLQQQQQHLQQQINQQQLQQQLQQRLQLQQLQHMQHQSQPSPRQHSPVASQI 
:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS54 HFQHHMQQHLQQQQQHLQQQINQQQLQQQLQQRLQLQQLQHMQHQSQPSPRQHSPVASQI 480 490 500 510 520 530 550 560 570 pF1KB8 TSPIPAIGSPQPASQQHQSQIQSQTQTQVLSQVSIF :::::::::::::::::::::::::::::::::::: CCDS54 TSPIPAIGSPQPASQQHQSQIQSQTQTQVLSQVSIF 540 550 560 570 >>CCDS34897.1 TOX gene_id:9760|Hs108|chr8 (526 aa) initn: 924 init1: 641 opt: 1320 Z-score: 724.2 bits: 143.8 E(32554): 5.1e-34 Smith-Waterman score: 1328; 49.2% identity (69.6% similar) in 494 aa overlap (1-476:1-464) 10 20 30 40 50 pF1KB8 MDVRFYP-----AAAGDPASLDFAQCLGYYGYSKFGNNNNYMNMAEANNAFFAASEQTFH ::::::: ::: : : . :: : .:: ..: ::.:.: .. . :: :.. CCDS34 MDVRFYPPPAQPAAAPDAPCLGPSPCLDPYYCNKFDGENMYMSMTEPSQDYVPAS-QSYP 10 20 30 40 50 60 70 80 90 100 110 pF1KB8 TPSLGDEEFEIPPITPPPESDPAL-GMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLDLP ::: .:.:.::::::: : .: . .: ...: :. .: . : : ::..::: CCDS34 GPSLESEDFNIPPITPPSLPDHSLVHLNEVESGYHSLCHPMNHNG--LLP-FHPQNMDLP 60 70 80 90 100 110 120 130 140 150 160 170 pF1KB8 SITISRNLVEQDGVLHSSGLHM--DQSHTQVSQYRQDPSLI-MRSIVHMTDAARS-GVMP ::.: :.. :::.: :... . : . . .:: . :.. :: . .: .. :.:: CCDS34 EITVS-NMLGQDGTLLSNSISVMPDIRNPEGTQYSSHPQMAAMRPRGQPADIRQQPGMMP 120 130 140 150 160 170 180 190 200 210 220 pF1KB8 PAQLTTINQSQLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRAIG- .:::::::::::::::::.::...::.:::::.:::::::::::..:...:.... : CCDS34 HGQLTTINQSQLSAQLGLNMGGSNVPHNSPSPPGSKSATPSPSSSVHEDEGDDTSKINGG 180 190 200 210 220 230 230 240 250 260 270 280 pF1KB8 EKRAAPDSGKKPKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIV ::: : : :::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS34 EKRPASDMGKKPKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIV 240 250 260 270 280 290 290 300 310 320 330 340 pF1KB8 ASMWDSLGEEQKQVYKRKTEAAKKEYLKALAAYRASLVSKAAAESAEAQTIRSVQQTLAS :::::.::::::::::.::::::::::: :::::::::::. .: ....: . : .. 
CCDS34 ASMWDGLGEEQKQVYKKKTEAAKKEYLKQLAAYRASLVSKSYSEPVDVKTSQPPQLINSK 300 310 320 330 340 350 350 360 370 380 390 400 pF1KB8 TNLT-------SSLLLNTPLSQHGTVSASPQTLQQSLPRSIAPKPLTMRLPMNQIVTSVT .. :.: :.. :. .. ... ::::.::::: ::. ..:. CCDS34 PSVFHGPSQAHSALYLSSHYHQQPGMNPHLTAMHPSLPRNIAPKP------NNQMPVTVS 360 370 380 390 400 410 420 430 440 450 460 pF1KB8 IAANMPSNIGAPLISSMGTTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQQQQMQQMQQQ :: :: .. : : :.:: . .:: ...::.: .:: : CCDS34 IA-NM--AVSPP-------------PPLQISPPL--HQH-LNMQQHQPLTMQQPLGNQLP 410 420 430 440 450 470 480 490 500 510 520 pF1KB8 QLQQHQMHQQIQQQMQQQHFQHHMQQHLQQQQQHLQQQINQQQLQQQLQQRLQLQQLQHM . : .:. .:: CCDS34 MQVQSALHSPTMQQGFTLQPDYQTIINPTSTAAQVVTQAMEYVRSGCRNPPPQPVDWNND 460 470 480 490 500 510 >>CCDS32043.1 TOX4 gene_id:9878|Hs108|chr14 (621 aa) initn: 872 init1: 566 opt: 1169 Z-score: 642.8 bits: 129.0 E(32554): 1.7e-29 Smith-Waterman score: 1196; 42.6% identity (66.8% similar) in 549 aa overlap (31-554:5-536) 10 20 30 40 50 60 pF1KB8 MDVRFYPAAAGDPASLDFAQCLGYYGYSKFGNNNNYMNMAEANNAFFAASEQTFHTPSLG :.:.::.... .. :....: :::::::: CCDS32 MEFPGGNDNYLTITGPSHPFLSGAE-TFHTPSLG 10 20 30 70 80 90 100 110 120 pF1KB8 DEEFEIPPITPPPESDPALGMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLDLPSITISR :::::::::. .:::.:.. ::. :. :.:: :: . :. :. :.::.: . ... CCDS32 DEEFEIPPIS--LDSDPSLAVSDVVGHFDDLADPSSSQDGSFSAQYGVQTLDMP-VGMTH 40 50 60 70 80 90 130 140 150 160 170 180 pF1KB8 NLVEQDGVLHSSGLHMDQSHTQVSQYRQDPSLIMRSIVHMTDAARSGVMPPAQLTTINQS .:.:: : : :.:: :: .:. .:: .: . . : ::: . ::.: .:::::.:: CCDS32 GLMEQGGGLLSGGLTMDLDHSIGTQYSANPPVTID--VPMTDMT-SGLMGHSQLTTIDQS 100 110 120 130 140 190 200 210 220 230 pF1KB8 QLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRAI-GEKRAAPDSGK .::.::::.:::... . :: :.::::.::..:. ... : . ..: .. 
..:: CCDS32 ELSSQLGLSLGGGTILPPAQSPEDRLSTTPSPTSSLHEDGVEDFRRQLPSQKTVVVEAGK 150 160 170 180 190 200 240 250 260 270 280 290 pF1KB8 KPKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDSLGEE : :.:::.:::::::::::::::::::::::::::::::::::::::::::::::::::: CCDS32 KQKAPKKRKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDSLGEE 210 220 230 240 250 260 300 310 320 330 340 pF1KB8 QKQVYKRKTEAAKKEYLKALAAYRASLVSKAAAESAE------AQT-----IRSVQQ--- :::::::::::::::::::::::. . .:..:..: .:: . .:. CCDS32 QKQVYKRKTEAAKKEYLKALAAYKDNQECQATVETVELDPAPPSQTPSPPPMATVDPASP 270 280 290 300 310 320 350 360 370 380 390 400 pF1KB8 ---TLASTNLTSSLLLNTPLSQHGTVSASPQTL-QQSLPRSIAPKPLTMRLPMNQIVTSV .. :. :...:. ::.. . .:: . : .. . : : . :: .:. CCDS32 APASIEPPALSPSIVVNSTLSSYVANQASSGAGGQPNITKLIITKQM---LP-----SSI 330 340 350 360 370 410 420 430 440 450 pF1KB8 TIA-ANMPSNIGAPLISSMGTTMVGSAPSTQVSPSVQTQQ-HQMQLQQQQQQQQQQMQQM :.. ..: . : : ...: : . :.. .. ..:: :.: . :: .:. CCDS32 TMSQGGMVTVIPATVVTSRGLQL-GQTSTATIQPSQQAQIVTRSVLQAAAAAAAAASMQL 380 390 400 410 420 430 460 470 480 490 500 510 pF1KB8 QQQQLQQHQMHQQIQQQMQQQHFQHHMQQHLQQQQQHLQQQ--INQQQLQQQLQ-QRLQL .:: ..:. : ::: .. :: .:: :. :: :: :: . . : CCDS32 PPPRLQPPPLQQMPQPPTQQQVTILQQPPPLQAMQQPPPQKVRINLQQQPPPLQIKSVPL 440 450 460 470 480 490 520 530 540 550 560 570 pF1KB8 QQLQHMQHQSQPSPRQHSPVASQITSP-IPAIGSPQPASQQHQSQIQSQTQTQVLSQVSI :. :: : . :: . .:: .. .:.: . CCDS32 PTLK-MQTTLVPPTVESSPERPMNNSPEAHTVEAPSPETICEMITDVVPEVESPSQMDVE 500 510 520 530 540 550 pF1KB8 F CCDS32 LVSGSPVALSPQPRCVRSGCENPPIVSKDWDNEYCSNECVVKHCRDVFLAWVASRNSNTV 560 570 580 590 600 610 >>CCDS13324.1 TOX2 gene_id:84969|Hs108|chr20 (464 aa) initn: 817 init1: 738 opt: 829 Z-score: 463.7 bits: 95.4 E(32554): 1.7e-19 Smith-Waterman score: 959; 43.3% identity (66.7% similar) in 432 aa overlap (39-453:1-410) 10 20 30 40 50 60 pF1KB8 AAGDPASLDFAQCLGYYGYSKFGNNNNYMNMAEANNAFFAASEQTFHTPSLGDEEFEIPP :...: ....: ::.. 
: ..:..:::: CCDS13 MSDGNPELLSTS-QTYNGQSENNEDYEIPP 10 20 70 80 90 100 110 120 pF1KB8 ITPPPESDPAL-GMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLDLPSITISRNLVEQDG :::: .:.: . : ...: : .: . : . :..:::.: .: :.. ::. CCDS13 ITPPNLPEPSLLHLGDHEASYHSLCHGLTPNG--LLPAYSYQAMDLPAIMVS-NMLAQDS 30 40 50 60 70 80 130 140 150 160 170 180 pF1KB8 VLHSSGLHMDQS--HTQVSQY---RQDPSLIMRSIVHMTDAARSGVMPPAQLTTINQSQL : :. : : :..:. : : : :. : .: .......:::: CCDS13 HLLSGQLPTIQEMVHSEVAAYDSGRPGP-LLGRP-----------AMLASHMSALSQSQL 90 100 110 120 130 190 200 210 220 230 240 pF1KB8 SAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRAIGEKRAAPDSGKKPK .:.:. .:. :.:::::.::::::::::: .::... . :::: . : ::: : CCDS13 ISQMGIR---SSIAHSSPSPPGSKSATPSPSSSTQEEESEVHFKISGEKRPSADPGKKAK 140 150 160 170 180 190 250 260 270 280 290 300 pF1KB8 TPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSKIVASMWDSLGEEQKQ .:::::::::::::::::::::::::::::::::::.::::.:::::::::::::::::: CCDS13 NPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPSATFGDVSKIVASMWDSLGEEQKQ 200 210 220 230 240 250 310 320 330 340 350 pF1KB8 VYKRKTEAAKKEYLKALAAYRASLVSKAAAESAEAQTIRSV--------QQTLASTNLTS .::::::::::::::::::::::::::.. ...:... .. .: . . . CCDS13 AYKRKTEAAKKEYLKALAAYRASLVSKSSPDQGETKSTQANPPAKMLPPKQPMYAMPGLA 260 270 280 290 300 310 360 370 380 390 400 410 pF1KB8 SLLLNTPLSQHGTVSASPQTLQQSL-PRSIAPKPLTMRLPMNQIVTSVTIAA--NMPSNI :.: . : : .::: .: ..: .:. : . : .. : :. ..: . CCDS13 SFLTPSDL-QAFRSGASPASLARTLGSKSLLPGLSASPPPPPSFPLSPTLHQQLSLPPHA 320 330 340 350 360 370 420 430 440 450 460 470 pF1KB8 GAPLISSMGTTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQQQQMQQMQQQQLQQHQMHQ . :.: . .. ::. : :. .. : :. .. . 
: CCDS13 QGALLSP--PVSMSPAPQPPVLPTPMALQVQLAMSPSPPGPQDFPHISEFPSSSGSCSPG 380 390 400 410 420 480 490 500 510 520 530 pF1KB8 QIQQQMQQQHFQHHMQQHLQQQQQHLQQQINQQQLQQQLQQRLQLQQLQHMQHQSQPSPR CCDS13 PSNPTSSGDWDSSYPSGECGISTCSLLPRDKSLYLT 430 440 450 460 >>CCDS46603.1 TOX2 gene_id:84969|Hs108|chr20 (506 aa) initn: 855 init1: 738 opt: 829 Z-score: 463.1 bits: 95.4 E(32554): 1.8e-19 Smith-Waterman score: 1029; 42.6% identity (66.2% similar) in 477 aa overlap (1-453:1-452) 10 20 30 40 50 pF1KB8 MDVRFYPAA-------AGDPASLDFAQCLGYYGYSKFGNNNNYMNMAEANNAFFAASEQT ::::.::.: ...::.: : :: .:: ... :..:...: ....: :: CCDS46 MDVRLYPSAPAVGARPGAEPAGLAH---LDYYHGGKFDGDSAYVGMSDGNPELLSTS-QT 10 20 30 40 50 60 70 80 90 100 110 pF1KB8 FHTPSLGDEEFEIPPITPPPESDPAL-GMPDVLLPFQALSDPLPSQGSEFTPQFPPQSLD .. : ..:..:::::::: .:.: . : ...: : .: . : . :..: CCDS46 YNGQSENNEDYEIPPITPPNLPEPSLLHLGDHEASYHSLCHGLTPNG--LLPAYSYQAMD 60 70 80 90 100 110 120 130 140 150 160 pF1KB8 LPSITISRNLVEQDGVLHSSGLHMDQS--HTQVSQY---RQDPSLIMRSIVHMTDAARSG ::.: .: :.. ::. : :. : : :..:. : : : :. : CCDS46 LPAIMVS-NMLAQDSHLLSGQLPTIQEMVHSEVAAYDSGRPGP-LLGRP----------- 120 130 140 150 160 170 180 190 200 210 220 pF1KB8 VMPPAQLTTINQSQLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEANRA .: .......:::: .:.:. .:. :.:::::.::::::::::: .::... . CCDS46 AMLASHMSALSQSQLISQMGIR---SSIAHSSPSPPGSKSATPSPSSSTQEEESEVHFKI 170 180 190 200 210 230 240 250 260 270 280 pF1KB8 IGEKRAAPDSGKKPKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGEVSK :::: . : ::: :.:::::::::::::::::::::::::::::::::::.::::.::: CCDS46 SGEKRPSADPGKKAKNPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPSATFGDVSK 220 230 240 250 260 270 290 300 310 320 330 340 pF1KB8 IVASMWDSLGEEQKQVYKRKTEAAKKEYLKALAAYRASLVSKAAAESAEAQTIRSV---- :::::::::::::::.::::::::::::::::::::::::::.. ...:... .. CCDS46 IVASMWDSLGEEQKQAYKRKTEAAKKEYLKALAAYRASLVSKSSPDQGETKSTQANPPAK 280 290 300 310 320 330 350 360 370 380 390 pF1KB8 ----QQTLASTNLTSSLLLNTPLSQHGTVSASPQTLQQSL-PRSIAPKPLTMRLPMNQIV .: . . .:.: . : : .::: .: ..: .:. : . : .. 
CCDS46 MLPPKQPMYAMPGLASFLTPSDL-QAFRSGASPASLARTLGSKSLLPGLSASPPPPPSFP 340 350 360 370 380 390 400 410 420 430 440 450 pF1KB8 TSVTIAA--NMPSNIGAPLISSMGTTMVGSAPSTQVSPSVQTQQHQMQLQQQQQQQQQQM : :. ..: . . :.: . .. ::. : :. .. : :. .. . : CCDS46 LSPTLHQQLSLPPHAQGALLSP--PVSMSPAPQPPVLPTPMALQVQLAMSPSPPGPQDFP 400 410 420 430 440 450 460 470 480 490 500 510 pF1KB8 QQMQQQQLQQHQMHQQIQQQMQQQHFQHHMQQHLQQQQQHLQQQINQQQLQQQLQQRLQL CCDS46 HISEFPSSSGSCSPGPSNPTSSGDWDSSYPSGECGISTCSLLPRDKSLYLT 460 470 480 490 500 >>CCDS42875.1 TOX2 gene_id:84969|Hs108|chr20 (488 aa) initn: 652 init1: 573 opt: 658 Z-score: 372.4 bits: 78.6 E(32554): 2e-14 Smith-Waterman score: 819; 39.9% identity (62.2% similar) in 439 aa overlap (21-428:34-451) 10 20 30 40 50 pF1KB8 MDVRFYPAAAGDPASLDFAQCLGYYGYSKFGNNNNYMNMAEANNAFFAAS :.. .:: ... :..:...: ....: CCDS42 TRTEAVAGAFSRCLGFCGMRLGLLLLARHWCIAGVFPQKFDGDSAYVGMSDGNPELLSTS 10 20 30 40 50 60 60 70 80 90 100 pF1KB8 EQTFHTPSLGDEEFEIPPITPPPESDPAL-GMPDVLLPFQALSDPLPSQGSEFTPQFPPQ ::.. : ..:..:::::::: .:.: . : ...: : .: . : . : CCDS42 -QTYNGQSENNEDYEIPPITPPNLPEPSLLHLGDHEASYHSLCHGLTPNG--LLPAYSYQ 70 80 90 100 110 120 110 120 130 140 150 160 pF1KB8 SLDLPSITISRNLVEQDGVLHSSGLHMDQS--HTQVSQY---RQDPSLIMRSIVHMTDAA ..:::.: .: :.. ::. : :. : : :..:. : : : :. : CCDS42 AMDLPAIMVS-NMLAQDSHLLSGQLPTIQEMVHSEVAAYDSGRPGP-LLGRP-------- 130 140 150 160 170 170 180 190 200 210 220 pF1KB8 RSGVMPPAQLTTINQSQLSAQLGLNLGGASMPHTSPSPPASKSATPSPSSSINEEDADEA .: .......:::: .:.:. .:. :.:::::.::::::::::: .::... CCDS42 ---AMLASHMSALSQSQLISQMGIR---SSIAHSSPSPPGSKSATPSPSSSTQEEESEVH 180 190 200 210 220 230 240 250 260 270 280 pF1KB8 NRAIGEKRAAPDSGKKPKTPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPNATFGE . :::: . : ::: :.:::::::::::::::::::::::::::::::::::.::::. CCDS42 FKISGEKRPSADPGKKAKNPKKKKKKDPNEPQKPVSAYALFFRDTQAAIKGQNPSATFGD 230 240 250 260 270 280 290 300 310 320 330 pF1KB8 VSKIVASMWDSLGEEQKQVYK-----RKTEA---AK----KEYLKALAAYRASLVSKAAA :::::::::::::::::: ..:.: :: :. . :. . . :. . 
CCDS42 VSKIVASMWDSLGEEQKQSSPDQGETKSTQANPPAKMLPPKQPMYAMPGLASFLTPSDLQ 290 300 310 320 330 340 340 350 360 370 380 pF1KB8 ESAEAQTIRSVQQTLASTNLTSSLLLNTP------LSQ--HGTVSASP--QTLQQSLPRS . . :. .::.: .: .: . : :: : .: : : : : : CCDS42 AFRSGASPASLARTLGSKSLLPGLSASPPPPPSFPLSPTLHQQLSLPPHAQGALLSPPVS 350 360 370 380 390 400 390 400 410 420 430 pF1KB8 IAPKPLTMRLPMNQIVTSVTIAANMPSNIGA---PLISSMGTTMVGSAPSTQVSPSVQTQ ..: : :: . .. .: .: . :: : : :: . .. . .: CCDS42 MSPAPQPPVLP-TPMALQVQLAMS-PSPPGPQDFPHISEFPSSSGSCSPGPSNPTSSGDW 410 420 430 440 450 460 440 450 460 470 480 490 pF1KB8 QHQMQLQQQQQQQQQQMQQMQQQQLQQHQMHQQIQQQMQQQHFQHHMQQHLQQQQQHLQQ CCDS42 DSSYPSGECGISTCSLLPRDKSLYLT 470 480 576 residues in 1 query sequences 18511270 residues in 32554 library sequences Tcomplib [36.3.4 Apr, 2011] (8 proc) start: Fri Nov 4 15:02:57 2016 done: Fri Nov 4 15:02:58 2016 Total Scan time: 4.240 Total Display time: 0.070 Function used was FASTA [36.3.4 Apr, 2011]