GENSCAN 1.0 Date run: 6-Nov-116 Time: 01:58:53 Sequence gi568815596r:63994705_64208310 : 213606 bp : 39.48% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 908 1515 608 1 2 21 2 267 0.715 6.23 1.02 Term + 6425 6656 232 2 1 115 46 190 0.824 12.46 1.03 PlyA + 7735 7740 6 1.05 2.00 Prom + 34732 34771 40 -2.35 2.01 Init + 44496 44566 71 2 2 50 116 31 0.187 2.67 2.02 Intr + 53712 53800 89 0 2 100 69 40 0.271 2.00 2.03 Term + 57255 57379 125 1 2 72 36 146 0.851 5.37 2.04 PlyA + 57508 57513 6 1.05 3.04 PlyA - 58492 58487 6 1.05 3.03 Term - 60865 60665 201 1 0 72 42 191 0.188 9.31 3.02 Intr - 69213 69165 49 2 1 96 81 19 0.009 -0.24 3.01 Init - 72550 72435 116 1 2 74 76 73 0.423 4.53 3.00 Prom - 81574 81535 40 -4.25 4.05 PlyA - 81650 81645 6 -0.45 4.04 Term - 83932 83723 210 1 0 64 47 207 0.782 10.51 4.03 Intr - 84581 84436 146 0 2 89 94 -31 0.523 -3.32 4.02 Intr - 85234 85043 192 2 0 62 90 60 0.762 2.14 4.01 Init - 86468 86315 154 2 1 49 80 103 0.674 5.79 4.00 Prom - 97866 97827 40 -7.15 5.09 PlyA - 97977 97972 6 1.05 5.08 Term - 100564 99998 567 1 0 108 42 390 0.996 29.83 5.07 Intr - 101609 101481 129 2 0 33 74 110 0.669 4.07 5.06 Intr - 101906 101709 198 2 0 93 50 147 0.998 10.03 5.05 Intr - 105795 105694 102 0 0 92 67 27 0.551 0.45 5.04 Intr - 111595 111459 137 2 2 69 45 92 0.284 2.37 5.03 Intr - 127989 127868 122 2 2 11 88 142 0.034 5.72 5.02 Intr - 138886 138786 101 1 2 50 93 66 0.414 1.29 5.01 Init - 149949 149377 573 0 0 83 131 247 0.804 23.66 5.00 Prom - 153959 153920 40 -6.65 6.05 PlyA - 154496 154491 6 1.05 6.04 Term - 166575 166465 111 0 0 85 38 69 0.117 -0.82 6.03 Intr - 172932 172792 141 0 0 -1 64 121 0.053 0.43 6.02 Intr - 186771 186541 231 0 0 37 89 174 0.718 9.45 6.01 Init - 194800 194600 201 1 0 89 109 125 0.978 13.67 6.00 Prom - 203860 203821 40 -7.05 7.03 PlyA - 204191 204186 6 1.05 7.02 Term - 206638 206526 113 2 2 51 49 103 0.249 0.44 7.01 Intr - 209869 209674 196 1 1 86 33 146 0.233 6.97 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:63994705_64208310|GENSCAN_predicted_peptide_1|279_aa MLNSQQGIDQKLANQINDLRQSVIWLGDRVVSLEHHMQMQCDWNTSDFRITLYSYNETDH SWEMVKGHLLGREDNLSLDMTKLNKQIFEVSQAHLSIVPGAEALDQVAENLSGLNPTTWI KSIGGSTVVNFGIMFPCLIGLSLVCWTSQRILRQNRENKPSSPWHIYIKRKGEMLQEVRD PERRDRLKPWQKNINCEDFMDNLQRSLTLWPPPPKAHREYCQATVNVHKIQGLFSQLVVK AARPGTHPSVSSPLVQGRSRNAVQEPRPGIRDPKSPLGV >gi568815596r:63994705_64208310|GENSCAN_predicted_CDS_1|840_bp atgttgaattctcaacaaggcattgaccaaaaattggctaatcaaattaatgatttaaga cagtctgttatttggcttggagatcgggtagtgagtctcgaacatcacatgcaaatgcag tgcgactggaatacttcagatttccgtatcaccctgtattcctataacgagactgatcat tcatgggaaatggtcaaaggacaccttctgggtagggaagataatttatcattggacatg actaaattaaataaacaaatttttgaagtctctcaagctcatttatccattgtgcctgga gctgaggcgttagatcaggtggcagaaaatctttctggactaaaccccacaacttggatt aagtctattgggggctccactgtagtaaatttcggaattatgtttccctgtttaatcggc ttgtctttagtgtgctggaccagtcaaagaatcctgcgtcaaaatcgagagaacaagcct tcatcaccatggcacatttatataaaaagaaagggagagatgttgcaggaagtcagggac cctgaacggagggaccggctgaagccatggcagaagaacataaattgtgaagatttcatg gacaatttgcagaggagcctcaccctgtggccaccaccaccaaaggcccatagggagtac tgccaggctactgtcaatgttcacaagatacaagggctcttcagtcagcttgtggtgaag gctgccaggcctgggactcacccttcagtaagctcccctctggtccagggcaggtccaga aatgctgtccaagagccaaggcctggaatcagggaccccaagagcccactcggtgtttga >gi568815596r:63994705_64208310|GENSCAN_predicted_peptide_2|94_aa MRTLMTSPRKSNKETKRWNTGEKRGGKDFSSTLLGLKFESVKQTDNTQINRRKGHKTLVP ERVLSHTQKEGMRAQRRQEESRRTGLAGFPHSGH >gi568815596r:63994705_64208310|GENSCAN_predicted_CDS_2|285_bp atgagaacattgatgacatctcccagaaagtcaaacaaggaaacaaagagatggaacact ggagaaaaaaggggagggaaagacttttcctctaccctcctaggtttaaaatttgagtct gtgaaacaaactgacaatacgcagattaaccggagaaaaggacataagaccctggttcca gagagggtcctgtcccacacccagaaggaaggaatgcgtgctcagagacgccaagaagaa tctagacggacaggccttgctgggtttccccactccggccattag >gi568815596r:63994705_64208310|GENSCAN_predicted_peptide_3|121_aa MQAQLLLTSSLYNHEGSQSQNEAHVEGARVEKEKKQGPCLQVSTQMSLSSQTTPQVHEEI HTNMSITGLFVVQESSGQSGSPPLGATDMLRGSDDHDMIMTHNMESYAAVRRNALAVHKG T >gi568815596r:63994705_64208310|GENSCAN_predicted_CDS_3|366_bp atgcaggctcagttgctgctgacctccagcctgtataaccacgaggggagccaatcccag aatgaagctcacgttgagggtgcgagagtagagaaagagaagaaacaaggcccttgcctt caggtctcaactcagatgtcactaagcagccaaaccacaccccaggttcatgaagaaata cacacaaacatgtctatcacaggattgtttgtggtacaagagagttcggggcagtctggg agtccaccactaggggccacagatatgttaagaggatcagatgatcatgacatgatcatg acacataatatggaatcttatgcagcagttagaagaaatgcgctagctgtgcataaagga acatga >gi568815596r:63994705_64208310|GENSCAN_predicted_peptide_4|233_aa MKEKKPITKTKGKSCQKEKEIENQKRSIAHKPRDKNHLRRDGLRCHITEEQTACQASKPK LSHHIPCDLHVHIQMAGSCLNLSYLCTPIPYFRALTSLRLNPLFLCPDPFPAFLEDPCDL SPPPQSAPCQAELGPNSSSASALPPYNLFITSPPHTRSSLQFHSELATSARNLATGPRNA HSAEFLLSRVPSVQDPTENRTVQLTGSHFQSPWNSGPRLSDSFSDLLGLAAED >gi568815596r:63994705_64208310|GENSCAN_predicted_CDS_4|702_bp atgaaagaaaagaaacccataacaaagaccaaggggaagagctgtcagaaagaaaaagag atagaaaaccagaagagatcaatagcacacaagccaagagacaagaatcatttaaggaga gatggtttgcgatgtcacattactgaggagcaaactgcctgtcaggcctctaagcccaag ctaagccatcatatcccctgtgacctgcacgtacacatccagatggctggttcctgcctt aacttatcttatctctgcaccccaatcccttatttccgtgccctgacctctttgcgcctc aaccccttatttctgtgccccgacccctttcctgcttttctggaggacccatgtgacctc tcccctcctccccagtctgctccttgccaggctgagctaggtcccaattcttcctcagcc tctgctcttccaccctataatctttttatcacctcccctcctcacacccggtccagttta cagtttcattccgagcttgccacaagtgccagaaatctggccactgggccaaggaatgcc cacagcgcggaattcctcctaagccgtgtcccatctgtgcaggaccccactgaaaatcgg actgttcaactcaccggcagccacttccagagcccctggaactctggcccaaggctctct gactccttctcagatcttcttggcttagcagctgaagattga >gi568815596r:63994705_64208310|GENSCAN_predicted_peptide_5|642_aa MVTVGGGGPNLALEHAQCRASLVGCWTTMNPFFLQPENYNSQEPSRCGDTAFRCGPARSP PSGPAPRLDPSPPCECMSVTQRRCEAEGPSGGAAAASTSSSGSSPSPAVRSNWSPALRER TTKQPQRLSLRPRLLSARLRGQLPLAGLGWRARRAAAERRAVCSRSPRTTLHPLASGLGG RSGIPATSNCPVSNSITHTLFGTNPWQIQLVAVSVSVSVSQTSFRIANAICVNLHCGMEE RDLRVTSGVELKEEIIYAPAEGVAREVKLDKVSCCCCLEKSFELQERNVGLKSKIQKWDE SCNSEDSSEGAAISNKDQHSISYTLSRAQTVVVEYTHDSNTDMFQIGRSTESPIDFVVTD TVPGSQSNSDTQSVQSTISRFACRIICERNPPFTARIYAAGFDSSKNIFLGEKAAKWKTS DGQMDGLTTNGVLVMHPRNGFTEDSKPGIWREISVEIETNQLQDGSLIDLCGATLLWRTA EGLSHTPTVKHLEALRQEINAARPQCPVGFNTLAFPSMKRKDVVDEKQPWVYLNCGHVHG YHNWGNKEERDGKDRECPMCRSVGPYVPLWLGCEAGFYVDAGPPTHAFSPCGHVCSEKTT AYWSQIPLPHGTHTFHAACPFCAHQLAGEQGYIRLIFQGPLD >gi568815596r:63994705_64208310|GENSCAN_predicted_CDS_5|1929_bp atggtaacggtgggagggggcggtccaaacctcgccctcgagcatgctcagtgcagagcg agcctggtagggtgctggacaacaatgaatcccttttttcttcaaccggagaactacaat tcccaagagccttctcggtgcggcgacaccgccttccgctgtggccctgcccggtcccct ccctcaggccccgccccccggctagacccctcccctccttgcgagtgtatgtcagtgacc cagaggcggtgtgaggcggagggaccgtcggggggcgccgccgccgcctccactagcagc agcggcagcagccctagtcccgcggtgcggtcgaattggtccccagccctccgggagcgc accacaaagcagccccaacgcctctccctgcgtccgcggctcctcagcgctcggctccgt ggtcaacttcccctcgctgggctcggctggcgggcgcggagggcagcggcggaacggcgg gctgtctgctcgcgctccccgcgcacaacacttcaccctctcgcctcgggtctcgggggc cgctctgggatcccggccaccagcaattgtccggtcagtaactccatcactcacaccctc tttggaaccaacccatggcagatacagctagtagctgtatctgtatcggtatctgtatcc caaaccagtttcagaatagcaaatgccatctgcgtgaatttgcattgtgggatggaggaa cgtgacctaagagtcaccagtggagtggaactaaaggaagaaataatttatgcacctgct gagggtgtggctagagaagtgaagctggacaaggtttcatgctgttgctgcttagaaaag agctttgaactacaggaacgcaatgtgggactaaagagcaagatccagaagtgggatgag tcatgcaactcagaggacagtagtgaaggggcagcaataagcaacaaagaccagcatagc atatcatatactttatctcgggcccagactgtggtggttgaatatactcatgacagcaac accgatatgtttcagattggccggtcgactgaaagccccattgattttgtagtaactgac acggttcctggaagtcaaagtaattctgatacacagtcagtacaaagcactatatcaaga tttgcctgcagaatcatatgtgaacggaatcctccctttacagcacggatttatgctgca ggatttgactcatcaaaaaacatctttcttggggagaaggctgccaaatggaagacatca gatggacagatggatggcttgaccactaatggtgttcttgtgatgcatccacgcaatggg ttcacagaagactccaagcctggaatatggagagaaatatcggtggaaattgaaaccaat cagttacaagatggctcgttaattgacctctgtggtgcaacattgttatggcgtactgca gaaggcctttcccacactcctaccgtgaagcatttagaagctttaagacaggaaatcaat gcagcacgacctcagtgccctgtagggttcaacacactagcatttcctagtatgaagagg aaagacgttgtagatgaaaaacaaccatgggtatatctaaactgcggccatgtacatggc tatcataactggggaaacaaagaagaacgtgatggaaaagatcgtgaatgtcctatgtgt aggtctgttggtccctatgttcctctgtggcttggatgtgaagctggattttatgtggac gccggccctccaacccatgcgtttagcccgtgtgggcatgtgtgttcagaaaagacaact gcctattggtcccagatcccacttcctcatggtactcatacttttcatgcagcctgtccc ttttgtgcacatcagttggctggtgaacaaggctacatcagacttatttttcaaggacct ctagactaa >gi568815596r:63994705_64208310|GENSCAN_predicted_peptide_6|227_aa MKAVGGSTATGAIQKLTGGQFRLWVLNCGAWTSSSGGITGNLSEMQILRSHPRPAESETL GVGPSLQKIEFSTIDNVQKLQGICRCGLMEQHPITPITHGDFTHRPRERHGINLSRVGET KASNTASTQEFSLALCSTYPQNIQDIEKCKQHTIASTTPISKEDFLEKMALELSPKGLAG VVQENATYKNMLREQNVYHDYHYGTNHQYSNIDGESVGNGVKHALKM >gi568815596r:63994705_64208310|GENSCAN_predicted_CDS_6|684_bp atgaaggctgtgggtggatccacggccacgggggcgattcagaaactgacaggaggccag ttccgtctgtgggtcttgaactgtggtgcctggaccagcagcagcggcggcatcactggg aacttgtcggaaatgcaaattctcaggtcccaccccagacccgctgaatcagaaactcta ggggtggggcccagcctccagaaaattgaattcagtacaatcgataacgttcagaagctt caggggatctgcagatgtggactcatggagcagcatcctatcactcctatcactcatgga gattttacacataggcctagggaaagacatggtatcaacttgtctcgtgtgggtgagaca aaagcatccaatactgcgtcgacccaagagttctcactggctctatgtagcacatatccc cagaatattcaggacatagagaaatgcaagcagcataccatagcatccactacacccatc tccaaggaagatttcctggagaagatggccttggagctgagtcctaaaggactagcagga gttgttcaggagaatgccacatacaaaaacatgcttagagaacagaatgtatatcatgat tatcattacgggaccaatcatcaatacagtaatatagatggggaaagtgtaggaaatgga gttaaacatgcattaaagatgtaa >gi568815596r:63994705_64208310|GENSCAN_predicted_peptide_7|102_aa AVCPLLLINLQFPESRDSWETGSSKMPVRKSPDEEQQQVSGQQCGQKGEIPPSALKNSPS GKLSEGRNAEGVANEHTDWKECYLSFTMWSSEEHWPLAKSQK >gi568815596r:63994705_64208310|GENSCAN_predicted_CDS_7|309_bp gctgtttgtccccttctcttgataaatcttcagttcccagagagcagagactcttgggaa acaggaagtagcaaaatgccagtgagaaaatccccagatgaagagcagcaacaagtctcc gggcaacagtgtggtcaaaagggggaaattcccccatcagcgctcaaaaacagtccctca ggaaagcttagtgaagggagaaatgcagaaggagtagcaaatgagcacactgactggaaa gaatgctatttgtctttcactatgtggagttcagaagagcactggccattggcgaagtcc cagaagtga