GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:54:45 Sequence gi568815584f:51889640_52104513 : 214874 bp : 40.80% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6229 6340 112 0 1 52 93 64 0.264 3.78 1.02 Intr + 33042 33161 120 1 0 93 84 56 0.055 5.25 1.03 Intr + 53426 53503 78 0 0 73 94 64 0.318 4.10 1.04 Intr + 61011 61126 116 1 2 106 100 101 0.376 12.35 1.05 Term + 61698 61703 6 2 0 113 37 0 0.157 -5.71 1.06 PlyA + 62063 62068 6 1.05 2.06 PlyA - 62125 62120 6 1.05 2.05 Term - 62762 62571 192 2 0 99 39 172 0.396 9.84 2.04 Intr - 74974 74907 68 2 2 80 42 96 0.108 1.91 2.03 Intr - 76951 76715 237 2 0 47 10 213 0.029 6.06 2.02 Intr - 80357 80333 25 2 1 112 57 10 0.028 -3.02 2.01 Init - 82852 82694 159 1 0 72 77 132 0.073 10.38 2.00 Prom - 85430 85391 40 -3.55 3.03 PlyA - 88364 88359 6 1.05 3.02 Term - 89698 89412 287 2 2 22 40 170 0.187 0.18 3.01 Init - 93780 93630 151 0 1 77 75 122 0.535 10.05 3.00 Prom - 96571 96532 40 -8.35 4.00 Prom + 97282 97321 40 -5.85 4.01 Init + 99347 99386 40 1 1 70 72 72 0.735 4.20 4.02 Intr + 99807 100061 255 1 0 73 100 96 0.720 5.79 4.03 Intr + 101678 101802 125 0 2 79 90 136 0.995 12.28 4.04 Intr + 104084 104183 100 1 1 87 88 107 0.992 9.36 4.05 Intr + 108855 108941 87 1 0 70 101 116 0.997 10.12 4.06 Intr + 110069 110157 89 0 2 64 82 115 0.950 7.27 4.07 Intr + 112159 112227 69 0 0 49 94 79 0.892 3.06 4.08 Intr + 114555 114603 49 2 1 108 107 43 0.999 5.53 4.09 Term + 114723 114877 155 1 2 48 47 155 0.880 4.50 4.10 PlyA + 115036 115041 6 1.05 5.24 PlyA - 115065 115060 6 1.05 5.23 Term - 115857 115847 11 1 2 112 42 0 0.898 -4.82 5.22 Intr - 116210 116098 113 1 2 115 98 71 0.952 9.90 5.21 Intr - 117021 116898 124 1 1 77 83 76 0.988 4.72 5.20 Intr - 118328 118171 158 1 2 53 90 108 0.999 6.23 5.19 Intr - 121408 121237 172 2 1 105 44 284 0.999 23.78 5.18 Intr - 122044 121915 130 1 1 18 76 216 0.947 12.75 5.17 Intr - 124817 124630 188 0 2 150 17 177 0.768 15.29 5.16 Intr - 125636 125415 222 0 0 55 60 274 0.899 18.48 5.15 Intr - 126567 126457 111 1 0 46 38 106 0.598 0.83 5.14 Intr - 129655 129422 234 2 0 11 99 214 0.929 11.54 5.13 Intr - 130539 130420 120 1 0 97 119 67 0.327 10.25 5.12 Intr - 137705 137562 144 0 0 116 105 151 0.985 18.93 5.11 Intr - 139211 139083 129 0 0 79 65 150 0.974 11.55 5.10 Intr - 140051 139908 144 0 0 75 86 49 0.767 2.73 5.09 Intr - 144009 143881 129 1 0 9 81 91 0.476 0.25 5.08 Intr - 149338 149108 231 2 0 77 116 180 0.963 16.52 5.07 Intr - 151212 151012 201 1 0 110 47 154 0.991 11.84 5.06 Intr - 152711 152466 246 0 0 83 109 368 0.991 34.91 5.05 Intr - 153292 153143 150 2 0 59 86 119 0.807 8.01 5.04 Intr - 164682 163940 743 2 2 72 96 741 0.862 63.46 5.03 Intr - 170717 170485 233 2 2 136 89 143 0.923 14.85 5.02 Intr - 178524 178219 306 0 0 141 107 213 0.966 24.22 5.01 Init - 179355 179128 228 0 0 100 102 393 0.997 38.32 5.00 Prom - 192538 192499 40 -3.55 6.04 PlyA - 193754 193749 6 1.05 6.03 Term - 194889 194627 263 1 2 40 29 300 0.799 14.00 6.02 Intr - 210511 210371 141 2 0 74 61 110 0.889 6.30 6.01 Init - 212447 212252 196 2 1 71 55 88 0.677 2.94 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 46154 46076 79 2 1 85 48 79 0.977 4.88 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:51889640_52104513|GENSCAN_predicted_peptide_1|143_aa MGVVLHMLSLPAAIHVRRDLLILVFHHDCEASPAMWNYFSEGAQFFLKSKKQKSLSVGVC LRHNGLVLIKGPFILFAEKVKSTAGHASGVVLWQPQSQCGPYKVFLKDLSSTPMASNNTA SIAQARKLVEQLKMEANIDRIKT >gi568815584f:51889640_52104513|GENSCAN_predicted_CDS_1|432_bp atgggagttgtcctgcacatgctctctttgcctgctgccatccacgtaagacgtgactta ctcatcctggtcttccaccatgactgtgaggcttccccagctatgtggaactacttttca gaaggagctcagtttttcctaaagagtaagaaacaaaaaagcctttctgtgggagtctgt ttaagacataacggactcgtgcttataaaaggccctttcattctgtttgctgaaaaagtg aagtctactgctgggcatgcctctggggtggtcctctggcagccccaaagccagtgtggg ccctacaaagtgtttctgaaagatctatccagcactccgatggccagcaacaacaccgcc agcatagcacaagccaggaagctggtagagcagcttaagatggaagccaatatcgacagg ataaagacctaa >gi568815584f:51889640_52104513|GENSCAN_predicted_peptide_2|226_aa MNWKRGFGWQDGISQVGPLKGLESQVVEIGHDVVDKEHHEPENLGFLNMVKDEAVPKMYR PIGHQICSCLGHLERETKTTNTGVWYSQSRPKALEPNVLASNPCSMKQDTQPLKASASSS AKKKVDDTDLTGLLPASLGLEQIEATFWTEAAPSASIPDEGDMQESEKEEARKEEKRKMA KEHGKVALVISVMKKEEELRLLKEEQELIKSCGPLLGIKFYKGDLG >gi568815584f:51889640_52104513|GENSCAN_predicted_CDS_2|681_bp atgaactggaaaagagggtttggatggcaagatgggattagccaagtggggcctctcaag ggccttgagagccaggtagttgagattggacatgatgtggttgacaaagagcaccatgaa ccagagaaccttgggtttctgaacatggtgaaagatgaggctgtacccaagatgtacaga cctataggccatcaaatctgcagctgccttggacacctggaaagggagacaaaaacaacc aacactggagtctggtacagtcagtcaaggcccaaggctttagagcccaatgtcctggca tcaaatccttgctccatgaagcaagatactcagcctctaaaagcttctgcttcctcatct gcaaagaagaaagttgacgacactgaccttacaggattgttgccagcaagtctgggacta gaacagatagaagcaacattctggacagaggctgctccatctgccagcatccctgatgaa ggtgacatgcaagagagtgaaaaagaagaagcaagaaaagaggagaagaggaagatggca aaggagcacggtaaggtcgctttagtgatctcagtcatgaagaaagaagaagaattaagg ctgttgaaagaagagcaggagctgattaaatcctgtggtccattactaggcatcaaattt tacaaaggagatttaggataa >gi568815584f:51889640_52104513|GENSCAN_predicted_peptide_3|145_aa MEEVTSKPAIKYELLPDFPMGKGQSMNKSMSTKAPCQQGFASVKLKEDQGEHLQTKDAMM WRLLRTDQQRALLRKKNDVQDCSGQGWQIPHRRHHLAVIIVKNGKGKWISTEHEPEQPWS DSGANCGKPSGEPSQSGIQARISSL >gi568815584f:51889640_52104513|GENSCAN_predicted_CDS_3|438_bp atggaggaggttacatctaagccagctatcaaatatgagctattaccggatttcccaatg gggaaagggcagagcatgaacaaaagcatgagcacaaaggccccttgtcagcaagggttt gcttctgtgaaactgaaggaagatcagggagagcatctgcagacgaaagacgcgatgatg tggcgccttctgagaactgaccagcagagggcgctcttgagaaagaagaacgatgtgcaa gattgttccggtcagggatggcaaattccacacaggaggcaccacttagctgtaattatt gtaaagaatggtaaaggcaagtggatttccacagaacatgaaccagaacagccatggagt gatagcggtgccaattgcgggaaacctagcggagaaccttcccaaagtggtattcaggct aggatttcaagcctctag >gi568815584f:51889640_52104513|GENSCAN_predicted_peptide_4|322_aa MAGIGDLLDAVTEAGAAIGGHLPAATAFKPRLAKAAVISERLSACPPSRRVAGACASRST SLLLSRPRPGGPEREAGTMFRRKLTALDYHNPAGFNCKDETEFRNFIVWLEDQKIRHYKI EDRGNLRNIHSSDWPKFFEKYLRDVNCPFKIQDRQEAIDWLLGLAVRLEYGDNAEKYKDL VPDNSKTADNATKNAEPLINLDVNNPDFKAGVMALANLLQIQRHDDYLVMLKAIRILVQE RLTQDAVAKANQTKEGLPVALDKHILGFDTGDAVLNEAAQILRLLHIEELRELQTKINEA IVAVQAIIADPKTDHRLGKVGR >gi568815584f:51889640_52104513|GENSCAN_predicted_CDS_4|969_bp atggcaggaattggtgacctactggatgctgtgaccgaggccggagccgcgattggtggg catttgccggcggccaccgcttttaagccacgattggcgaaggccgccgtcatttcggag cgactcagcgcctgcccgccctctcgccgcgtcgccggtgcctgcgcctcccgctccacc tcgcttcttctctcccggccgaggcccgggggaccagagcgagaagcggggaccatgttc cgacgcaagttgacggctctcgactaccacaaccccgccggcttcaactgcaaagatgaa acagaatttagaaacttcatcgtttggcttgaagaccagaaaatcaggcactacaagatt gaagacagagggaatttaagaaacatccacagcagcgactggcccaagttctttgaaaag tatctcagagatgttaactgtcctttcaagattcaagatcgacaagaagctattgactgg cttcttggtttagctgttagacttgaatatggagataatgctgaaaaatacaaggattta gtacctgataattcaaaaactgctgacaatgcaactaaaaatgcagaaccattgatcaat ttggatgtaaataatcctgattttaaggctggtgtgatggctttggctaacctgcttcag attcagcgtcatgatgattacctggtaatgcttaaggcaattcggattttggttcaggag cgcctgacacaggatgcagttgctaaggcaaatcaaacaaaagagggcttacctgttgct ttagacaaacatattcttggttttgacacaggagatgcagttcttaatgaagctgctcaa attctgcgattgctgcacatagaggagctcagagagctacagacaaaaatcaacgaagcc atagtagctgttcaggcaattattgctgatccaaagacagaccacagactgggaaaagtt ggaagatga >gi568815584f:51889640_52104513|GENSCAN_predicted_peptide_5|1488_aa MEGDRVAGRPVLSSLPVLLLLPLLMLRAAALHPDELFPHGESWGDQLLQEGDDESSAVVK LANPLHFYEARFSNLYVGTNGIISTQDFPRETQYVDYDFPTDFPAIAPFLADIDTSHGRG RVLYREDTSPAVLGLAARYVRAGFPRSARFTPTHAFLATWEQVGAYEEVKRGALPSGELN TFQAVLASDGSDSYALFLYPANGLQFLGTRPKESYNVQLQLPARVGFCRGEADDLKSEGP YFSLTSTEQSVKNLYQLSNLGIPGVWAFHIGSTSPLDNVRPAAVGDLSAAHSSVPLGRSF SHATALESDYNEDNLDYYDVNEEEAEYLPGEPEEALNGHSSIDVSFQSKVDTKPLEGRIS PPDSDLSSPLHPTPTYWPFYPETESSTLDPHTKEGTSLGEVGGPDLKGQVEPWDERETRS PAPPEVDRDSLAPSWETPPPYPENGSIQPYPDGGPVPSEMDVPPAHPEEEIVLRSYPASG HTTPLSRGTYEVGLEDNIGSNTEVFTYNAANKETCEHNHRQCSRHAFCTDYATGFCCHCQ SKFYGNGKHCLPEGAPHRVNGKVSGHLHVGHTPVHFTDVDLHAYIVGNDGRAYTAISHIP QPAAQALLPLTPIGGLFGWLFALEKPGSENGFSLAGAAFTHDMEVTFYPGEETVRITQTA EGLDPENYLSIKTNIQGQVPYVSANFTAHISPYKELYHYSDSTVTSTSSRDYSLTFGAIN QTWSYRIHQNITYQVCRHAPRHPSFPTTQQLNVDRVFALYNDEERVLRFAVTNQIGPVKE LLSFSRGNDSCWNSCETAQLSTPDVGVVRLPRARMQNWGVFPKDSDPTPGNPCYDGSHMC DTTARCHPGTGVDYTCECASGYQGDGRNCVDENECATGFHRCGPNSVCINLPGSYRCECR SGYEFADDRHTCILITPPANPCEDGSHTCAPAGQARCVHHGGSTFSCACLPGYAGDGHQC TDVDECSENRCHPAATCYNTPGSFSCRCQPGYYGDGFQCIPDSTSSLTPCEQQQRHAQAQ YAYPGARFHIPQCDEQGNFLPLQCHGSTGFCWCVDPDGHEVPGTQTPPGSTPPHCGPSPA ESSQNNLYFGQSLNGTVEAAWEETALQPKPAGLQPWKPTQRPPTICERWRENLLEHYGGT PRDDQYVPQCDDLGHFIPLQCHGKSDFCWCVDKDGREVQGTRSQPGTTPACIPTVAPPMV RPTPRPDVTPPSVGTFLLYTQGQQIGYLPLNGTRLQKDAAKTLLSLHVKWISPGSIIVGI DYDCRERMVYWTDVAGRTISRAGLELGAEPETIVNSGLISPEGLAIDHIRRTMYWTDSVL DKIESALLDGSERKVLFYTDLVNPRAIAVDPIRGNLYWTDWNREAPKIETSSLDGENRRI LINTDIGLPNGLTFDPFSKLLCWADAGTKKLECTLPDGTGRRVIQNNLKYPFSIVSYADH FYHTDWRRDGVVSVNKHSGQFTDEYLPEQRSHLYGITAVYPYCPTGRK >gi568815584f:51889640_52104513|GENSCAN_predicted_CDS_5|4467_bp atggagggggaccgggtggccgggcggccggtgctgtcgtcgttaccagtgctactgctg ctgccgttgctaatgttgcgggccgcggcgctgcacccagacgagctcttcccacacggg gagtcgtggggggaccagctcctgcaggaaggcgacgacgaaagctcagccgtggtgaag ctggcgaatcccctgcacttctacgaagcccgattcagcaacctctacgtgggcaccaac ggcatcatctccactcaggacttccccagggaaacgcagtatgtggactatgatttcccc accgacttcccggccatcgccccttttctggcggacatcgacacgagccacggcagaggc cgagtcctgtaccgagaggacacctcccccgcagtgctgggcctggccgcccgctatgtg cgcgctggcttcccgcgctctgcgcgctttacccccacccacgccttcctggccacctgg gagcaggtaggcgcttacgaggaggtcaagcgcggggcgctgccctcgggagagctgaac actttccaggcagttttggcatctgatgggtctgatagctacgccctctttctttatcct gccaacggcctgcagttccttggaacccgccccaaagagtcttacaatgtccagcttcag cttccagctcgggtgggcttctgccgaggggaggctgatgatctgaagtcagaaggacca tatttcagcttgactagcactgaacagtctgtgaaaaatctctatcaactaagcaacctg gggatccctggagtgtgggctttccatatcggcagcacttccccgttggacaatgtcagg ccagctgcagttggagacctttccgctgcccactcttctgttcccctgggacgttccttc agccatgctacagccctggaaagtgactataatgaggacaatttggattactatgatgtg aatgaggaggaagctgaataccttccgggtgaaccagaggaggcattgaatggccacagc agcattgatgtttccttccaatccaaagtggatacaaagcctttagagggtaggatctcc cctccagattctgatctgtcctcccccttgcatccaacacctacttattggccattctat cctgaaacagaatcttccaccttggatcctcacaccaaagaaggaacatctctgggagag gtagggggcccagatttaaaaggccaagttgagccctgggatgagagagagaccagaagc ccagctccaccagaggtagacagagattcactggctccttcctgggaaaccccaccaccg taccccgaaaacggaagcatccagccctacccagatggagggccagtgccttcggaaatg gatgttcccccagctcatcctgaagaagaaattgttcttcgaagttaccctgcttcaggt cacactacacccttaagtcgagggacgtatgaggtgggactggaagacaacataggttcc aacaccgaggtcttcacgtataatgctgccaacaaggaaacctgtgaacacaaccacaga caatgctcccggcatgccttctgcacggactatgccactggcttctgctgccactgccaa tccaagttttatggaaatgggaagcactgtctgcctgaaggggcacctcaccgagtgaat gggaaagtgagtggccacctccacgtgggccatacacccgtgcacttcactgatgtggac ctgcatgcgtatatcgtgggcaatgatggcagagcctacacggccatcagccacatccca cagccagcagcccaggccctcctccccctcacaccaattggaggcctgtttggctggctc tttgctttagaaaaacctggctctgagaacggcttcagcctcgcaggtgctgcctttacc catgacatggaagttacattctacccgggagaggagacggttcgtatcactcaaactgct gagggacttgacccagagaactacctgagcattaagaccaacattcaaggccaggtgcct tacgtctcagcaaatttcacagcccacatctctccctacaaggagctgtaccactactcc gactccactgtgacctctacaagttccagagactactctctgacttttggtgcaatcaac caaacatggtcctaccgcatccaccagaacatcacttaccaggtgtgcaggcacgccccc agacacccgtccttccccaccacccagcagctgaacgtggaccgggtctttgccttgtat aatgacgaagaaagagtgcttagatttgctgtgaccaatcaaattggcccggtcaaagaa ctgctcagtttttccagagggaacgatagttgttggaattcatgtgaaacagcgcagctg tccacacctgatgtgggtgtggtgcgacttcccagggcgaggatgcagaactggggtgtc tttcccaaggattcagaccccactccggggaatccttgctatgatgggagccacatgtgt gacacaacagcacggtgccatccagggacaggtgtagattacacctgtgagtgcgcatct gggtaccagggagatggacggaactgtgtggatgaaaatgaatgtgcaactggctttcat cgctgtggccccaactctgtatgtatcaacttgcctggaagctacaggtgtgagtgccgg agtggttatgagtttgcagatgaccggcatacttgcatcttgatcaccccacctgccaac ccctgtgaggatggcagtcatacctgtgctcctgctgggcaggcccggtgtgttcaccat ggaggcagcacgttcagctgtgcctgcctgcctggttatgccggcgatgggcaccagtgc actgatgtagatgaatgctcagaaaacagatgtcaccctgcagctacctgctacaatact cctggttccttctcctgccgttgtcaacccggatattatggggatggatttcagtgcata cctgactccacctcaagcctgacaccctgtgaacaacagcagcgccatgcccaggcccag tatgcctaccctggggcccggttccacatcccccaatgcgacgagcagggcaacttcctg cccctacagtgtcatggcagcactggtttctgctggtgcgtggaccctgatggtcatgaa gttcctggtacccagactccacctggctccaccccgcctcactgtggaccatcaccagca gagtcttctcagaacaacctgtattttggacaaagcctcaatggcactgtggaagcagcc tgggaggaaactgctttgcaaccaaagccagcaggacttcaaccatggaagcccacccag aggcccccgaccatctgtgagcgctggagggaaaacctgctggagcactacggtggcacc ccccgggatgaccagtacgtgccccagtgcgatgacctgggccacttcatccccctgcag tgccacggaaagagcgacttctgctggtgtgtggacaaagatggcagagaggtgcagggc acccgctcccagccaggcaccacccctgcgtgtatacccaccgtcgctccacccatggtc cggcccacgccccggccagatgtgacccctccatctgtgggcaccttcctgctctatact cagggccagcagattggctacttacccctcaatggcaccaggcttcagaaggatgcagct aagaccctgctgtctctgcatgtaaagtggatttctcctggctccataatcgtgggaatt gattacgactgccgggagaggatggtgtactggacagatgttgctggacggacaatcagc cgtgctggtctggaactgggagcagagcctgagacgatcgtgaattcaggtctgataagc cctgaaggacttgccatagaccacatccgcagaacaatgtactggacggacagtgtcctg gataagatagagagcgccctgctggatggctctgagcgcaaggtcctcttctacacagat ctggtgaatccccgtgccatcgctgtggatccaatccgaggcaacttgtactggacagac tggaatagagaagctcctaaaattgaaacgtcatctttagatggagaaaacagaagaatt ctgatcaatacagacattggattgcccaatggcttaacctttgaccctttctctaaactg ctctgctgggcagatgcaggaaccaaaaaactggagtgtacactacctgatggaactgga cggcgtgtcattcaaaacaacctcaagtaccccttcagcatcgtaagctatgcagatcac ttctaccacacagactggaggagggatggtgttgtatcagtaaataaacatagtggccag tttactgatgagtatctcccagaacaacgatctcacctctacgggataactgcagtctac ccctactgcccaacaggaagaaagtaa >gi568815584f:51889640_52104513|GENSCAN_predicted_peptide_6|199_aa MNTWSTEELSRTVVEFVHCTRGCSPEGQRGLAKRLWLRLDSSGRGALSSHTKASSACKIM YPPGGEVSPRTVRYCLPCVATATATHVATTKRPAELFCPIAETTSKLSFICANAIKNVHD SWKKVKISTLTGVWKKLIPTLMDDFKELKTSVEEGAADVVETVRELELEGEPEDVTELLQ PHDKTSVNEKLLLMDEQRK >gi568815584f:51889640_52104513|GENSCAN_predicted_CDS_6|600_bp atgaatacctggagtactgaagaattatcccgtacagttgtggagtttgtgcactgcaca aggggttgcagcccggaaggccaacggggccttgcaaaacggctttggctcaggctggat tcctcaggaaggggagcactttcttctcatactaaggcatcgtctgcttgtaaaatcatg tacccacctggaggagaggtgtctcctcggactgtgaggtactgcttgccctgcgtggcc actgccacagccacacatgtggccaccaccaagaggccagctgagctgttctgtccaatt gctgagactacttctaagttgtcctttatctgtgcaaatgccattaagaatgttcatgat tcatggaagaaggtcaaaatatcaacattaacaggagtttggaagaagttgattccaacc ctcatggatgactttaaagagctcaagacttcagtggaggaaggagctgcagatgtggta gaaacagtaagagaactagaattagaaggggagcccgaagatgtgactgaattgctgcaa cctcatgataaaacttcagtgaatgagaagttacttcttatggatgagcaaagaaagtag