GENSCAN 1.0 Date run: 6-Nov-116 Time: 12:03:41 Sequence gi568815578f:2209656_2440578 : 230923 bp : 44.72% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 26990 27116 127 1 1 87 98 18 0.787 3.02 1.02 Intr + 27438 27557 120 1 0 74 51 116 0.679 6.97 1.03 Intr + 45448 45720 273 2 0 122 63 84 0.795 6.81 1.04 Term + 55739 56073 335 0 2 -6 42 241 0.096 4.87 1.05 PlyA + 56082 56087 6 1.05 2.00 Prom + 56841 56880 40 -7.16 2.01 Sngl + 58686 59339 654 2 0 47 49 247 0.928 12.88 2.02 PlyA + 59714 59719 6 1.05 3.00 Prom + 60472 60511 40 -2.46 3.01 Init + 83441 83471 31 1 1 111 93 57 0.338 6.86 3.02 Intr + 86376 86415 40 1 1 81 105 7 0.324 -1.02 3.03 Intr + 86794 86865 72 1 0 26 103 86 0.284 2.42 3.04 Intr + 95971 96223 253 1 1 102 49 179 0.334 12.84 3.05 Intr + 96413 96507 95 1 2 128 37 -16 0.352 -3.94 3.06 Intr + 99973 100175 203 1 2 106 82 152 0.814 15.33 3.07 Intr + 100523 100762 240 0 0 114 78 197 0.978 18.82 3.08 Intr + 101356 101474 119 2 2 32 98 157 0.936 11.28 3.09 Intr + 103243 103371 129 0 0 48 101 107 0.996 8.89 3.10 Intr + 107413 107590 178 0 1 35 82 210 0.870 14.59 3.11 Intr + 107695 107830 136 2 1 63 115 187 0.986 18.53 3.12 Intr + 116194 116297 104 1 2 70 88 40 0.941 2.02 3.13 Intr + 118465 118710 246 2 0 69 97 552 0.999 51.63 3.14 Intr + 122347 122655 309 2 0 81 92 404 0.998 36.38 3.15 Intr + 125461 125618 158 2 2 97 80 295 0.992 29.33 3.16 Intr + 130199 130332 134 1 2 118 72 230 0.995 23.84 3.17 Term + 130779 130926 148 0 1 64 49 181 0.967 9.17 3.18 PlyA + 131396 131401 6 1.05 4.03 PlyA - 132702 132697 6 1.05 4.02 Term - 139128 139110 19 2 1 123 53 33 0.029 1.09 4.01 Init - 158559 158513 47 0 2 78 121 -8 0.350 1.65 4.00 Prom - 158813 158774 40 -0.96 5.03 PlyA - 159470 159465 6 1.05 5.02 Term - 161176 160955 222 1 0 36 48 172 0.124 4.92 5.01 Init - 165456 165325 132 0 0 80 92 76 0.320 5.43 5.00 Prom - 166939 166900 40 -5.96 6.00 Prom + 178606 178645 40 -4.66 6.01 Init + 179495 179629 135 1 0 97 103 11 0.568 3.64 6.02 Intr + 180848 180983 136 1 1 93 102 4 0.799 2.44 6.03 Intr + 184797 184970 174 1 0 103 93 250 0.999 26.91 6.04 Intr + 185539 185773 235 2 1 125 54 206 0.503 17.65 6.05 Intr + 186804 187018 215 0 2 26 63 302 0.361 19.86 6.06 Intr + 188234 188391 158 0 2 -14 81 278 0.690 16.63 6.07 Intr + 189906 190083 178 2 1 91 80 294 0.997 28.39 6.08 Intr + 190651 190789 139 2 1 94 52 217 0.988 18.22 6.09 Intr + 193742 193845 104 2 2 97 107 76 0.989 10.22 6.10 Intr + 193926 194168 243 1 0 99 116 414 0.932 42.77 6.11 Intr + 207577 207918 342 2 0 91 115 347 0.993 33.10 6.12 Intr + 210150 210212 63 1 0 94 115 -10 0.579 0.99 6.13 Intr + 220791 220945 155 1 2 115 55 207 0.725 19.89 6.14 Intr + 221239 221372 134 0 2 82 91 111 0.999 10.24 6.15 Intr + 222835 222925 91 1 1 112 63 73 0.100 7.20 6.16 Intr + 225360 225512 153 2 0 49 91 75 0.162 4.17 6.17 Term + 228543 228707 165 2 0 122 48 70 0.286 4.32 6.18 PlyA + 229104 229109 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 43896 43962 67 2 1 58 70 92 0.915 3.63 S.002 Term + 222835 222988 154 1 1 112 45 189 0.885 14.39 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:2209656_2440578|GENSCAN_predicted_peptide_1|284_aa MRPGMLDGQRAGTLLAAQGDGGRNVFEEDEALISLVQSSQCQASFSNHFLAKMFVAYNML QSKQGTLWKPEGEVRLAAVCQSEKCIGDVGDDEITSRINILEGASESCEYPNNKLFRRAY PVSLALDCVSTSNENKIEKEEPGIQPNLISHTKKRKLRPGEIRPPTLTRPFYQERNSINI NKKVIYNKTPFVGHQHQRPKVDKTTKMGRNQSRKAENSKNQSASSPPKDHSSSPAMEQSW MKNDFDELTEVGFRRSVITNFSELKEDVRTHCKEAKNLEKRLDE >gi568815578f:2209656_2440578|GENSCAN_predicted_CDS_1|855_bp atgaggcctggtatgttagatgggcagagagccgggacactgctggctgcgcagggagat gggggccggaacgtttttgaggaggatgaggctttgatttctctggtccaatcctctcag tgtcaagcttccttctctaatcatttcttggcgaagatgtttgtggcttacaacatgctg cagtcaaagcagggcacgctttggaagccagagggcgaggttcggctggctgcggtttgc cagtctgaaaaatgtattggagacgtaggagatgatgaaatcacatctaggattaacatc ctagaaggagcctcggagagctgtgaataccctaataataaattatttcgcagggcatat cctgtgtctttagctttggattgtgtttctacctctaatgagaacaaaatagagaaggag gagccaggcatacagccgaatttgatctcacataccaagaagaggaaactgaggcctgga gagataaggccacccacccttaccagacccttttaccaagaaaggaatagcatcaacatc aacaaaaaggtcatctacaacaaaaccccatttgtaggtcaccaacatcaaagaccaaag gtagataaaaccacaaagatggggagaaaccagagcagaaaagctgaaaattctaaaaac cagagtgcctcttctcctccaaaggatcacagctcctcgccagcaatggaacaaagctgg atgaagaatgactttgacgagttgacagaagtaggcttcagaaggtcggtaataacaaac ttctccgagctaaaggaggatgttcgaacccattgcaaggaagctaaaaaccttgaaaaa agattagatgaatga >gi568815578f:2209656_2440578|GENSCAN_predicted_peptide_2|217_aa MAKTTIISIDSGKAFDKIQQCFMLKTLNKLGIDGTYLKIITAIYDKPPANIILNGQKLEA FPLKTSTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLRNEEVKLSLFADDMIVYLEN PIVSAQNLLKLINNFSKVSGYKINAQKSQAFLYTINRQTESQIMSELPFTIAAKRIKYPG IQLTRDVKDLFKENYKPLLNEIKEFTQTNGRTFRAHG >gi568815578f:2209656_2440578|GENSCAN_predicted_CDS_2|654_bp atggcaaaaaccacgattatctcaatagattcaggaaaggcctttgacaaaattcaacag tgcttcatgctaaaaactctcaataaactaggtattgatggaacatatctcaaaataata acagctatttatgacaaacccccagccaatatcatactgaatgggcaaaaactggaagca ttccctttgaaaaccagtacaagacaaggatgccctctctcaccactcctattcaacata gtgttggaagttctagccagggcaatcaggcaagagaaagaaataaaaggtattcaatta cgaaatgaggaagtcaagttgtccctgtttgcagatgacatgattgtatatctagaaaac cccatcgtctcagcccaaaatctccttaagctgataaacaacttcagcaaagtctcagga tacaaaatcaatgcgcaaaaatcacaagcattcctatacaccattaacagacaaacagag agccaaatcatgagtgaactcccattcacaattgctgcaaagagaataaaatacccagga attcaactaacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaat gaaataaaagaatttacacaaacaaatggaagaacattccgtgctcatggatag >gi568815578f:2209656_2440578|GENSCAN_predicted_peptide_3|864_aa MAFSLMTALEEEPEKRQRKAKHGCCIQLSAVSAKKHISELPGSESGNISEQSRTSHSHPN REQGSLVLPGLTLGVRKGRDDECKCELWSLQSSTVEWGDNDQGTWFLYTLASLSVKQDVY KCLPYGVNELMHGQDIGLWDEWEEPLLLTQPQQTMHELCWSKEGTSGSPFCLTALGVQSI NWQTAFNRQAHHTDKFSSQELILRRGQNFQVLMIMNKGLGSNERLEFIVSTGPYPSESAM TKAVFPLSNGSSGGWSAVLQASNGNTLTISISSPASAPIGRYTMALQIFSQGGISSVKLG TFILLFNPWLNVDSVFMGNHAEREEYVQEDAGIIFVGSTNRIGMIGWNFGQFEEDILSIC LSILDRSLNFRRDAATDVASRNDPKYVGRVLSAMINSNDDNGVLAGNWSGTYTGGRDPRS WNGSVEILKNWKKSGFSPVRYGQCWVFAGTLNTALRSLGIPSRVITNFNSAHDTDRNLSV DVYYDPMGNPLDKGSDSVWNFHVWNEGWFVRSDLGPSYGGWQVLDATPQERSQGVFQCGP ASVIGVREGDVQLNFDMPFIFAEVNADRITWLYDNTTGKQWKNSVNSHTIGRYISTKAVG SNARMDVTDKYKYPEGSDQERQVFQKALGKLKPNTPFAATSSMGLETEEQEPSIIGKLKV AGMLAVGKEVNLVLLLKNLSRDTKTVTVNMTAWTIIYNGTLVHEVWKDSATMSLDPEEEA EHPIKISYAQYEKYLKSDNMIRITAVCKVPDESEVVVERDIILDNPTLTLEVLNEARVRK PVNVQMLFSNPLDEPVRDCVLMVEGSGLLLGNLKIDVPTLGPKEGSRVRFDILPSRSGTK QLLADFSCNKFPAIKAMLSIDVAE >gi568815578f:2209656_2440578|GENSCAN_predicted_CDS_3|2595_bp atggctttcagcctcatgacagccctggaagaggagcctgagaagaggcagaggaaggcg aaacatggctgctgtatccagctgtcagcagtgtcagcaaagaagcacatctcagaacta ccagggagtgagagtggtaacatctcggagcagtctcgcacatcccacagccaccctaac agggagcaaggctcgttggtgctgccggggctcacgcttggtgtcagaaaaggcagagat gatgagtgcaaatgcgagctctggagcctgcagtcctccactgtggagtggggtgacaac gaccaaggcacctggtttctctacaccttggcttccttgtctgtaaaacaggacgtctac aagtgcctgccttatggagttaatgagctaatgcacggacaagacattggtctctgggat gaatgggaggagcctctactattgacacagccacagcaaacaatgcatgaactttgctgg agcaaagaagggacttcaggttctcctttctgcctcacagctctaggagtccagagtatc aactggcagacggccttcaaccgacaagcgcatcacacagacaagttctccagccaggag ctcatcttgcggagaggccaaaacttccaggtcttaatgatcatgaacaaaggccttggc tctaacgaaagactggagttcattgtctccacagggccttacccctcagagtcggccatg acgaaggctgtgtttccactctccaatggcagtagtggtggctggagtgcggtgcttcag gccagcaatggcaatactctgactatcagcatctccagtcctgccagcgcacccatagga cggtacacaatggccctccagatcttctcccagggcggcatctcctctgtgaaacttggg acgttcatactgctttttaacccctggctgaatgtggatagcgtctttatgggtaaccac gctgagagagaagagtatgttcaggaagatgccggcatcatctttgtgggaagcacaaac cgaattggcatgattggctggaactttggacagtttgaagaagacattctcagcatctgc ctctcaatcttggataggagtctgaatttccgccgtgacgctgctactgatgtggccagc agaaatgaccccaaatacgttggccgggtgctgagtgccatgatcaatagcaatgatgac aatggtgtgcttgctgggaattggagcggcacttacaccggtggccgggacccaaggagc tggaacggcagcgtggagatcctcaaaaattggaaaaaatctggcttcagcccagtccga tatggccagtgctgggtctttgctgggaccctcaacacagcgctgcggtctttggggatt ccttcccgggtgatcaccaacttcaactcagctcatgacacagaccgaaatctcagtgtg gatgtgtactacgaccccatgggaaaccccctggacaagggtagtgatagcgtatggaat ttccatgtctggaatgaaggctggtttgtgaggtctgacctgggcccctcgtacggtgga tggcaggtgttggatgctaccccgcaggaaagaagccaaggggtgttccagtgcggcccc gcttcggtcattggtgttcgagagggtgatgtgcagctgaacttcgacatgccctttatc ttcgcggaggttaatgccgaccgcatcacctggctgtacgacaacaccactggcaaacag tggaagaattccgtgaacagtcacaccattggcaggtacatcagcaccaaggcggtgggc agcaatgctcgcatggacgtcacggacaagtacaagtacccagaaggctctgaccaggaa agacaagtgttccaaaaggctttggggaaacttaaacccaacacgccatttgccgcgacg tcttcaatgggtttggaaacagaggaacaggagcccagcatcatcgggaagctgaaggtc gctggcatgctggcagtaggcaaagaagtcaacctggtcctactgctcaaaaacctgagc agggatacgaagacagtgacagtgaacatgacagcctggaccatcatctacaacggcacg cttgtacatgaagtgtggaaggactctgccacaatgtccctggaccctgaggaagaggca gaacatcccataaagatctcgtacgctcagtatgagaagtacctgaagtcagacaacatg atccggatcacagcggtgtgcaaggtcccagatgagtctgaggtggtggtggagcgggac atcatcctggacaaccccaccttgaccctggaggtgctgaacgaggctcgtgtgcggaag cctgtgaacgtgcagatgctcttctccaatccactggatgagccggtgagggactgcgtg ctgatggtggagggaagcggcctgctgttgggtaacctgaagatcgacgtgccgacccta gggcccaaggaggggtcccgggtccgttttgatatcctgccctcccggagtggcaccaag caactgctcgccgacttctcctgcaacaagttccctgcaatcaaggccatgttgtccatc gatgtagccgaatga >gi568815578f:2209656_2440578|GENSCAN_predicted_peptide_4|21_aa MILLHGSTVNLQSLIRTGVEL >gi568815578f:2209656_2440578|GENSCAN_predicted_CDS_4|66_bp atgattttactacatggatcaactgtgaatttacaaagcttaataaggactggtgtggag ctctga >gi568815578f:2209656_2440578|GENSCAN_predicted_peptide_5|117_aa MRLRARKIQSSCALWVAPQASLLLGTPISRAPWVQQPQTWYRPWPGSSSGQSLQKARHEC RPLKHKNALLRALQNRGVKGLPGIDIGVNTCSQFLKPHQYGKKDFGRIREKLTTAKP >gi568815578f:2209656_2440578|GENSCAN_predicted_CDS_5|354_bp atgaggctgagggccagaaaaatccagagcagctgtgccctgtgggtggcaccccaggcc tctctgctgcttggaacccccatatccagagccccgtgggtccagcaaccacagacctgg taccgcccatggcctggctcatcctcaggccagtcactacagaaagccagacatgaatgc cgccccctaaaacacaaaaatgccttactgagagctttgcagaacagaggagtaaaaggg cttcccggcatagatattggcgtgaacacctgcagtcagttcctgaagcctcaccagtac ggcaagaaggacttcggaagaatcagagagaagttaacaacagcaaaaccttga >gi568815578f:2209656_2440578|GENSCAN_predicted_peptide_6|939_aa MVSQVTCSGETKHHARRTPKHSHGKIHMERNQGASTCIQHQFASHFPHSFINIFKKYLLS TFHVSGAEDSLMNKTDTAPCPYAVDILEREGIRVTKVDWQRSRNGAAHHTQEYPCPELVV RRGQSFSLTLELSRALDCEEILIFTMETGPRASEALHTKAVFQTSELERGEGWTAAREAQ MEKTLTVSLASPPSAVIGRYLLSIRLSSHRKHSNRRLGEFVLLFNPCKARAPVHTGPDDC FSEDDVFLASEEERQEYVLSDSGIIFRGVEKHIRAQGWNYGQVSRGTGQTRMWAGAWGAS KHSLSGEQFEEDILNICLSILDRSPGHQNNPATDVSCRHNPIYVTRVISAMVNSNNDRGV VQGQWQGKYGGGTSPLHWRGSVAILQKWLKGRYKPVKYGQCWVFAGVLCTVLRCLGIATR VVSNFNSAHDTDQNLSVDKYVDSFGRTLEDLTEDSMWNFHVWNESWFARQDLGPSYNGWQ VLDATPQEESEGVFRCGPASVTAIREGDVHLAHDGPFVFAEVNADYITWLWHEDESRERV YSNTKKIGRCISTKAVGSDSRVDITDLYKYPEGSRKERQVYSKAVNRLFGVEASGRRIWI RRAGGRCLWRDDLLEPATKPSIAGKFKVLEPPMLGHDLRLALCLANLTSRAQRVRVNLSG ATILYTRKPVAEILHESHAVRLGPQEGSYQLIAIKNNTKLLFIYRVSEKRIPITISYSKY KEDLTEDKKILLAAMCLVTKGEKLLVEKDITLEDFITIKVLGPAMVGVAVTVEVTVVNPL IERVKDCALMVEGSGLLQEQLSIDVPTLEPQERASVQFDITPSKSGPRQLQVDLPQWKAE AAPTANEKALLVSGRGREIDRRQLRSPRESVSGNVVLEITKKNVQAEEDRSFCSLPSVQP TVGALTQMVHMLPPGTWILRDQNGEKLQKALKVVVVVVP >gi568815578f:2209656_2440578|GENSCAN_predicted_CDS_6|2820_bp atggtctctcaggtcacttgctcaggggaaactaagcaccatgccaggaggacacccaag cattcccatggaaagatccacatggaaaggaaccaaggtgcctcaacctgtattcagcac cagtttgccagccattttcctcattcattcattaatatatttaagaaatatttgttgagc accttccatgtatcaggcgctgaagattcactaatgaacaaaacagacacagctccctgc ccttatgcagttgatattcttgaaagggaagggatcagagtcaccaaggtggactggcag cggtcgaggaatggcgctgcccaccacacccaggagtacccctgccctgagctggtggtt cgcaggggccagtcgttcagcctcacgctggagctgagcagagccctggactgtgaggag atcctcatcttcacgatggagacaggaccccgggcttctgaggccctccacaccaaagct gtgttccagacatcggagctggagcggggtgagggctggacagcagcaagggaggctcag atggagaaaactctgaccgtcagtctcgccagccctcccagtgctgtcattggccgctac ctgctgagcatcaggctttcctctcaccgcaaacacagcaaccggaggctgggcgagttt gttctccttttcaacccatgcaaggccagagccccagtccacaccgggcctgatgactgc ttttcagaggacgatgtgtttctggcctcagaggaggagagacaggagtacgtgctcagc gacagcggcatcatcttccgaggcgtggagaagcacatacgagcccagggctggaactac gggcaggtctccaggggcacaggccagacaaggatgtgggctggggcatggggagcctct aagcacagcctctctggggagcagtttgaggaggacatcctgaacatctgcctctccatc ctggatcgaagccccggtcaccaaaacaacccagccaccgacgtgtcctgccgccacaac cccatctacgtcaccagggtcatcagtgccatggtgaacagcaacaacgaccgaggtgtg gtgcaaggacagtggcagggcaagtacggcggcggcaccagcccgctgcactggcgcggc agcgtggccattctgcagaagtggctcaagggcaggtacaagccagtcaagtacggccag tgctgggtcttcgccggagtcctgtgcacagtcctcaggtgcttggggatagccacacgg gtcgtgtccaacttcaactcagcccacgacacagaccagaacctgagtgtggacaaatac gtggactccttcgggcggaccctggaggacctgacagaagacagcatgtggaatttccat gtctggaatgagagctggtttgcccggcaggacctaggcccctcttacaatggctggcag gttctggatgccaccccccaggaggagagtgaaggtgtgttccggtgcggcccagcctca gtcaccgccatccgcgagggtgatgtgcacctggctcacgatggccccttcgtgtttgcg gaggtcaacgccgactacatcacctggctgtggcacgaggatgagagccgggagcgtgta tactcaaacacgaagaagattgggagatgcatcagcaccaaggcggtgggcagtgactcc cgcgtggacatcactgacctctacaagtatccggaagggtcccggaaagagaggcaggtg tacagcaaggcggtgaacaggctgttcggcgtggaagcctctggaaggagaatctggatc cgcagggctgggggtcgctgtctctggcgtgacgacctcctggagcctgccaccaagccc agcatcgctggcaagttcaaggtgctagagcctcccatgctgggccacgacctgagactg gccctgtgcttggccaacctcacctcccgggcccagcgggtgagggtcaacctgagcggt gccaccatcctctatacccgcaagccagtggcagagatcctgcatgaatcccacgccgtg aggctggggccgcaagaaggtagttatcagcttattgctattaaaaataacactaaactt ttgtttatctacagagtgtcagagaagagaatcccaattacaatatcttactctaagtat aaagaagacctgacagaggacaagaagatcctgttggctgccatgtgccttgtcaccaaa ggagagaagcttctggtggagaaggacattactctagaggacttcatcaccatcaaggtt ctgggcccagccatggtgggagtggcagttacagtggaagtgacagtagtcaaccccctc atagagagagtgaaggactgtgcgctgatggtggagggcagcggccttctccaggaacag ctcagcatcgacgtgcctaccctggagcctcaggagagggcctcagtccagtttgacatc accccctccaaaagtggcccaaggcagctgcaggtggaccttccccagtggaaagcagaa gccgccccaacagccaatgagaaagccctgcttgtgagtgggagaggtagagaaattgac agaagacagcttagaagtccaagggagagtgtgagtgggaatgtggttttggaaattaca aagaaaaatgtgcaggctgaggaagacagatctttctgcagcctgccctcagtacagccc acagtgggtgccttgactcagatggtgcatatgctccctcctggaacttggatcttaaga gatcagaatggagagaagcttcagaaggcactaaaggtggtggtggtggtagtgccctag