Proteogenomic Annotation of Chinese Hamsters Reveals Extensive Novel Translation Events and Endogenous Retroviral Elements

Shangzhong Li, Seong Won Cha, Kelly Heffner, Deniz Baycin Hizal, Michael A. Bowen, Raghothama Chaerkady, Robert N. Cole, Vijay Tejwani, Prashant Kaushik, Michael Henry, Paula Meleady, Susan T. Sharfstein, Michael J. Betenbaugh, Vineet Bafna, Nathan E. Lewis

Research output: Contribution to journalArticlepeer-review


A high-quality genome annotation greatly facilitates successful cell line engineering. Standard draft genome annotation pipelines are based largely on de novo gene prediction, homology, and RNA-Seq data. However, draft annotations can suffer from incorrect predictions of translated sequence, inaccurate splice isoforms, and missing genes. Here, we generated a draft annotation for the newly assembled Chinese hamster genome and used RNA-Seq, proteomics, and Ribo-Seq to experimentally annotate the genome. We identified 3529 new proteins compared to the hamster RefSeq protein annotation and 2256 novel translational events (e.g., alternative splices, mutations, and novel splices). Finally, we used this pipeline to identify the source of translated retroviruses contaminating recombinant products from Chinese hamster ovary (CHO) cell lines, including 119 type-C retroviruses, thus enabling future efforts to eliminate retroviruses to reduce the costs incurred with retroviral particle clearance. In summary, the improved annotation provides a more accurate resource for CHO cell line engineering, by facilitating the interpretation of omics data, defining of cellular pathways, and engineering of complex phenotypes.

Original languageEnglish (US)
Pages (from-to)2433-2445
Number of pages13
JournalJournal of proteome research
Issue number6
StatePublished - Jun 7 2019


  • Chinese hamster
  • endogenous retrovirus
  • genome annotation
  • proteogenomics

ASJC Scopus subject areas

  • Biochemistry
  • Chemistry(all)


Dive into the research topics of 'Proteogenomic Annotation of Chinese Hamsters Reveals Extensive Novel Translation Events and Endogenous Retroviral Elements'. Together they form a unique fingerprint.

Cite this