1. Academic Validation
  2. Deep transcriptome annotation enables the discovery and functional characterization of cryptic small proteins

Deep transcriptome annotation enables the discovery and functional characterization of cryptic small proteins

  • Elife. 2017 Oct 30;6:e27860. doi: 10.7554/eLife.27860.
Sondos Samandi # 1 2 Annie V Roy # 1 2 Vivian Delcourt 1 2 3 Jean-François Lucier 4 5 Jules Gagnon 4 5 Maxime C Beaudoin 1 2 Benoît Vanderperre 1 Marc-André Breton 1 Julie Motard 1 2 Jean-François Jacques 1 2 Mylène Brunelle 1 2 Isabelle Gagnon-Arsenault 2 6 7 Isabelle Fournier 3 Aida Ouangraoua 8 Darel J Hunting 9 Alan A Cohen 10 Christian R Landry 2 6 7 Michelle S Scott 1 Xavier Roucou 1 2
Affiliations

Affiliations

  • 1 Department of Biochemistry, Université de Sherbrooke, Sherbrooke, Canada.
  • 2 PROTEO, Québec Network for Research on Protein Function, Structure and Engineering, Québec, Canada.
  • 3 INSERM U1192, Laboratoire Protéomique, Réponse Inflammatoire & Spectrométrie de Masse (PRISM) F-59000 Lille, Université de Lille, Lille, France.
  • 4 Department of Biology, Université de Sherbrooke, Québec, Canada.
  • 5 Center for Scientific computing, Information Technologies Services,, Université de Sherbrooke, Québec, Canada.
  • 6 Département de biochimie, microbiologie et bioinformatique, Université Laval, Québec, Canada.
  • 7 IBIS, Université Laval, Québec, Canada.
  • 8 Department of Computer Science, Université de Sherbrooke, Québec, Canada.
  • 9 Department of Nuclear Medicine and Radiobiology, Université de Sherbrooke, Québec, Canada.
  • 10 Department of Family Medicine, Université de Sherbrooke, Québec, Canada.
  • # Contributed equally.
Abstract

Recent functional, proteomic and ribosome profiling studies in eukaryotes have concurrently demonstrated the translation of alternative open-reading frames (altORFs) in addition to annotated protein coding sequences (CDSs). We show that a large number of small proteins could in fact be coded by these altORFs. The putative alternative proteins translated from altORFs have orthologs in many species and contain functional domains. Evolutionary analyses indicate that altORFs often show more extreme conservation patterns than their CDSs. Thousands of alternative proteins are detected in proteomic datasets by reanalysis using a database containing predicted alternative proteins. This is illustrated with specific examples, including altMiD51, a 70 amino acid mitochondrial fission-promoting protein encoded in MiD51/Mief1/SMCR7L, a gene encoding an annotated protein promoting mitochondrial fission. Our results suggest that many genes are multicoding genes and code for a large protein and one or several small proteins.

Keywords

alternative translation; biochemistry; computational biology; human; open reading frames; small proteins; systems biology; translation; translation initiation sites.

Figures