Cargando…

Extensive identification and analysis of conserved small ORFs in animals

BACKGROUND: There is increasing evidence that transcripts or transcript regions annotated as non-coding can harbor functional short open reading frames (sORFs). Loss-of-function experiments have identified essential developmental or physiological roles for a few of the encoded peptides (micropeptide...

Descripción completa

Detalles Bibliográficos
Autores principales: Mackowiak, Sebastian D., Zauber, Henrik, Bielow, Chris, Thiel, Denise, Kutz, Kamila, Calviello, Lorenzo, Mastrobuoni, Guido, Rajewsky, Nikolaus, Kempa, Stefan, Selbach, Matthias, Obermayer, Benedikt
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4568590/
https://www.ncbi.nlm.nih.gov/pubmed/26364619
http://dx.doi.org/10.1186/s13059-015-0742-x
_version_ 1782389933560823808
author Mackowiak, Sebastian D.
Zauber, Henrik
Bielow, Chris
Thiel, Denise
Kutz, Kamila
Calviello, Lorenzo
Mastrobuoni, Guido
Rajewsky, Nikolaus
Kempa, Stefan
Selbach, Matthias
Obermayer, Benedikt
author_facet Mackowiak, Sebastian D.
Zauber, Henrik
Bielow, Chris
Thiel, Denise
Kutz, Kamila
Calviello, Lorenzo
Mastrobuoni, Guido
Rajewsky, Nikolaus
Kempa, Stefan
Selbach, Matthias
Obermayer, Benedikt
author_sort Mackowiak, Sebastian D.
collection PubMed
description BACKGROUND: There is increasing evidence that transcripts or transcript regions annotated as non-coding can harbor functional short open reading frames (sORFs). Loss-of-function experiments have identified essential developmental or physiological roles for a few of the encoded peptides (micropeptides), but genome-wide experimental or computational identification of functional sORFs remains challenging. RESULTS: Here, we expand our previously developed method and present results of an integrated computational pipeline for the identification of conserved sORFs in human, mouse, zebrafish, fruit fly, and the nematode C. elegans. Isolating specific conservation signatures indicative of purifying selection on amino acid (rather than nucleotide) sequence, we identify about 2,000 novel small ORFs located in the untranslated regions of canonical mRNAs or on transcripts annotated as non-coding. Predicted sORFs show stronger conservation signatures than those identified in previous studies and are sometimes conserved over large evolutionary distances. The encoded peptides have little homology to known proteins and are enriched in disordered regions and short linear interaction motifs. Published ribosome profiling data indicate translation of more than 100 novel sORFs, and mass spectrometry data provide evidence for more than 70 novel candidates. CONCLUSIONS: Taken together, we identify hundreds of previously unknown conserved sORFs in major model organisms. Our computational analyses and integration with experimental data show that these sORFs are expressed, often translated, and sometimes widely conserved, in some cases even between vertebrates and invertebrates. We thus provide an integrated resource of putatively functional micropeptides for functional validation in vivo. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-015-0742-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4568590
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45685902015-09-15 Extensive identification and analysis of conserved small ORFs in animals Mackowiak, Sebastian D. Zauber, Henrik Bielow, Chris Thiel, Denise Kutz, Kamila Calviello, Lorenzo Mastrobuoni, Guido Rajewsky, Nikolaus Kempa, Stefan Selbach, Matthias Obermayer, Benedikt Genome Biol Research BACKGROUND: There is increasing evidence that transcripts or transcript regions annotated as non-coding can harbor functional short open reading frames (sORFs). Loss-of-function experiments have identified essential developmental or physiological roles for a few of the encoded peptides (micropeptides), but genome-wide experimental or computational identification of functional sORFs remains challenging. RESULTS: Here, we expand our previously developed method and present results of an integrated computational pipeline for the identification of conserved sORFs in human, mouse, zebrafish, fruit fly, and the nematode C. elegans. Isolating specific conservation signatures indicative of purifying selection on amino acid (rather than nucleotide) sequence, we identify about 2,000 novel small ORFs located in the untranslated regions of canonical mRNAs or on transcripts annotated as non-coding. Predicted sORFs show stronger conservation signatures than those identified in previous studies and are sometimes conserved over large evolutionary distances. The encoded peptides have little homology to known proteins and are enriched in disordered regions and short linear interaction motifs. Published ribosome profiling data indicate translation of more than 100 novel sORFs, and mass spectrometry data provide evidence for more than 70 novel candidates. CONCLUSIONS: Taken together, we identify hundreds of previously unknown conserved sORFs in major model organisms. Our computational analyses and integration with experimental data show that these sORFs are expressed, often translated, and sometimes widely conserved, in some cases even between vertebrates and invertebrates. We thus provide an integrated resource of putatively functional micropeptides for functional validation in vivo. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-015-0742-x) contains supplementary material, which is available to authorized users. BioMed Central 2015-09-14 2015 /pmc/articles/PMC4568590/ /pubmed/26364619 http://dx.doi.org/10.1186/s13059-015-0742-x Text en © Mackowiak et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Mackowiak, Sebastian D.
Zauber, Henrik
Bielow, Chris
Thiel, Denise
Kutz, Kamila
Calviello, Lorenzo
Mastrobuoni, Guido
Rajewsky, Nikolaus
Kempa, Stefan
Selbach, Matthias
Obermayer, Benedikt
Extensive identification and analysis of conserved small ORFs in animals
title Extensive identification and analysis of conserved small ORFs in animals
title_full Extensive identification and analysis of conserved small ORFs in animals
title_fullStr Extensive identification and analysis of conserved small ORFs in animals
title_full_unstemmed Extensive identification and analysis of conserved small ORFs in animals
title_short Extensive identification and analysis of conserved small ORFs in animals
title_sort extensive identification and analysis of conserved small orfs in animals
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4568590/
https://www.ncbi.nlm.nih.gov/pubmed/26364619
http://dx.doi.org/10.1186/s13059-015-0742-x
work_keys_str_mv AT mackowiaksebastiand extensiveidentificationandanalysisofconservedsmallorfsinanimals
AT zauberhenrik extensiveidentificationandanalysisofconservedsmallorfsinanimals
AT bielowchris extensiveidentificationandanalysisofconservedsmallorfsinanimals
AT thieldenise extensiveidentificationandanalysisofconservedsmallorfsinanimals
AT kutzkamila extensiveidentificationandanalysisofconservedsmallorfsinanimals
AT calviellolorenzo extensiveidentificationandanalysisofconservedsmallorfsinanimals
AT mastrobuoniguido extensiveidentificationandanalysisofconservedsmallorfsinanimals
AT rajewskynikolaus extensiveidentificationandanalysisofconservedsmallorfsinanimals
AT kempastefan extensiveidentificationandanalysisofconservedsmallorfsinanimals
AT selbachmatthias extensiveidentificationandanalysisofconservedsmallorfsinanimals
AT obermayerbenedikt extensiveidentificationandanalysisofconservedsmallorfsinanimals