Cargando…

REPARATION: ribosome profiling assisted (re-)annotation of bacterial genomes

Prokaryotic genome annotation is highly dependent on automated methods, as manual curation cannot keep up with the exponential growth of sequenced genomes. Current automated methods depend heavily on sequence composition and often underestimate the complexity of the proteome. We developed RibosomeE...

Descripción completa

Detalles Bibliográficos
Autores principales: Ndah, Elvis, Jonckheere, Veronique, Giess, Adam, Valen, Eivind, Menschaert, Gerben, Van Damme, Petra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5714196/
https://www.ncbi.nlm.nih.gov/pubmed/28977509
http://dx.doi.org/10.1093/nar/gkx758
_version_ 1783283543026171904
author Ndah, Elvis
Jonckheere, Veronique
Giess, Adam
Valen, Eivind
Menschaert, Gerben
Van Damme, Petra
author_facet Ndah, Elvis
Jonckheere, Veronique
Giess, Adam
Valen, Eivind
Menschaert, Gerben
Van Damme, Petra
author_sort Ndah, Elvis
collection PubMed
description Prokaryotic genome annotation is highly dependent on automated methods, as manual curation cannot keep up with the exponential growth of sequenced genomes. Current automated methods depend heavily on sequence composition and often underestimate the complexity of the proteome. We developed RibosomeE Profiling Assisted (re-)AnnotaTION (REPARATION), a de novo machine learning algorithm that takes advantage of experimental protein synthesis evidence from ribosome profiling (Ribo-seq) to delineate translated open reading frames (ORFs) in bacteria, independent of genome annotation (https://github.com/Biobix/REPARATION). REPARATION evaluates all possible ORFs in the genome and estimates minimum thresholds based on a growth curve model to screen for spurious ORFs. We applied REPARATION to three annotated bacterial species to obtain a more comprehensive mapping of their translation landscape in support of experimental data. In all cases, we identified hundreds of novel (small) ORFs including variants of previously annotated ORFs and >70% of all (variants of) annotated protein coding ORFs were predicted by REPARATION to be translated. Our predictions are supported by matching mass spectrometry proteomics data, sequence composition and conservation analysis. REPARATION is unique in that it makes use of experimental translation evidence to intrinsically perform a de novo ORF delineation in bacterial genomes irrespective of the sequence features linked to open reading frames.
format Online
Article
Text
id pubmed-5714196
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-57141962017-12-08 REPARATION: ribosome profiling assisted (re-)annotation of bacterial genomes Ndah, Elvis Jonckheere, Veronique Giess, Adam Valen, Eivind Menschaert, Gerben Van Damme, Petra Nucleic Acids Res Methods Online Prokaryotic genome annotation is highly dependent on automated methods, as manual curation cannot keep up with the exponential growth of sequenced genomes. Current automated methods depend heavily on sequence composition and often underestimate the complexity of the proteome. We developed RibosomeE Profiling Assisted (re-)AnnotaTION (REPARATION), a de novo machine learning algorithm that takes advantage of experimental protein synthesis evidence from ribosome profiling (Ribo-seq) to delineate translated open reading frames (ORFs) in bacteria, independent of genome annotation (https://github.com/Biobix/REPARATION). REPARATION evaluates all possible ORFs in the genome and estimates minimum thresholds based on a growth curve model to screen for spurious ORFs. We applied REPARATION to three annotated bacterial species to obtain a more comprehensive mapping of their translation landscape in support of experimental data. In all cases, we identified hundreds of novel (small) ORFs including variants of previously annotated ORFs and >70% of all (variants of) annotated protein coding ORFs were predicted by REPARATION to be translated. Our predictions are supported by matching mass spectrometry proteomics data, sequence composition and conservation analysis. REPARATION is unique in that it makes use of experimental translation evidence to intrinsically perform a de novo ORF delineation in bacterial genomes irrespective of the sequence features linked to open reading frames. Oxford University Press 2017-11-16 2017-08-31 /pmc/articles/PMC5714196/ /pubmed/28977509 http://dx.doi.org/10.1093/nar/gkx758 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Ndah, Elvis
Jonckheere, Veronique
Giess, Adam
Valen, Eivind
Menschaert, Gerben
Van Damme, Petra
REPARATION: ribosome profiling assisted (re-)annotation of bacterial genomes
title REPARATION: ribosome profiling assisted (re-)annotation of bacterial genomes
title_full REPARATION: ribosome profiling assisted (re-)annotation of bacterial genomes
title_fullStr REPARATION: ribosome profiling assisted (re-)annotation of bacterial genomes
title_full_unstemmed REPARATION: ribosome profiling assisted (re-)annotation of bacterial genomes
title_short REPARATION: ribosome profiling assisted (re-)annotation of bacterial genomes
title_sort reparation: ribosome profiling assisted (re-)annotation of bacterial genomes
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5714196/
https://www.ncbi.nlm.nih.gov/pubmed/28977509
http://dx.doi.org/10.1093/nar/gkx758
work_keys_str_mv AT ndahelvis reparationribosomeprofilingassistedreannotationofbacterialgenomes
AT jonckheereveronique reparationribosomeprofilingassistedreannotationofbacterialgenomes
AT giessadam reparationribosomeprofilingassistedreannotationofbacterialgenomes
AT valeneivind reparationribosomeprofilingassistedreannotationofbacterialgenomes
AT menschaertgerben reparationribosomeprofilingassistedreannotationofbacterialgenomes
AT vandammepetra reparationribosomeprofilingassistedreannotationofbacterialgenomes