Cargando…

RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria

Small proteins encoded by short open reading frames (ORFs) with 50 codons or fewer are emerging as an important class of cellular macromolecules in diverse organisms. However, they often evade detection by proteomics or in silico methods. Ribosome profiling (Ribo-seq) has revealed widespread transla...

Descripción completa

Detalles Bibliográficos
Autores principales: Gelhausen, Rick, Müller, Teresa, Svensson, Sarah L, Alkhnbashi, Omer S, Sharma, Cynthia M, Eggenhofer, Florian, Backofen, Rolf
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8921622/
https://www.ncbi.nlm.nih.gov/pubmed/35037022
http://dx.doi.org/10.1093/bib/bbab549
_version_ 1784669360319627264
author Gelhausen, Rick
Müller, Teresa
Svensson, Sarah L
Alkhnbashi, Omer S
Sharma, Cynthia M
Eggenhofer, Florian
Backofen, Rolf
author_facet Gelhausen, Rick
Müller, Teresa
Svensson, Sarah L
Alkhnbashi, Omer S
Sharma, Cynthia M
Eggenhofer, Florian
Backofen, Rolf
author_sort Gelhausen, Rick
collection PubMed
description Small proteins encoded by short open reading frames (ORFs) with 50 codons or fewer are emerging as an important class of cellular macromolecules in diverse organisms. However, they often evade detection by proteomics or in silico methods. Ribosome profiling (Ribo-seq) has revealed widespread translation in genomic regions previously thought to be non-coding, driving the development of ORF detection tools using Ribo-seq data. However, only a handful of tools have been designed for bacteria, and these have not yet been systematically compared. Here, we aimed to identify tools that use Ribo-seq data to correctly determine the translational status of annotated bacterial ORFs and also discover novel translated regions with high sensitivity. To this end, we generated a large set of annotated ORFs from four diverse bacterial organisms, manually labeled for their translation status based on Ribo-seq data, which are available for future benchmarking studies. This set was used to investigate the predictive performance of seven Ribo-seq-based ORF detection tools (REPARATION_blast, DeepRibo, Ribo-TISH, PRICE, smORFer, ribotricer and SPECtre), as well as IRSOM, which uses coding potential and RNA-seq coverage only. DeepRibo and REPARATION_blast robustly predicted translated ORFs, including sORFs, with no significant difference for ORFs in close proximity to other genes versus stand-alone genes. However, no tool predicted a set of novel, experimentally verified sORFs with high sensitivity. Start codon predictions with smORFer show the value of initiation site profiling data to further improve the sensitivity of ORF prediction tools in bacteria. Overall, we find that bacterial tools perform well for sORF detection, although there is potential for improving their performance, applicability, usability and reproducibility.
format Online
Article
Text
id pubmed-8921622
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-89216222022-03-15 RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria Gelhausen, Rick Müller, Teresa Svensson, Sarah L Alkhnbashi, Omer S Sharma, Cynthia M Eggenhofer, Florian Backofen, Rolf Brief Bioinform Problem Solving Protocol Small proteins encoded by short open reading frames (ORFs) with 50 codons or fewer are emerging as an important class of cellular macromolecules in diverse organisms. However, they often evade detection by proteomics or in silico methods. Ribosome profiling (Ribo-seq) has revealed widespread translation in genomic regions previously thought to be non-coding, driving the development of ORF detection tools using Ribo-seq data. However, only a handful of tools have been designed for bacteria, and these have not yet been systematically compared. Here, we aimed to identify tools that use Ribo-seq data to correctly determine the translational status of annotated bacterial ORFs and also discover novel translated regions with high sensitivity. To this end, we generated a large set of annotated ORFs from four diverse bacterial organisms, manually labeled for their translation status based on Ribo-seq data, which are available for future benchmarking studies. This set was used to investigate the predictive performance of seven Ribo-seq-based ORF detection tools (REPARATION_blast, DeepRibo, Ribo-TISH, PRICE, smORFer, ribotricer and SPECtre), as well as IRSOM, which uses coding potential and RNA-seq coverage only. DeepRibo and REPARATION_blast robustly predicted translated ORFs, including sORFs, with no significant difference for ORFs in close proximity to other genes versus stand-alone genes. However, no tool predicted a set of novel, experimentally verified sORFs with high sensitivity. Start codon predictions with smORFer show the value of initiation site profiling data to further improve the sensitivity of ORF prediction tools in bacteria. Overall, we find that bacterial tools perform well for sORF detection, although there is potential for improving their performance, applicability, usability and reproducibility. Oxford University Press 2022-01-17 /pmc/articles/PMC8921622/ /pubmed/35037022 http://dx.doi.org/10.1093/bib/bbab549 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Problem Solving Protocol
Gelhausen, Rick
Müller, Teresa
Svensson, Sarah L
Alkhnbashi, Omer S
Sharma, Cynthia M
Eggenhofer, Florian
Backofen, Rolf
RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria
title RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria
title_full RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria
title_fullStr RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria
title_full_unstemmed RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria
title_short RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria
title_sort riboreport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8921622/
https://www.ncbi.nlm.nih.gov/pubmed/35037022
http://dx.doi.org/10.1093/bib/bbab549
work_keys_str_mv AT gelhausenrick riboreportbenchmarkingtoolsforribosomeprofilingbasedidentificationofopenreadingframesinbacteria
AT mullerteresa riboreportbenchmarkingtoolsforribosomeprofilingbasedidentificationofopenreadingframesinbacteria
AT svenssonsarahl riboreportbenchmarkingtoolsforribosomeprofilingbasedidentificationofopenreadingframesinbacteria
AT alkhnbashiomers riboreportbenchmarkingtoolsforribosomeprofilingbasedidentificationofopenreadingframesinbacteria
AT sharmacynthiam riboreportbenchmarkingtoolsforribosomeprofilingbasedidentificationofopenreadingframesinbacteria
AT eggenhoferflorian riboreportbenchmarkingtoolsforribosomeprofilingbasedidentificationofopenreadingframesinbacteria
AT backofenrolf riboreportbenchmarkingtoolsforribosomeprofilingbasedidentificationofopenreadingframesinbacteria