Cargando…
RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria
Small proteins encoded by short open reading frames (ORFs) with 50 codons or fewer are emerging as an important class of cellular macromolecules in diverse organisms. However, they often evade detection by proteomics or in silico methods. Ribosome profiling (Ribo-seq) has revealed widespread transla...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8921622/ https://www.ncbi.nlm.nih.gov/pubmed/35037022 http://dx.doi.org/10.1093/bib/bbab549 |
_version_ | 1784669360319627264 |
---|---|
author | Gelhausen, Rick Müller, Teresa Svensson, Sarah L Alkhnbashi, Omer S Sharma, Cynthia M Eggenhofer, Florian Backofen, Rolf |
author_facet | Gelhausen, Rick Müller, Teresa Svensson, Sarah L Alkhnbashi, Omer S Sharma, Cynthia M Eggenhofer, Florian Backofen, Rolf |
author_sort | Gelhausen, Rick |
collection | PubMed |
description | Small proteins encoded by short open reading frames (ORFs) with 50 codons or fewer are emerging as an important class of cellular macromolecules in diverse organisms. However, they often evade detection by proteomics or in silico methods. Ribosome profiling (Ribo-seq) has revealed widespread translation in genomic regions previously thought to be non-coding, driving the development of ORF detection tools using Ribo-seq data. However, only a handful of tools have been designed for bacteria, and these have not yet been systematically compared. Here, we aimed to identify tools that use Ribo-seq data to correctly determine the translational status of annotated bacterial ORFs and also discover novel translated regions with high sensitivity. To this end, we generated a large set of annotated ORFs from four diverse bacterial organisms, manually labeled for their translation status based on Ribo-seq data, which are available for future benchmarking studies. This set was used to investigate the predictive performance of seven Ribo-seq-based ORF detection tools (REPARATION_blast, DeepRibo, Ribo-TISH, PRICE, smORFer, ribotricer and SPECtre), as well as IRSOM, which uses coding potential and RNA-seq coverage only. DeepRibo and REPARATION_blast robustly predicted translated ORFs, including sORFs, with no significant difference for ORFs in close proximity to other genes versus stand-alone genes. However, no tool predicted a set of novel, experimentally verified sORFs with high sensitivity. Start codon predictions with smORFer show the value of initiation site profiling data to further improve the sensitivity of ORF prediction tools in bacteria. Overall, we find that bacterial tools perform well for sORF detection, although there is potential for improving their performance, applicability, usability and reproducibility. |
format | Online Article Text |
id | pubmed-8921622 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-89216222022-03-15 RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria Gelhausen, Rick Müller, Teresa Svensson, Sarah L Alkhnbashi, Omer S Sharma, Cynthia M Eggenhofer, Florian Backofen, Rolf Brief Bioinform Problem Solving Protocol Small proteins encoded by short open reading frames (ORFs) with 50 codons or fewer are emerging as an important class of cellular macromolecules in diverse organisms. However, they often evade detection by proteomics or in silico methods. Ribosome profiling (Ribo-seq) has revealed widespread translation in genomic regions previously thought to be non-coding, driving the development of ORF detection tools using Ribo-seq data. However, only a handful of tools have been designed for bacteria, and these have not yet been systematically compared. Here, we aimed to identify tools that use Ribo-seq data to correctly determine the translational status of annotated bacterial ORFs and also discover novel translated regions with high sensitivity. To this end, we generated a large set of annotated ORFs from four diverse bacterial organisms, manually labeled for their translation status based on Ribo-seq data, which are available for future benchmarking studies. This set was used to investigate the predictive performance of seven Ribo-seq-based ORF detection tools (REPARATION_blast, DeepRibo, Ribo-TISH, PRICE, smORFer, ribotricer and SPECtre), as well as IRSOM, which uses coding potential and RNA-seq coverage only. DeepRibo and REPARATION_blast robustly predicted translated ORFs, including sORFs, with no significant difference for ORFs in close proximity to other genes versus stand-alone genes. However, no tool predicted a set of novel, experimentally verified sORFs with high sensitivity. Start codon predictions with smORFer show the value of initiation site profiling data to further improve the sensitivity of ORF prediction tools in bacteria. Overall, we find that bacterial tools perform well for sORF detection, although there is potential for improving their performance, applicability, usability and reproducibility. Oxford University Press 2022-01-17 /pmc/articles/PMC8921622/ /pubmed/35037022 http://dx.doi.org/10.1093/bib/bbab549 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Problem Solving Protocol Gelhausen, Rick Müller, Teresa Svensson, Sarah L Alkhnbashi, Omer S Sharma, Cynthia M Eggenhofer, Florian Backofen, Rolf RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria |
title | RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria |
title_full | RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria |
title_fullStr | RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria |
title_full_unstemmed | RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria |
title_short | RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria |
title_sort | riboreport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria |
topic | Problem Solving Protocol |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8921622/ https://www.ncbi.nlm.nih.gov/pubmed/35037022 http://dx.doi.org/10.1093/bib/bbab549 |
work_keys_str_mv | AT gelhausenrick riboreportbenchmarkingtoolsforribosomeprofilingbasedidentificationofopenreadingframesinbacteria AT mullerteresa riboreportbenchmarkingtoolsforribosomeprofilingbasedidentificationofopenreadingframesinbacteria AT svenssonsarahl riboreportbenchmarkingtoolsforribosomeprofilingbasedidentificationofopenreadingframesinbacteria AT alkhnbashiomers riboreportbenchmarkingtoolsforribosomeprofilingbasedidentificationofopenreadingframesinbacteria AT sharmacynthiam riboreportbenchmarkingtoolsforribosomeprofilingbasedidentificationofopenreadingframesinbacteria AT eggenhoferflorian riboreportbenchmarkingtoolsforribosomeprofilingbasedidentificationofopenreadingframesinbacteria AT backofenrolf riboreportbenchmarkingtoolsforribosomeprofilingbasedidentificationofopenreadingframesinbacteria |