Cargando…

RiboTaxa: combined approaches for rRNA genes taxonomic resolution down to the species level from metagenomics data revealing novelties

Metagenomic classifiers are widely used for the taxonomic profiling of metagenomics data and estimation of taxa relative abundance. Small subunit rRNA genes are a gold standard for phylogenetic resolution of microbiota, although the power of this marker comes down to its use as full-length. We aimed...

Descripción completa

Detalles Bibliográficos
Autores principales: Chakoory, Oshma, Comtet-Marre, Sophie, Peyret, Pierre
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9492272/
https://www.ncbi.nlm.nih.gov/pubmed/36159175
http://dx.doi.org/10.1093/nargab/lqac070
_version_ 1784793443329900544
author Chakoory, Oshma
Comtet-Marre, Sophie
Peyret, Pierre
author_facet Chakoory, Oshma
Comtet-Marre, Sophie
Peyret, Pierre
author_sort Chakoory, Oshma
collection PubMed
description Metagenomic classifiers are widely used for the taxonomic profiling of metagenomics data and estimation of taxa relative abundance. Small subunit rRNA genes are a gold standard for phylogenetic resolution of microbiota, although the power of this marker comes down to its use as full-length. We aimed at identifying the tools that can efficiently lead to taxonomic resolution down to the species level. To reach this goal, we benchmarked the performance and accuracy of rRNA-specialized versus general-purpose read mappers, reference-targeted assemblers and taxonomic classifiers. We then compiled the best tools (BBTools, FastQC, SortMeRNA, MetaRib, EMIRGE, VSEARCH, BBMap and QIIME 2’s Sklearn classifier) to build a pipeline called RiboTaxa. Using metagenomics datasets, RiboTaxa gave the best results compared to other tools (i.e. Kraken2, Centrifuge, METAXA2, phyloFlash, SPINGO, BLCA, MEGAN) with precise taxonomic identification and relative abundance description without false positive detection (F-measure of 100% and 83.7% at genus level and species level, respectively). Using real datasets from various environments (i.e. ocean, soil, human gut) and from different approaches (e.g. metagenomics and gene capture by hybridization), RiboTaxa revealed microbial novelties not discerned by current bioinformatics analysis opening new biological perspectives in human and environmental health.
format Online
Article
Text
id pubmed-9492272
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-94922722022-09-22 RiboTaxa: combined approaches for rRNA genes taxonomic resolution down to the species level from metagenomics data revealing novelties Chakoory, Oshma Comtet-Marre, Sophie Peyret, Pierre NAR Genom Bioinform Methods Article Metagenomic classifiers are widely used for the taxonomic profiling of metagenomics data and estimation of taxa relative abundance. Small subunit rRNA genes are a gold standard for phylogenetic resolution of microbiota, although the power of this marker comes down to its use as full-length. We aimed at identifying the tools that can efficiently lead to taxonomic resolution down to the species level. To reach this goal, we benchmarked the performance and accuracy of rRNA-specialized versus general-purpose read mappers, reference-targeted assemblers and taxonomic classifiers. We then compiled the best tools (BBTools, FastQC, SortMeRNA, MetaRib, EMIRGE, VSEARCH, BBMap and QIIME 2’s Sklearn classifier) to build a pipeline called RiboTaxa. Using metagenomics datasets, RiboTaxa gave the best results compared to other tools (i.e. Kraken2, Centrifuge, METAXA2, phyloFlash, SPINGO, BLCA, MEGAN) with precise taxonomic identification and relative abundance description without false positive detection (F-measure of 100% and 83.7% at genus level and species level, respectively). Using real datasets from various environments (i.e. ocean, soil, human gut) and from different approaches (e.g. metagenomics and gene capture by hybridization), RiboTaxa revealed microbial novelties not discerned by current bioinformatics analysis opening new biological perspectives in human and environmental health. Oxford University Press 2022-09-21 /pmc/articles/PMC9492272/ /pubmed/36159175 http://dx.doi.org/10.1093/nargab/lqac070 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Article
Chakoory, Oshma
Comtet-Marre, Sophie
Peyret, Pierre
RiboTaxa: combined approaches for rRNA genes taxonomic resolution down to the species level from metagenomics data revealing novelties
title RiboTaxa: combined approaches for rRNA genes taxonomic resolution down to the species level from metagenomics data revealing novelties
title_full RiboTaxa: combined approaches for rRNA genes taxonomic resolution down to the species level from metagenomics data revealing novelties
title_fullStr RiboTaxa: combined approaches for rRNA genes taxonomic resolution down to the species level from metagenomics data revealing novelties
title_full_unstemmed RiboTaxa: combined approaches for rRNA genes taxonomic resolution down to the species level from metagenomics data revealing novelties
title_short RiboTaxa: combined approaches for rRNA genes taxonomic resolution down to the species level from metagenomics data revealing novelties
title_sort ribotaxa: combined approaches for rrna genes taxonomic resolution down to the species level from metagenomics data revealing novelties
topic Methods Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9492272/
https://www.ncbi.nlm.nih.gov/pubmed/36159175
http://dx.doi.org/10.1093/nargab/lqac070
work_keys_str_mv AT chakooryoshma ribotaxacombinedapproachesforrrnagenestaxonomicresolutiondowntothespecieslevelfrommetagenomicsdatarevealingnovelties
AT comtetmarresophie ribotaxacombinedapproachesforrrnagenestaxonomicresolutiondowntothespecieslevelfrommetagenomicsdatarevealingnovelties
AT peyretpierre ribotaxacombinedapproachesforrrnagenestaxonomicresolutiondowntothespecieslevelfrommetagenomicsdatarevealingnovelties