Cargando…
Expanding the taxonomic range in the fecal metagenome
BACKGROUND: Except for bacteria, the taxonomic diversity of the human fecal metagenome has not been widely studied, despite the potential importance of viruses and eukaryotes. Widely used bioinformatic tools contain limited numbers of non-bacterial species in their databases compared to available ge...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8188691/ https://www.ncbi.nlm.nih.gov/pubmed/34107881 http://dx.doi.org/10.1186/s12859-021-04212-6 |
_version_ | 1783705375339446272 |
---|---|
author | Allnutt, Theo R. Roth-Schulze, Alexandra J. Harrison, Leonard C. |
author_facet | Allnutt, Theo R. Roth-Schulze, Alexandra J. Harrison, Leonard C. |
author_sort | Allnutt, Theo R. |
collection | PubMed |
description | BACKGROUND: Except for bacteria, the taxonomic diversity of the human fecal metagenome has not been widely studied, despite the potential importance of viruses and eukaryotes. Widely used bioinformatic tools contain limited numbers of non-bacterial species in their databases compared to available genomic sequences and their methodologies do not favour classification of rare sequences which may represent only a small fraction of their parent genome. In seeking to optimise identification of non-bacterial species, we evaluated five widely-used metagenome classifier programs (BURST, Kraken2, Centrifuge, MetaPhlAn2 and CCMetagen) for their ability to correctly assign and count simulations of bacterial, viral and eukaryotic DNA sequence reads, including the effect of taxonomic order of analysis of bacteria, viruses and eukaryotes and the effect of sequencing depth. RESULTS: We found that the precision of metagenome classifiers varied significantly between programs and between taxonomic groups. When classifying viruses and eukaryotes, ordering the analysis such that bacteria were classified first significantly improved classification precision. Increasing sequencing depth decreased classification precision and did not improve recall of rare species. CONCLUSIONS: Choice of metagenome classifier program can have a marked effect on results with respect to precision of species assignment in different taxonomic groups. The order of taxonomic classification can markedly improve precision. Increasing sequencing depth can decrease classification precision and yields diminishing returns in probability of species detection. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04212-6. |
format | Online Article Text |
id | pubmed-8188691 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-81886912021-06-10 Expanding the taxonomic range in the fecal metagenome Allnutt, Theo R. Roth-Schulze, Alexandra J. Harrison, Leonard C. BMC Bioinformatics Research Article BACKGROUND: Except for bacteria, the taxonomic diversity of the human fecal metagenome has not been widely studied, despite the potential importance of viruses and eukaryotes. Widely used bioinformatic tools contain limited numbers of non-bacterial species in their databases compared to available genomic sequences and their methodologies do not favour classification of rare sequences which may represent only a small fraction of their parent genome. In seeking to optimise identification of non-bacterial species, we evaluated five widely-used metagenome classifier programs (BURST, Kraken2, Centrifuge, MetaPhlAn2 and CCMetagen) for their ability to correctly assign and count simulations of bacterial, viral and eukaryotic DNA sequence reads, including the effect of taxonomic order of analysis of bacteria, viruses and eukaryotes and the effect of sequencing depth. RESULTS: We found that the precision of metagenome classifiers varied significantly between programs and between taxonomic groups. When classifying viruses and eukaryotes, ordering the analysis such that bacteria were classified first significantly improved classification precision. Increasing sequencing depth decreased classification precision and did not improve recall of rare species. CONCLUSIONS: Choice of metagenome classifier program can have a marked effect on results with respect to precision of species assignment in different taxonomic groups. The order of taxonomic classification can markedly improve precision. Increasing sequencing depth can decrease classification precision and yields diminishing returns in probability of species detection. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04212-6. BioMed Central 2021-06-09 /pmc/articles/PMC8188691/ /pubmed/34107881 http://dx.doi.org/10.1186/s12859-021-04212-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Allnutt, Theo R. Roth-Schulze, Alexandra J. Harrison, Leonard C. Expanding the taxonomic range in the fecal metagenome |
title | Expanding the taxonomic range in the fecal metagenome |
title_full | Expanding the taxonomic range in the fecal metagenome |
title_fullStr | Expanding the taxonomic range in the fecal metagenome |
title_full_unstemmed | Expanding the taxonomic range in the fecal metagenome |
title_short | Expanding the taxonomic range in the fecal metagenome |
title_sort | expanding the taxonomic range in the fecal metagenome |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8188691/ https://www.ncbi.nlm.nih.gov/pubmed/34107881 http://dx.doi.org/10.1186/s12859-021-04212-6 |
work_keys_str_mv | AT allnutttheor expandingthetaxonomicrangeinthefecalmetagenome AT rothschulzealexandraj expandingthetaxonomicrangeinthefecalmetagenome AT harrisonleonardc expandingthetaxonomicrangeinthefecalmetagenome |