Cargando…
Richness estimation in microbiome data obtained from denoising pipelines
The quantification of richness within a sample—either measured as the number of observed species or approximated by estimation—is a common first step in microbiome studies and is known to be highly dependent on sequencing depth, which itself is highly variable between samples. Rarefaction curves ser...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8762370/ https://www.ncbi.nlm.nih.gov/pubmed/35070172 http://dx.doi.org/10.1016/j.csbj.2021.12.036 |
_version_ | 1784633749348024320 |
---|---|
author | Kleine Bardenhorst, Sven Vital, Marius Karch, André Rübsamen, Nicole |
author_facet | Kleine Bardenhorst, Sven Vital, Marius Karch, André Rübsamen, Nicole |
author_sort | Kleine Bardenhorst, Sven |
collection | PubMed |
description | The quantification of richness within a sample—either measured as the number of observed species or approximated by estimation—is a common first step in microbiome studies and is known to be highly dependent on sequencing depth, which itself is highly variable between samples. Rarefaction curves serve as a tool to investigate this dependency and it is often argued that after rarefying data—sub-sampling to an equal sequencing depth—richness estimates no longer depend on sequencing depth. However, the estimation of richness from data obtained by high throughput sequencing methods and processed by current bioinformatics pipelines may be subject to various sources of variation related to sequencing depth. Those that may confound inference based on richness estimates and cannot be solved by sub-sampling. We investigated how pipeline settings in DADA2 and deblur affect estimates of richness and showed that the use of rarefaction and sub-sampling is inappropriate when default pipeline settings are applied. Furthermore, we showed how independent sample-wise processing established spurious correlations between sequencing depth and richness estimations in data produced by DADA2 and how this problem can be solved by a pooled processing approach. |
format | Online Article Text |
id | pubmed-8762370 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-87623702022-01-21 Richness estimation in microbiome data obtained from denoising pipelines Kleine Bardenhorst, Sven Vital, Marius Karch, André Rübsamen, Nicole Comput Struct Biotechnol J Research Article The quantification of richness within a sample—either measured as the number of observed species or approximated by estimation—is a common first step in microbiome studies and is known to be highly dependent on sequencing depth, which itself is highly variable between samples. Rarefaction curves serve as a tool to investigate this dependency and it is often argued that after rarefying data—sub-sampling to an equal sequencing depth—richness estimates no longer depend on sequencing depth. However, the estimation of richness from data obtained by high throughput sequencing methods and processed by current bioinformatics pipelines may be subject to various sources of variation related to sequencing depth. Those that may confound inference based on richness estimates and cannot be solved by sub-sampling. We investigated how pipeline settings in DADA2 and deblur affect estimates of richness and showed that the use of rarefaction and sub-sampling is inappropriate when default pipeline settings are applied. Furthermore, we showed how independent sample-wise processing established spurious correlations between sequencing depth and richness estimations in data produced by DADA2 and how this problem can be solved by a pooled processing approach. Research Network of Computational and Structural Biotechnology 2022-01-10 /pmc/articles/PMC8762370/ /pubmed/35070172 http://dx.doi.org/10.1016/j.csbj.2021.12.036 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Research Article Kleine Bardenhorst, Sven Vital, Marius Karch, André Rübsamen, Nicole Richness estimation in microbiome data obtained from denoising pipelines |
title | Richness estimation in microbiome data obtained from denoising pipelines |
title_full | Richness estimation in microbiome data obtained from denoising pipelines |
title_fullStr | Richness estimation in microbiome data obtained from denoising pipelines |
title_full_unstemmed | Richness estimation in microbiome data obtained from denoising pipelines |
title_short | Richness estimation in microbiome data obtained from denoising pipelines |
title_sort | richness estimation in microbiome data obtained from denoising pipelines |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8762370/ https://www.ncbi.nlm.nih.gov/pubmed/35070172 http://dx.doi.org/10.1016/j.csbj.2021.12.036 |
work_keys_str_mv | AT kleinebardenhorstsven richnessestimationinmicrobiomedataobtainedfromdenoisingpipelines AT vitalmarius richnessestimationinmicrobiomedataobtainedfromdenoisingpipelines AT karchandre richnessestimationinmicrobiomedataobtainedfromdenoisingpipelines AT rubsamennicole richnessestimationinmicrobiomedataobtainedfromdenoisingpipelines |