Cargando…

Extracting abundance information from DNA‐based data

The accurate extraction of species‐abundance information from DNA‐based data (metabarcoding, metagenomics) could contribute usefully to diet analysis and food‐web reconstruction, the inference of species interactions, the modelling of population dynamics and species distributions, the biomonitoring...

Descripción completa

Detalles Bibliográficos
Autores principales: Luo, Mingjie, Ji, Yinqiu, Warton, David, Yu, Douglas W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10087802/
https://www.ncbi.nlm.nih.gov/pubmed/35986714
http://dx.doi.org/10.1111/1755-0998.13703
_version_ 1785022432890847232
author Luo, Mingjie
Ji, Yinqiu
Warton, David
Yu, Douglas W.
author_facet Luo, Mingjie
Ji, Yinqiu
Warton, David
Yu, Douglas W.
author_sort Luo, Mingjie
collection PubMed
description The accurate extraction of species‐abundance information from DNA‐based data (metabarcoding, metagenomics) could contribute usefully to diet analysis and food‐web reconstruction, the inference of species interactions, the modelling of population dynamics and species distributions, the biomonitoring of environmental state and change, and the inference of false positives and negatives. However, multiple sources of bias and noise in sampling and processing combine to inject error into DNA‐based data sets. To understand how to extract abundance information, it is useful to distinguish two concepts. (i) Within‐sample across‐species quantification describes relative species abundances in one sample. (ii) Across‐sample within‐species quantification describes how the abundance of each individual species varies from sample to sample, such as over a time series, an environmental gradient or different experimental treatments. First, we review the literature on methods to recover across‐species abundance information (by removing what we call “species pipeline biases”) and within‐species abundance information (by removing what we call “pipeline noise”). We argue that many ecological questions can be answered with just within‐species quantification, and we therefore demonstrate how to use a “DNA spike‐in” to correct for pipeline noise and recover within‐species abundance information. We also introduce a model‐based estimator that can be used on data sets without a physical spike‐in to approximate and correct for pipeline noise.
format Online
Article
Text
id pubmed-10087802
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-100878022023-04-12 Extracting abundance information from DNA‐based data Luo, Mingjie Ji, Yinqiu Warton, David Yu, Douglas W. Mol Ecol Resour RESOURCE ARTICLES The accurate extraction of species‐abundance information from DNA‐based data (metabarcoding, metagenomics) could contribute usefully to diet analysis and food‐web reconstruction, the inference of species interactions, the modelling of population dynamics and species distributions, the biomonitoring of environmental state and change, and the inference of false positives and negatives. However, multiple sources of bias and noise in sampling and processing combine to inject error into DNA‐based data sets. To understand how to extract abundance information, it is useful to distinguish two concepts. (i) Within‐sample across‐species quantification describes relative species abundances in one sample. (ii) Across‐sample within‐species quantification describes how the abundance of each individual species varies from sample to sample, such as over a time series, an environmental gradient or different experimental treatments. First, we review the literature on methods to recover across‐species abundance information (by removing what we call “species pipeline biases”) and within‐species abundance information (by removing what we call “pipeline noise”). We argue that many ecological questions can be answered with just within‐species quantification, and we therefore demonstrate how to use a “DNA spike‐in” to correct for pipeline noise and recover within‐species abundance information. We also introduce a model‐based estimator that can be used on data sets without a physical spike‐in to approximate and correct for pipeline noise. John Wiley and Sons Inc. 2022-08-30 2023-01 /pmc/articles/PMC10087802/ /pubmed/35986714 http://dx.doi.org/10.1111/1755-0998.13703 Text en © 2022 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle RESOURCE ARTICLES
Luo, Mingjie
Ji, Yinqiu
Warton, David
Yu, Douglas W.
Extracting abundance information from DNA‐based data
title Extracting abundance information from DNA‐based data
title_full Extracting abundance information from DNA‐based data
title_fullStr Extracting abundance information from DNA‐based data
title_full_unstemmed Extracting abundance information from DNA‐based data
title_short Extracting abundance information from DNA‐based data
title_sort extracting abundance information from dna‐based data
topic RESOURCE ARTICLES
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10087802/
https://www.ncbi.nlm.nih.gov/pubmed/35986714
http://dx.doi.org/10.1111/1755-0998.13703
work_keys_str_mv AT luomingjie extractingabundanceinformationfromdnabaseddata
AT jiyinqiu extractingabundanceinformationfromdnabaseddata
AT wartondavid extractingabundanceinformationfromdnabaseddata
AT yudouglasw extractingabundanceinformationfromdnabaseddata