Cargando…

Improved Inference of Taxonomic Richness from Environmental DNA

Accurate estimation of biological diversity in environmental DNA samples using high-throughput amplicon pyrosequencing must account for errors generated by PCR and sequencing. We describe a novel approach to distinguish the underlying sequence diversity in environmental DNA samples from errors that...

Descripción completa

Detalles Bibliográficos
Autores principales: Morgan, Matthew J., Chariton, Anthony A., Hartley, Diana M., Court, Leon N., Hardy, Christopher M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3753314/
https://www.ncbi.nlm.nih.gov/pubmed/23991013
http://dx.doi.org/10.1371/journal.pone.0071974
_version_ 1782281812851032064
author Morgan, Matthew J.
Chariton, Anthony A.
Hartley, Diana M.
Court, Leon N.
Hardy, Christopher M.
author_facet Morgan, Matthew J.
Chariton, Anthony A.
Hartley, Diana M.
Court, Leon N.
Hardy, Christopher M.
author_sort Morgan, Matthew J.
collection PubMed
description Accurate estimation of biological diversity in environmental DNA samples using high-throughput amplicon pyrosequencing must account for errors generated by PCR and sequencing. We describe a novel approach to distinguish the underlying sequence diversity in environmental DNA samples from errors that uses information on the abundance distribution of similar sequences across independent samples, as well as the frequency and diversity of sequences within individual samples. We have further refined this approach into a bioinformatics pipeline, Amplicon Pyrosequence Denoising Program (APDP) that is able to process raw sequence datasets into a set of validated sequences in formats compatible with commonly used downstream analyses packages. We demonstrate, by sequencing complex environmental samples and mock communities, that APDP is effective for removing errors from deeply sequenced datasets comprising biological and technical replicates, and can efficiently denoise single-sample datasets. APDP provides more conservative diversity estimates for complex datasets than other approaches; however, for some applications this may provide a more accurate and appropriate level of resolution, and result in greater confidence that returned sequences reflect the diversity of the underlying sample.
format Online
Article
Text
id pubmed-3753314
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37533142013-08-29 Improved Inference of Taxonomic Richness from Environmental DNA Morgan, Matthew J. Chariton, Anthony A. Hartley, Diana M. Court, Leon N. Hardy, Christopher M. PLoS One Research Article Accurate estimation of biological diversity in environmental DNA samples using high-throughput amplicon pyrosequencing must account for errors generated by PCR and sequencing. We describe a novel approach to distinguish the underlying sequence diversity in environmental DNA samples from errors that uses information on the abundance distribution of similar sequences across independent samples, as well as the frequency and diversity of sequences within individual samples. We have further refined this approach into a bioinformatics pipeline, Amplicon Pyrosequence Denoising Program (APDP) that is able to process raw sequence datasets into a set of validated sequences in formats compatible with commonly used downstream analyses packages. We demonstrate, by sequencing complex environmental samples and mock communities, that APDP is effective for removing errors from deeply sequenced datasets comprising biological and technical replicates, and can efficiently denoise single-sample datasets. APDP provides more conservative diversity estimates for complex datasets than other approaches; however, for some applications this may provide a more accurate and appropriate level of resolution, and result in greater confidence that returned sequences reflect the diversity of the underlying sample. Public Library of Science 2013-08-26 /pmc/articles/PMC3753314/ /pubmed/23991013 http://dx.doi.org/10.1371/journal.pone.0071974 Text en © 2013 Morgan et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Morgan, Matthew J.
Chariton, Anthony A.
Hartley, Diana M.
Court, Leon N.
Hardy, Christopher M.
Improved Inference of Taxonomic Richness from Environmental DNA
title Improved Inference of Taxonomic Richness from Environmental DNA
title_full Improved Inference of Taxonomic Richness from Environmental DNA
title_fullStr Improved Inference of Taxonomic Richness from Environmental DNA
title_full_unstemmed Improved Inference of Taxonomic Richness from Environmental DNA
title_short Improved Inference of Taxonomic Richness from Environmental DNA
title_sort improved inference of taxonomic richness from environmental dna
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3753314/
https://www.ncbi.nlm.nih.gov/pubmed/23991013
http://dx.doi.org/10.1371/journal.pone.0071974
work_keys_str_mv AT morganmatthewj improvedinferenceoftaxonomicrichnessfromenvironmentaldna
AT charitonanthonya improvedinferenceoftaxonomicrichnessfromenvironmentaldna
AT hartleydianam improvedinferenceoftaxonomicrichnessfromenvironmentaldna
AT courtleonn improvedinferenceoftaxonomicrichnessfromenvironmentaldna
AT hardychristopherm improvedinferenceoftaxonomicrichnessfromenvironmentaldna