Cargando…

Exact sequence variants should replace operational taxonomic units in marker-gene data analysis

Recent advances have made it possible to analyze high-throughput marker-gene sequencing data without resorting to the customary construction of molecular operational taxonomic units (OTUs): clusters of sequencing reads that differ by less than a fixed dissimilarity threshold. New methods control err...

Descripción completa

Detalles Bibliográficos
Autores principales: Callahan, Benjamin J, McMurdie, Paul J, Holmes, Susan P
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5702726/
https://www.ncbi.nlm.nih.gov/pubmed/28731476
http://dx.doi.org/10.1038/ismej.2017.119
_version_ 1783281576870674432
author Callahan, Benjamin J
McMurdie, Paul J
Holmes, Susan P
author_facet Callahan, Benjamin J
McMurdie, Paul J
Holmes, Susan P
author_sort Callahan, Benjamin J
collection PubMed
description Recent advances have made it possible to analyze high-throughput marker-gene sequencing data without resorting to the customary construction of molecular operational taxonomic units (OTUs): clusters of sequencing reads that differ by less than a fixed dissimilarity threshold. New methods control errors sufficiently such that amplicon sequence variants (ASVs) can be resolved exactly, down to the level of single-nucleotide differences over the sequenced gene region. The benefits of finer resolution are immediately apparent, and arguments for ASV methods have focused on their improved resolution. Less obvious, but we believe more important, are the broad benefits that derive from the status of ASVs as consistent labels with intrinsic biological meaning identified independently from a reference database. Here we discuss how these features grant ASVs the combined advantages of closed-reference OTUs—including computational costs that scale linearly with study size, simple merging between independently processed data sets, and forward prediction—and of de novo OTUs—including accurate measurement of diversity and applicability to communities lacking deep coverage in reference databases. We argue that the improvements in reusability, reproducibility and comprehensiveness are sufficiently great that ASVs should replace OTUs as the standard unit of marker-gene analysis and reporting.
format Online
Article
Text
id pubmed-5702726
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-57027262017-12-01 Exact sequence variants should replace operational taxonomic units in marker-gene data analysis Callahan, Benjamin J McMurdie, Paul J Holmes, Susan P ISME J Perspective Recent advances have made it possible to analyze high-throughput marker-gene sequencing data without resorting to the customary construction of molecular operational taxonomic units (OTUs): clusters of sequencing reads that differ by less than a fixed dissimilarity threshold. New methods control errors sufficiently such that amplicon sequence variants (ASVs) can be resolved exactly, down to the level of single-nucleotide differences over the sequenced gene region. The benefits of finer resolution are immediately apparent, and arguments for ASV methods have focused on their improved resolution. Less obvious, but we believe more important, are the broad benefits that derive from the status of ASVs as consistent labels with intrinsic biological meaning identified independently from a reference database. Here we discuss how these features grant ASVs the combined advantages of closed-reference OTUs—including computational costs that scale linearly with study size, simple merging between independently processed data sets, and forward prediction—and of de novo OTUs—including accurate measurement of diversity and applicability to communities lacking deep coverage in reference databases. We argue that the improvements in reusability, reproducibility and comprehensiveness are sufficiently great that ASVs should replace OTUs as the standard unit of marker-gene analysis and reporting. Nature Publishing Group 2017-12 2017-07-21 /pmc/articles/PMC5702726/ /pubmed/28731476 http://dx.doi.org/10.1038/ismej.2017.119 Text en Copyright © 2017 The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Perspective
Callahan, Benjamin J
McMurdie, Paul J
Holmes, Susan P
Exact sequence variants should replace operational taxonomic units in marker-gene data analysis
title Exact sequence variants should replace operational taxonomic units in marker-gene data analysis
title_full Exact sequence variants should replace operational taxonomic units in marker-gene data analysis
title_fullStr Exact sequence variants should replace operational taxonomic units in marker-gene data analysis
title_full_unstemmed Exact sequence variants should replace operational taxonomic units in marker-gene data analysis
title_short Exact sequence variants should replace operational taxonomic units in marker-gene data analysis
title_sort exact sequence variants should replace operational taxonomic units in marker-gene data analysis
topic Perspective
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5702726/
https://www.ncbi.nlm.nih.gov/pubmed/28731476
http://dx.doi.org/10.1038/ismej.2017.119
work_keys_str_mv AT callahanbenjaminj exactsequencevariantsshouldreplaceoperationaltaxonomicunitsinmarkergenedataanalysis
AT mcmurdiepaulj exactsequencevariantsshouldreplaceoperationaltaxonomicunitsinmarkergenedataanalysis
AT holmessusanp exactsequencevariantsshouldreplaceoperationaltaxonomicunitsinmarkergenedataanalysis