Ranking the biases: The choice of OTUs vs. ASVs in 16S rRNA amplicon data analysis has stronger effects on diversity measures than rarefaction and OTU identity threshold

Advances in the analysis of amplicon sequence datasets have introduced a methodological shift in how research teams investigate microbial biodiversity, away from sequence identity-based clustering (producing Operational Taxonomic Units, OTUs) to denoising methods (producing amplicon sequence variant...

Descripción completa

Detalles Bibliográficos
Autores principales: Chiarello, Marlène, McCauley, Mark, Villéger, Sébastien, Jackson, Colin R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8870492/
https://www.ncbi.nlm.nih.gov/pubmed/35202411
http://dx.doi.org/10.1371/journal.pone.0264443
_version_ 1784656771014459392
author Chiarello, Marlène
McCauley, Mark
Villéger, Sébastien
Jackson, Colin R.
author_facet Chiarello, Marlène
McCauley, Mark
Villéger, Sébastien
Jackson, Colin R.
author_sort Chiarello, Marlène
collection PubMed
description Advances in the analysis of amplicon sequence datasets have introduced a methodological shift in how research teams investigate microbial biodiversity, away from sequence identity-based clustering (producing Operational Taxonomic Units, OTUs) to denoising methods (producing amplicon sequence variants, ASVs). While denoising methods have several inherent properties that make them desirable compared to clustering-based methods, questions remain as to the influence that these pipelines have on the ecological patterns being assessed, especially when compared to other methodological choices made when processing data (e.g. rarefaction) and computing diversity indices. We compared the respective influences of two widely used methods, namely DADA2 (a denoising method) vs. Mothur (a clustering method) on 16S rRNA gene amplicon datasets (hypervariable region v4), and compared such effects to the rarefaction of the community table and OTU identity threshold (97% vs. 99%) on the ecological signals detected. We used a dataset comprising freshwater invertebrate (three Unionidae species) gut and environmental (sediment, seston) communities sampled in six rivers in the southeastern USA. We ranked the respective effects of each methodological choice on alpha and beta diversity, and taxonomic composition. The choice of the pipeline significantly influenced alpha and beta diversities and changed the ecological signal detected, especially on presence/absence indices such as the richness index and unweighted Unifrac. Interestingly, the discrepancy between OTU and ASV-based diversity metrics could be attenuated by the use of rarefaction. The identification of major classes and genera also revealed significant discrepancies across pipelines. Compared to the pipeline’s effect, OTU threshold and rarefaction had a minimal impact on all measurements.
format Online
Article
Text
id pubmed-8870492
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-88704922022-02-25 Ranking the biases: The choice of OTUs vs. ASVs in 16S rRNA amplicon data analysis has stronger effects on diversity measures than rarefaction and OTU identity threshold Chiarello, Marlène McCauley, Mark Villéger, Sébastien Jackson, Colin R. PLoS One Research Article Advances in the analysis of amplicon sequence datasets have introduced a methodological shift in how research teams investigate microbial biodiversity, away from sequence identity-based clustering (producing Operational Taxonomic Units, OTUs) to denoising methods (producing amplicon sequence variants, ASVs). While denoising methods have several inherent properties that make them desirable compared to clustering-based methods, questions remain as to the influence that these pipelines have on the ecological patterns being assessed, especially when compared to other methodological choices made when processing data (e.g. rarefaction) and computing diversity indices. We compared the respective influences of two widely used methods, namely DADA2 (a denoising method) vs. Mothur (a clustering method) on 16S rRNA gene amplicon datasets (hypervariable region v4), and compared such effects to the rarefaction of the community table and OTU identity threshold (97% vs. 99%) on the ecological signals detected. We used a dataset comprising freshwater invertebrate (three Unionidae species) gut and environmental (sediment, seston) communities sampled in six rivers in the southeastern USA. We ranked the respective effects of each methodological choice on alpha and beta diversity, and taxonomic composition. The choice of the pipeline significantly influenced alpha and beta diversities and changed the ecological signal detected, especially on presence/absence indices such as the richness index and unweighted Unifrac. Interestingly, the discrepancy between OTU and ASV-based diversity metrics could be attenuated by the use of rarefaction. The identification of major classes and genera also revealed significant discrepancies across pipelines. Compared to the pipeline’s effect, OTU threshold and rarefaction had a minimal impact on all measurements. Public Library of Science 2022-02-24 /pmc/articles/PMC8870492/ /pubmed/35202411 http://dx.doi.org/10.1371/journal.pone.0264443 Text en © 2022 Chiarello et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Chiarello, Marlène
McCauley, Mark
Villéger, Sébastien
Jackson, Colin R.
Ranking the biases: The choice of OTUs vs. ASVs in 16S rRNA amplicon data analysis has stronger effects on diversity measures than rarefaction and OTU identity threshold
title Ranking the biases: The choice of OTUs vs. ASVs in 16S rRNA amplicon data analysis has stronger effects on diversity measures than rarefaction and OTU identity threshold
title_full Ranking the biases: The choice of OTUs vs. ASVs in 16S rRNA amplicon data analysis has stronger effects on diversity measures than rarefaction and OTU identity threshold
title_fullStr Ranking the biases: The choice of OTUs vs. ASVs in 16S rRNA amplicon data analysis has stronger effects on diversity measures than rarefaction and OTU identity threshold
title_full_unstemmed Ranking the biases: The choice of OTUs vs. ASVs in 16S rRNA amplicon data analysis has stronger effects on diversity measures than rarefaction and OTU identity threshold
title_short Ranking the biases: The choice of OTUs vs. ASVs in 16S rRNA amplicon data analysis has stronger effects on diversity measures than rarefaction and OTU identity threshold
title_sort ranking the biases: the choice of otus vs. asvs in 16s rrna amplicon data analysis has stronger effects on diversity measures than rarefaction and otu identity threshold
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8870492/
https://www.ncbi.nlm.nih.gov/pubmed/35202411
http://dx.doi.org/10.1371/journal.pone.0264443
work_keys_str_mv AT chiarellomarlene rankingthebiasesthechoiceofotusvsasvsin16srrnaamplicondataanalysishasstrongereffectsondiversitymeasuresthanrarefactionandotuidentitythreshold
AT mccauleymark rankingthebiasesthechoiceofotusvsasvsin16srrnaamplicondataanalysishasstrongereffectsondiversitymeasuresthanrarefactionandotuidentitythreshold
AT villegersebastien rankingthebiasesthechoiceofotusvsasvsin16srrnaamplicondataanalysishasstrongereffectsondiversitymeasuresthanrarefactionandotuidentitythreshold
AT jacksoncolinr rankingthebiasesthechoiceofotusvsasvsin16srrnaamplicondataanalysishasstrongereffectsondiversitymeasuresthanrarefactionandotuidentitythreshold