Ranking the biases: The choice of OTUs vs. ASVs in 16S rRNA amplicon data analysis has stronger effects on diversity measures than rarefaction and OTU identity threshold
Advances in the analysis of amplicon sequence datasets have introduced a methodological shift in how research teams investigate microbial biodiversity, away from sequence identity-based clustering (producing Operational Taxonomic Units, OTUs) to denoising methods (producing amplicon sequence variant...
Main Authors: | Chiarello, Marlène; McCauley, Mark; Villéger, Sébastien; Jackson, Colin R. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Public Library of Science 2022 |
Subjects: | Research Article |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8870492/ https://www.ncbi.nlm.nih.gov/pubmed/35202411 http://dx.doi.org/10.1371/journal.pone.0264443 |
_version_ | 1784656771014459392 |
---|---|
author | Chiarello, Marlène; McCauley, Mark; Villéger, Sébastien; Jackson, Colin R. |
author_facet | Chiarello, Marlène; McCauley, Mark; Villéger, Sébastien; Jackson, Colin R. |
author_sort | Chiarello, Marlène |
collection | PubMed |
description | Advances in the analysis of amplicon sequence datasets have introduced a methodological shift in how research teams investigate microbial biodiversity, away from sequence identity-based clustering (producing Operational Taxonomic Units, OTUs) to denoising methods (producing amplicon sequence variants, ASVs). While denoising methods have several inherent properties that make them desirable compared to clustering-based methods, questions remain as to the influence that these pipelines have on the ecological patterns being assessed, especially when compared to other methodological choices made when processing data (e.g. rarefaction) and computing diversity indices. We compared the respective influences of two widely used methods, namely DADA2 (a denoising method) vs. Mothur (a clustering method) on 16S rRNA gene amplicon datasets (hypervariable region V4), and compared such effects to the rarefaction of the community table and OTU identity threshold (97% vs. 99%) on the ecological signals detected. We used a dataset comprising freshwater invertebrate (three Unionidae species) gut and environmental (sediment, seston) communities sampled in six rivers in the southeastern USA. We ranked the respective effects of each methodological choice on alpha and beta diversity, and taxonomic composition. The choice of the pipeline significantly influenced alpha and beta diversities and changed the ecological signal detected, especially on presence/absence indices such as the richness index and unweighted UniFrac. Interestingly, the discrepancy between OTU- and ASV-based diversity metrics could be attenuated by the use of rarefaction. The identification of major classes and genera also revealed significant discrepancies across pipelines. Compared to the pipeline’s effect, OTU threshold and rarefaction had a minimal impact on all measurements. |
format | Online Article Text |
id | pubmed-8870492 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-8870492 2022-02-25 Ranking the biases: The choice of OTUs vs. ASVs in 16S rRNA amplicon data analysis has stronger effects on diversity measures than rarefaction and OTU identity threshold Chiarello, Marlène McCauley, Mark Villéger, Sébastien Jackson, Colin R. PLoS One Research Article Public Library of Science 2022-02-24 /pmc/articles/PMC8870492/ /pubmed/35202411 http://dx.doi.org/10.1371/journal.pone.0264443 Text en © 2022 Chiarello et al https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
title | Ranking the biases: The choice of OTUs vs. ASVs in 16S rRNA amplicon data analysis has stronger effects on diversity measures than rarefaction and OTU identity threshold |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8870492/ https://www.ncbi.nlm.nih.gov/pubmed/35202411 http://dx.doi.org/10.1371/journal.pone.0264443 |
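
The description above refers to rarefaction of the community table and to presence/absence indices such as richness. As a rough illustration only, and not the authors' pipeline, the sketch below rarefies two toy samples to an even depth and computes richness. Because unweighted UniFrac needs a phylogenetic tree, the sketch substitutes Jaccard dissimilarity as a simpler presence/absence index. All taxon names, counts, and function names are hypothetical.

```python
import random
from collections import Counter


def rarefy(counts, depth, seed=0):
    """Subsample `depth` reads without replacement from a {taxon: count} table."""
    total = sum(counts.values())
    if depth > total:
        raise ValueError("depth exceeds the number of reads in the sample")
    # Expand the table into one label per read, then draw without replacement.
    reads = [taxon for taxon, n in counts.items() for _ in range(n)]
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    return dict(Counter(rng.sample(reads, depth)))


def richness(counts):
    """Number of taxa observed at least once (a presence/absence index)."""
    return sum(1 for n in counts.values() if n > 0)


def jaccard_dissimilarity(a, b):
    """1 - |shared taxa| / |all taxa|; a stand-in here for unweighted UniFrac."""
    present_a = {t for t, n in a.items() if n > 0}
    present_b = {t for t, n in b.items() if n > 0}
    union = present_a | present_b
    if not union:
        return 0.0
    return 1.0 - len(present_a & present_b) / len(union)


# Hypothetical toy samples with very different sequencing depths.
gut = {"taxon_A": 500, "taxon_B": 300, "taxon_C": 5, "taxon_D": 1}
sediment = {"taxon_A": 40, "taxon_B": 30, "taxon_E": 30}

# Rarefy both samples to the depth of the shallowest one.
depth = min(sum(gut.values()), sum(sediment.values()))
gut_r = rarefy(gut, depth)
sed_r = rarefy(sediment, depth)

print("richness before:", richness(gut), richness(sediment))
print("richness after: ", richness(gut_r), richness(sed_r))
print("Jaccard after:  ", jaccard_dissimilarity(gut_r, sed_r))
```

Rare taxa in the deeper sample may drop out after subsampling, which is one reason presence/absence indices can be sensitive to how the community table is processed.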