Cargando…
A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity
The standard pipeline for 16S amplicon analysis starts by clustering sequences within a percent sequence similarity threshold (typically 97%) into ‘Operational Taxonomic Units’ (OTUs). From each OTU, a single sequence is selected as a representative. This representative sequence is annotated, and th...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5515256/ https://www.ncbi.nlm.nih.gov/pubmed/28721243 http://dx.doi.org/10.1038/npjbiofilms.2016.4 |
_version_ | 1783250967268950016 |
---|---|
author | Nguyen, Nam-Phuong Warnow, Tandy Pop, Mihai White, Bryan |
author_facet | Nguyen, Nam-Phuong Warnow, Tandy Pop, Mihai White, Bryan |
author_sort | Nguyen, Nam-Phuong |
collection | PubMed |
description | The standard pipeline for 16S amplicon analysis starts by clustering sequences within a percent sequence similarity threshold (typically 97%) into ‘Operational Taxonomic Units’ (OTUs). From each OTU, a single sequence is selected as a representative. This representative sequence is annotated, and that annotation is applied to all remaining sequences within that OTU. This perspective paper will discuss the known shortcomings of this standard approach using results obtained from the Human Microbiome Project. In particular, we will show that the traditional approach of using pairwise sequence alignments to compute sequence similarity can result in poorly clustered OTUs. As OTUs are typically annotated based upon a single representative sequence, poorly clustered OTUs can have significant impact on downstream analyses. These results suggest that we need to move beyond simple clustering techniques for 16S analysis. |
format | Online Article Text |
id | pubmed-5515256 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-55152562017-07-18 A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity Nguyen, Nam-Phuong Warnow, Tandy Pop, Mihai White, Bryan NPJ Biofilms Microbiomes Perspective The standard pipeline for 16S amplicon analysis starts by clustering sequences within a percent sequence similarity threshold (typically 97%) into ‘Operational Taxonomic Units’ (OTUs). From each OTU, a single sequence is selected as a representative. This representative sequence is annotated, and that annotation is applied to all remaining sequences within that OTU. This perspective paper will discuss the known shortcomings of this standard approach using results obtained from the Human Microbiome Project. In particular, we will show that the traditional approach of using pairwise sequence alignments to compute sequence similarity can result in poorly clustered OTUs. As OTUs are typically annotated based upon a single representative sequence, poorly clustered OTUs can have significant impact on downstream analyses. These results suggest that we need to move beyond simple clustering techniques for 16S analysis. Nature Publishing Group 2016-04-20 /pmc/articles/PMC5515256/ /pubmed/28721243 http://dx.doi.org/10.1038/npjbiofilms.2016.4 Text en Copyright © 2016 Nanyang Technological University/Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Perspective Nguyen, Nam-Phuong Warnow, Tandy Pop, Mihai White, Bryan A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity |
title | A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity |
title_full | A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity |
title_fullStr | A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity |
title_full_unstemmed | A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity |
title_short | A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity |
title_sort | perspective on 16s rrna operational taxonomic unit clustering using sequence similarity |
topic | Perspective |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5515256/ https://www.ncbi.nlm.nih.gov/pubmed/28721243 http://dx.doi.org/10.1038/npjbiofilms.2016.4 |
work_keys_str_mv | AT nguyennamphuong aperspectiveon16srrnaoperationaltaxonomicunitclusteringusingsequencesimilarity AT warnowtandy aperspectiveon16srrnaoperationaltaxonomicunitclusteringusingsequencesimilarity AT popmihai aperspectiveon16srrnaoperationaltaxonomicunitclusteringusingsequencesimilarity AT whitebryan aperspectiveon16srrnaoperationaltaxonomicunitclusteringusingsequencesimilarity AT nguyennamphuong perspectiveon16srrnaoperationaltaxonomicunitclusteringusingsequencesimilarity AT warnowtandy perspectiveon16srrnaoperationaltaxonomicunitclusteringusingsequencesimilarity AT popmihai perspectiveon16srrnaoperationaltaxonomicunitclusteringusingsequencesimilarity AT whitebryan perspectiveon16srrnaoperationaltaxonomicunitclusteringusingsequencesimilarity |