Cargando…

A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity

The standard pipeline for 16S amplicon analysis starts by clustering sequences within a percent sequence similarity threshold (typically 97%) into ‘Operational Taxonomic Units’ (OTUs). From each OTU, a single sequence is selected as a representative. This representative sequence is annotated, and th...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Nam-Phuong, Warnow, Tandy, Pop, Mihai, White, Bryan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5515256/
https://www.ncbi.nlm.nih.gov/pubmed/28721243
http://dx.doi.org/10.1038/npjbiofilms.2016.4
_version_ 1783250967268950016
author Nguyen, Nam-Phuong
Warnow, Tandy
Pop, Mihai
White, Bryan
author_facet Nguyen, Nam-Phuong
Warnow, Tandy
Pop, Mihai
White, Bryan
author_sort Nguyen, Nam-Phuong
collection PubMed
description The standard pipeline for 16S amplicon analysis starts by clustering sequences within a percent sequence similarity threshold (typically 97%) into ‘Operational Taxonomic Units’ (OTUs). From each OTU, a single sequence is selected as a representative. This representative sequence is annotated, and that annotation is applied to all remaining sequences within that OTU. This perspective paper will discuss the known shortcomings of this standard approach using results obtained from the Human Microbiome Project. In particular, we will show that the traditional approach of using pairwise sequence alignments to compute sequence similarity can result in poorly clustered OTUs. As OTUs are typically annotated based upon a single representative sequence, poorly clustered OTUs can have significant impact on downstream analyses. These results suggest that we need to move beyond simple clustering techniques for 16S analysis.
format Online
Article
Text
id pubmed-5515256
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-55152562017-07-18 A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity Nguyen, Nam-Phuong Warnow, Tandy Pop, Mihai White, Bryan NPJ Biofilms Microbiomes Perspective The standard pipeline for 16S amplicon analysis starts by clustering sequences within a percent sequence similarity threshold (typically 97%) into ‘Operational Taxonomic Units’ (OTUs). From each OTU, a single sequence is selected as a representative. This representative sequence is annotated, and that annotation is applied to all remaining sequences within that OTU. This perspective paper will discuss the known shortcomings of this standard approach using results obtained from the Human Microbiome Project. In particular, we will show that the traditional approach of using pairwise sequence alignments to compute sequence similarity can result in poorly clustered OTUs. As OTUs are typically annotated based upon a single representative sequence, poorly clustered OTUs can have significant impact on downstream analyses. These results suggest that we need to move beyond simple clustering techniques for 16S analysis. Nature Publishing Group 2016-04-20 /pmc/articles/PMC5515256/ /pubmed/28721243 http://dx.doi.org/10.1038/npjbiofilms.2016.4 Text en Copyright © 2016 Nanyang Technological University/Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Perspective
Nguyen, Nam-Phuong
Warnow, Tandy
Pop, Mihai
White, Bryan
A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity
title A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity
title_full A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity
title_fullStr A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity
title_full_unstemmed A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity
title_short A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity
title_sort perspective on 16s rrna operational taxonomic unit clustering using sequence similarity
topic Perspective
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5515256/
https://www.ncbi.nlm.nih.gov/pubmed/28721243
http://dx.doi.org/10.1038/npjbiofilms.2016.4
work_keys_str_mv AT nguyennamphuong aperspectiveon16srrnaoperationaltaxonomicunitclusteringusingsequencesimilarity
AT warnowtandy aperspectiveon16srrnaoperationaltaxonomicunitclusteringusingsequencesimilarity
AT popmihai aperspectiveon16srrnaoperationaltaxonomicunitclusteringusingsequencesimilarity
AT whitebryan aperspectiveon16srrnaoperationaltaxonomicunitclusteringusingsequencesimilarity
AT nguyennamphuong perspectiveon16srrnaoperationaltaxonomicunitclusteringusingsequencesimilarity
AT warnowtandy perspectiveon16srrnaoperationaltaxonomicunitclusteringusingsequencesimilarity
AT popmihai perspectiveon16srrnaoperationaltaxonomicunitclusteringusingsequencesimilarity
AT whitebryan perspectiveon16srrnaoperationaltaxonomicunitclusteringusingsequencesimilarity