
Stability of operational taxonomic units: an important but neglected property for analyzing microbial diversity

BACKGROUND: The operational taxonomic unit (OTU) is widely used in microbial ecology. Reproducibility in microbial ecology research depends on the reliability of OTU-based 16S ribosomal subunit RNA (rRNA) analyses. RESULTS: Here, we report that many hierarchical and greedy clustering methods produce...


Bibliographic Details
Main authors: He, Yan, Caporaso, J Gregory, Jiang, Xiao-Tao, Sheng, Hua-Fang, Huse, Susan M, Rideout, Jai Ram, Edgar, Robert C, Kopylova, Evguenia, Walters, William A, Knight, Rob, Zhou, Hong-Wei
Format: Online Article Text
Language: English
Published: BioMed Central 2015
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4438525/
https://www.ncbi.nlm.nih.gov/pubmed/25995836
http://dx.doi.org/10.1186/s40168-015-0081-x
author He, Yan
Caporaso, J Gregory
Jiang, Xiao-Tao
Sheng, Hua-Fang
Huse, Susan M
Rideout, Jai Ram
Edgar, Robert C
Kopylova, Evguenia
Walters, William A
Knight, Rob
Zhou, Hong-Wei
collection PubMed
description BACKGROUND: The operational taxonomic unit (OTU) is widely used in microbial ecology. Reproducibility in microbial ecology research depends on the reliability of OTU-based 16S ribosomal subunit RNA (rRNA) analyses. RESULTS: Here, we report that many hierarchical and greedy clustering methods produce unstable OTUs, with membership that depends on the number of sequences clustered. If OTUs are regenerated with additional sequences or samples, sequences originally assigned to a given OTU can be split into different OTUs. Alternatively, sequences assigned to different OTUs can be merged into a single OTU. This OTU instability affects alpha-diversity analyses such as rarefaction curves, beta-diversity analyses such as distance-based ordination (for example, Principal Coordinate Analysis (PCoA)), and the identification of differentially represented OTUs. Our results show that the proportion of unstable OTUs varies for different clustering methods. We found that the closed-reference method is the only one that produces completely stable OTUs, with the caveat that sequences that do not match a pre-existing reference sequence collection are discarded. CONCLUSIONS: As a compromise to the factors listed above, we propose using an open-reference method to enhance OTU stability. This type of method clusters sequences against a database and includes unmatched sequences by clustering them via a relatively stable de novo clustering method. OTU stability is an important consideration when analyzing microbial diversity and is a feature that should be taken into account during the development of novel OTU clustering methods. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s40168-015-0081-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4438525
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4438525 2015-05-21 Microbiome Research BioMed Central 2015-05-20 /pmc/articles/PMC4438525/ /pubmed/25995836 http://dx.doi.org/10.1186/s40168-015-0081-x Text en © He et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title Stability of operational taxonomic units: an important but neglected property for analyzing microbial diversity
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4438525/
https://www.ncbi.nlm.nih.gov/pubmed/25995836
http://dx.doi.org/10.1186/s40168-015-0081-x
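
The instability described in the abstract can be illustrated with a toy sketch. It is not the authors' code or any published tool: the function names, the three 8-base sequences, and the use of Hamming distance with a one-mismatch threshold (standing in for a sequence-identity cutoff) are all assumptions made for illustration. The greedy de novo routine picks centroids in order of abundance, so its output depends on how many copies of each sequence are present; the closed-reference routine compares each sequence only with a fixed reference set and discards anything that does not match.

```python
def hamming(a, b):
    """Number of mismatched positions between two equal-length sequences."""
    return sum(x != y for x, y in zip(a, b))


def greedy_de_novo(counts, max_mismatches):
    """Toy greedy clustering: visit sequences from most to least abundant;
    each joins the first existing centroid within the threshold,
    otherwise it becomes a new centroid."""
    centroids = []
    assignment = {}
    for seq in sorted(counts, key=lambda s: (-counts[s], s)):
        for centroid in centroids:
            if hamming(seq, centroid) <= max_mismatches:
                assignment[seq] = centroid
                break
        else:
            centroids.append(seq)
            assignment[seq] = seq
    return assignment


def closed_reference(seqs, references, max_mismatches):
    """Toy closed-reference assignment: each sequence goes to its nearest
    reference if within the threshold; unmatched sequences are discarded."""
    assignment = {}
    for seq in seqs:
        best = min(references, key=lambda ref: hamming(seq, ref))
        if hamming(seq, best) <= max_mismatches:
            assignment[seq] = best
    return assignment


# Hypothetical sequences: B is one mismatch from A and from C; A and C differ by two.
A, B, C = "AAAAAAAA", "AAAAAAAT", "AAAAAATT"

# Initial data set: A is most abundant, so B is clustered with A.
first = greedy_de_novo({A: 10, B: 5, C: 1}, max_mismatches=1)

# After adding reads, C is most abundant and becomes the first centroid,
# so B is now clustered with C instead of A.
second = greedy_de_novo({A: 10, B: 5, C: 20}, max_mismatches=1)

# Closed-reference assignment ignores abundances; C has no match and is dropped.
closed = closed_reference([A, B, C], references=[A], max_mismatches=1)

print("first run: ", first)
print("second run:", second)
print("closed-ref:", closed)
```

In this sketch, sequence B changes OTU between the two de novo runs even though B itself is unchanged, while the closed-reference result is the same for any read counts, at the cost of losing sequence C.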