Cargando…

Clustering 16S rRNA for OTU prediction: a method of unsupervised Bayesian clustering

Motivation: With the advancements of next-generation sequencing technology, it is now possible to study samples directly obtained from the environment. Particularly, 16S rRNA gene sequences have been frequently used to profile the diversity of organisms in a sample. However, such studies are still t...

Descripción completa

Detalles Bibliográficos
Autores principales: Hao, Xiaolin, Jiang, Rui, Chen, Ting
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3042185/
https://www.ncbi.nlm.nih.gov/pubmed/21233169
http://dx.doi.org/10.1093/bioinformatics/btq725
_version_ 1782198520904679424
author Hao, Xiaolin
Jiang, Rui
Chen, Ting
author_facet Hao, Xiaolin
Jiang, Rui
Chen, Ting
author_sort Hao, Xiaolin
collection PubMed
description Motivation: With the advancements of next-generation sequencing technology, it is now possible to study samples directly obtained from the environment. Particularly, 16S rRNA gene sequences have been frequently used to profile the diversity of organisms in a sample. However, such studies are still taxed to determine both the number of operational taxonomic units (OTUs) and their relative abundance in a sample. Results: To address these challenges, we propose an unsupervised Bayesian clustering method termed Clustering 16S rRNA for OTU Prediction (CROP). CROP can find clusters based on the natural organization of data without setting a hard cut-off threshold (3%/5%) as required by hierarchical clustering methods. By applying our method to several datasets, we demonstrate that CROP is robust against sequencing errors and that it produces more accurate results than conventional hierarchical clustering methods. Availability and Implementation: Source code freely available at the following URL: http://code.google.com/p/crop-tingchenlab/, implemented in C++ and supported on Linux and MS Windows. Contact: tingchen@usc.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-3042185
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-30421852011-02-24 Clustering 16S rRNA for OTU prediction: a method of unsupervised Bayesian clustering Hao, Xiaolin Jiang, Rui Chen, Ting Bioinformatics Original Papers Motivation: With the advancements of next-generation sequencing technology, it is now possible to study samples directly obtained from the environment. Particularly, 16S rRNA gene sequences have been frequently used to profile the diversity of organisms in a sample. However, such studies are still taxed to determine both the number of operational taxonomic units (OTUs) and their relative abundance in a sample. Results: To address these challenges, we propose an unsupervised Bayesian clustering method termed Clustering 16S rRNA for OTU Prediction (CROP). CROP can find clusters based on the natural organization of data without setting a hard cut-off threshold (3%/5%) as required by hierarchical clustering methods. By applying our method to several datasets, we demonstrate that CROP is robust against sequencing errors and that it produces more accurate results than conventional hierarchical clustering methods. Availability and Implementation: Source code freely available at the following URL: http://code.google.com/p/crop-tingchenlab/, implemented in C++ and supported on Linux and MS Windows. Contact: tingchen@usc.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2011-03-01 2011-01-13 /pmc/articles/PMC3042185/ /pubmed/21233169 http://dx.doi.org/10.1093/bioinformatics/btq725 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Hao, Xiaolin
Jiang, Rui
Chen, Ting
Clustering 16S rRNA for OTU prediction: a method of unsupervised Bayesian clustering
title Clustering 16S rRNA for OTU prediction: a method of unsupervised Bayesian clustering
title_full Clustering 16S rRNA for OTU prediction: a method of unsupervised Bayesian clustering
title_fullStr Clustering 16S rRNA for OTU prediction: a method of unsupervised Bayesian clustering
title_full_unstemmed Clustering 16S rRNA for OTU prediction: a method of unsupervised Bayesian clustering
title_short Clustering 16S rRNA for OTU prediction: a method of unsupervised Bayesian clustering
title_sort clustering 16s rrna for otu prediction: a method of unsupervised bayesian clustering
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3042185/
https://www.ncbi.nlm.nih.gov/pubmed/21233169
http://dx.doi.org/10.1093/bioinformatics/btq725
work_keys_str_mv AT haoxiaolin clustering16srrnaforotupredictionamethodofunsupervisedbayesianclustering
AT jiangrui clustering16srrnaforotupredictionamethodofunsupervisedbayesianclustering
AT chenting clustering16srrnaforotupredictionamethodofunsupervisedbayesianclustering