Cargando…

Poisson hurdle model-based method for clustering microbiome features

MOTIVATION: High-throughput sequencing technologies have greatly facilitated microbiome research and have generated a large volume of microbiome data with the potential to answer key questions regarding microbiome assembly, structure and function. Cluster analysis aims to group features that behave...

Descripción completa

Detalles Bibliográficos
Autores principales: Qiao, Zhili, Barnes, Elle, Tringe, Susannah, Schachtman, Daniel P, Liu, Peng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825753/
https://www.ncbi.nlm.nih.gov/pubmed/36469352
http://dx.doi.org/10.1093/bioinformatics/btac782
_version_ 1784866690687827968
author Qiao, Zhili
Barnes, Elle
Tringe, Susannah
Schachtman, Daniel P
Liu, Peng
author_facet Qiao, Zhili
Barnes, Elle
Tringe, Susannah
Schachtman, Daniel P
Liu, Peng
author_sort Qiao, Zhili
collection PubMed
description MOTIVATION: High-throughput sequencing technologies have greatly facilitated microbiome research and have generated a large volume of microbiome data with the potential to answer key questions regarding microbiome assembly, structure and function. Cluster analysis aims to group features that behave similarly across treatments, and such grouping helps to highlight the functional relationships among features and may provide biological insights into microbiome networks. However, clustering microbiome data are challenging due to the sparsity and high dimensionality. RESULTS: We propose a model-based clustering method based on Poisson hurdle models for sparse microbiome count data. We describe an expectation–maximization algorithm and a modified version using simulated annealing to conduct the cluster analysis. Moreover, we provide algorithms for initialization and choosing the number of clusters. Simulation results demonstrate that our proposed methods provide better clustering results than alternative methods under a variety of settings. We also apply the proposed method to a sorghum rhizosphere microbiome dataset that results in interesting biological findings. AVAILABILITY AND IMPLEMENTATION: R package is freely available for download at https://cran.r-project.org/package=PHclust. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9825753
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98257532023-01-10 Poisson hurdle model-based method for clustering microbiome features Qiao, Zhili Barnes, Elle Tringe, Susannah Schachtman, Daniel P Liu, Peng Bioinformatics Original Paper MOTIVATION: High-throughput sequencing technologies have greatly facilitated microbiome research and have generated a large volume of microbiome data with the potential to answer key questions regarding microbiome assembly, structure and function. Cluster analysis aims to group features that behave similarly across treatments, and such grouping helps to highlight the functional relationships among features and may provide biological insights into microbiome networks. However, clustering microbiome data are challenging due to the sparsity and high dimensionality. RESULTS: We propose a model-based clustering method based on Poisson hurdle models for sparse microbiome count data. We describe an expectation–maximization algorithm and a modified version using simulated annealing to conduct the cluster analysis. Moreover, we provide algorithms for initialization and choosing the number of clusters. Simulation results demonstrate that our proposed methods provide better clustering results than alternative methods under a variety of settings. We also apply the proposed method to a sorghum rhizosphere microbiome dataset that results in interesting biological findings. AVAILABILITY AND IMPLEMENTATION: R package is freely available for download at https://cran.r-project.org/package=PHclust. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-12-05 /pmc/articles/PMC9825753/ /pubmed/36469352 http://dx.doi.org/10.1093/bioinformatics/btac782 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Qiao, Zhili
Barnes, Elle
Tringe, Susannah
Schachtman, Daniel P
Liu, Peng
Poisson hurdle model-based method for clustering microbiome features
title Poisson hurdle model-based method for clustering microbiome features
title_full Poisson hurdle model-based method for clustering microbiome features
title_fullStr Poisson hurdle model-based method for clustering microbiome features
title_full_unstemmed Poisson hurdle model-based method for clustering microbiome features
title_short Poisson hurdle model-based method for clustering microbiome features
title_sort poisson hurdle model-based method for clustering microbiome features
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9825753/
https://www.ncbi.nlm.nih.gov/pubmed/36469352
http://dx.doi.org/10.1093/bioinformatics/btac782
work_keys_str_mv AT qiaozhili poissonhurdlemodelbasedmethodforclusteringmicrobiomefeatures
AT barneselle poissonhurdlemodelbasedmethodforclusteringmicrobiomefeatures
AT tringesusannah poissonhurdlemodelbasedmethodforclusteringmicrobiomefeatures
AT schachtmandanielp poissonhurdlemodelbasedmethodforclusteringmicrobiomefeatures
AT liupeng poissonhurdlemodelbasedmethodforclusteringmicrobiomefeatures