Cargando…
A marginalized two-part Beta regression model for microbiome compositional data
In microbiome studies, an important goal is to detect differential abundance of microbes across clinical conditions and treatment options. However, the microbiome compositional data (quantified by relative abundance) are highly skewed, bounded in [0, 1), and often have many zeros. A two-part model i...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6072097/ https://www.ncbi.nlm.nih.gov/pubmed/30036363 http://dx.doi.org/10.1371/journal.pcbi.1006329 |
_version_ | 1783343970232827904 |
---|---|
author | Chai, Haitao Jiang, Hongmei Lin, Lu Liu, Lei |
author_facet | Chai, Haitao Jiang, Hongmei Lin, Lu Liu, Lei |
author_sort | Chai, Haitao |
collection | PubMed |
description | In microbiome studies, an important goal is to detect differential abundance of microbes across clinical conditions and treatment options. However, the microbiome compositional data (quantified by relative abundance) are highly skewed, bounded in [0, 1), and often have many zeros. A two-part model is commonly used to separate zeros and positive values explicitly by two submodels: a logistic model for the probability of a specie being present in Part I, and a Beta regression model for the relative abundance conditional on the presence of the specie in Part II. However, the regression coefficients in Part II cannot provide a marginal (unconditional) interpretation of covariate effects on the microbial abundance, which is of great interest in many applications. In this paper, we propose a marginalized two-part Beta regression model which captures the zero-inflation and skewness of microbiome data and also allows investigators to examine covariate effects on the marginal (unconditional) mean. We demonstrate its practical performance using simulation studies and apply the model to a real metagenomic dataset on mouse skin microbiota. We find that under the proposed marginalized model, without loss in power, the likelihood ratio test performs better in controlling the type I error than those under conventional methods. |
format | Online Article Text |
id | pubmed-6072097 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-60720972018-08-16 A marginalized two-part Beta regression model for microbiome compositional data Chai, Haitao Jiang, Hongmei Lin, Lu Liu, Lei PLoS Comput Biol Research Article In microbiome studies, an important goal is to detect differential abundance of microbes across clinical conditions and treatment options. However, the microbiome compositional data (quantified by relative abundance) are highly skewed, bounded in [0, 1), and often have many zeros. A two-part model is commonly used to separate zeros and positive values explicitly by two submodels: a logistic model for the probability of a specie being present in Part I, and a Beta regression model for the relative abundance conditional on the presence of the specie in Part II. However, the regression coefficients in Part II cannot provide a marginal (unconditional) interpretation of covariate effects on the microbial abundance, which is of great interest in many applications. In this paper, we propose a marginalized two-part Beta regression model which captures the zero-inflation and skewness of microbiome data and also allows investigators to examine covariate effects on the marginal (unconditional) mean. We demonstrate its practical performance using simulation studies and apply the model to a real metagenomic dataset on mouse skin microbiota. We find that under the proposed marginalized model, without loss in power, the likelihood ratio test performs better in controlling the type I error than those under conventional methods. Public Library of Science 2018-07-23 /pmc/articles/PMC6072097/ /pubmed/30036363 http://dx.doi.org/10.1371/journal.pcbi.1006329 Text en © 2018 Chai et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Chai, Haitao Jiang, Hongmei Lin, Lu Liu, Lei A marginalized two-part Beta regression model for microbiome compositional data |
title | A marginalized two-part Beta regression model for microbiome compositional data |
title_full | A marginalized two-part Beta regression model for microbiome compositional data |
title_fullStr | A marginalized two-part Beta regression model for microbiome compositional data |
title_full_unstemmed | A marginalized two-part Beta regression model for microbiome compositional data |
title_short | A marginalized two-part Beta regression model for microbiome compositional data |
title_sort | marginalized two-part beta regression model for microbiome compositional data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6072097/ https://www.ncbi.nlm.nih.gov/pubmed/30036363 http://dx.doi.org/10.1371/journal.pcbi.1006329 |
work_keys_str_mv | AT chaihaitao amarginalizedtwopartbetaregressionmodelformicrobiomecompositionaldata AT jianghongmei amarginalizedtwopartbetaregressionmodelformicrobiomecompositionaldata AT linlu amarginalizedtwopartbetaregressionmodelformicrobiomecompositionaldata AT liulei amarginalizedtwopartbetaregressionmodelformicrobiomecompositionaldata AT chaihaitao marginalizedtwopartbetaregressionmodelformicrobiomecompositionaldata AT jianghongmei marginalizedtwopartbetaregressionmodelformicrobiomecompositionaldata AT linlu marginalizedtwopartbetaregressionmodelformicrobiomecompositionaldata AT liulei marginalizedtwopartbetaregressionmodelformicrobiomecompositionaldata |