Cargando…

A marginalized two-part Beta regression model for microbiome compositional data

In microbiome studies, an important goal is to detect differential abundance of microbes across clinical conditions and treatment options. However, the microbiome compositional data (quantified by relative abundance) are highly skewed, bounded in [0, 1), and often have many zeros. A two-part model i...

Descripción completa

Detalles Bibliográficos
Autores principales: Chai, Haitao, Jiang, Hongmei, Lin, Lu, Liu, Lei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6072097/
https://www.ncbi.nlm.nih.gov/pubmed/30036363
http://dx.doi.org/10.1371/journal.pcbi.1006329
_version_ 1783343970232827904
author Chai, Haitao
Jiang, Hongmei
Lin, Lu
Liu, Lei
author_facet Chai, Haitao
Jiang, Hongmei
Lin, Lu
Liu, Lei
author_sort Chai, Haitao
collection PubMed
description In microbiome studies, an important goal is to detect differential abundance of microbes across clinical conditions and treatment options. However, the microbiome compositional data (quantified by relative abundance) are highly skewed, bounded in [0, 1), and often have many zeros. A two-part model is commonly used to separate zeros and positive values explicitly by two submodels: a logistic model for the probability of a specie being present in Part I, and a Beta regression model for the relative abundance conditional on the presence of the specie in Part II. However, the regression coefficients in Part II cannot provide a marginal (unconditional) interpretation of covariate effects on the microbial abundance, which is of great interest in many applications. In this paper, we propose a marginalized two-part Beta regression model which captures the zero-inflation and skewness of microbiome data and also allows investigators to examine covariate effects on the marginal (unconditional) mean. We demonstrate its practical performance using simulation studies and apply the model to a real metagenomic dataset on mouse skin microbiota. We find that under the proposed marginalized model, without loss in power, the likelihood ratio test performs better in controlling the type I error than those under conventional methods.
format Online
Article
Text
id pubmed-6072097
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-60720972018-08-16 A marginalized two-part Beta regression model for microbiome compositional data Chai, Haitao Jiang, Hongmei Lin, Lu Liu, Lei PLoS Comput Biol Research Article In microbiome studies, an important goal is to detect differential abundance of microbes across clinical conditions and treatment options. However, the microbiome compositional data (quantified by relative abundance) are highly skewed, bounded in [0, 1), and often have many zeros. A two-part model is commonly used to separate zeros and positive values explicitly by two submodels: a logistic model for the probability of a specie being present in Part I, and a Beta regression model for the relative abundance conditional on the presence of the specie in Part II. However, the regression coefficients in Part II cannot provide a marginal (unconditional) interpretation of covariate effects on the microbial abundance, which is of great interest in many applications. In this paper, we propose a marginalized two-part Beta regression model which captures the zero-inflation and skewness of microbiome data and also allows investigators to examine covariate effects on the marginal (unconditional) mean. We demonstrate its practical performance using simulation studies and apply the model to a real metagenomic dataset on mouse skin microbiota. We find that under the proposed marginalized model, without loss in power, the likelihood ratio test performs better in controlling the type I error than those under conventional methods. Public Library of Science 2018-07-23 /pmc/articles/PMC6072097/ /pubmed/30036363 http://dx.doi.org/10.1371/journal.pcbi.1006329 Text en © 2018 Chai et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Chai, Haitao
Jiang, Hongmei
Lin, Lu
Liu, Lei
A marginalized two-part Beta regression model for microbiome compositional data
title A marginalized two-part Beta regression model for microbiome compositional data
title_full A marginalized two-part Beta regression model for microbiome compositional data
title_fullStr A marginalized two-part Beta regression model for microbiome compositional data
title_full_unstemmed A marginalized two-part Beta regression model for microbiome compositional data
title_short A marginalized two-part Beta regression model for microbiome compositional data
title_sort marginalized two-part beta regression model for microbiome compositional data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6072097/
https://www.ncbi.nlm.nih.gov/pubmed/30036363
http://dx.doi.org/10.1371/journal.pcbi.1006329
work_keys_str_mv AT chaihaitao amarginalizedtwopartbetaregressionmodelformicrobiomecompositionaldata
AT jianghongmei amarginalizedtwopartbetaregressionmodelformicrobiomecompositionaldata
AT linlu amarginalizedtwopartbetaregressionmodelformicrobiomecompositionaldata
AT liulei amarginalizedtwopartbetaregressionmodelformicrobiomecompositionaldata
AT chaihaitao marginalizedtwopartbetaregressionmodelformicrobiomecompositionaldata
AT jianghongmei marginalizedtwopartbetaregressionmodelformicrobiomecompositionaldata
AT linlu marginalizedtwopartbetaregressionmodelformicrobiomecompositionaldata
AT liulei marginalizedtwopartbetaregressionmodelformicrobiomecompositionaldata