Cargando…

Marginalized mixture models for count data from multiple source populations

Mixture distributions provide flexibility in modeling data collected from populations having unexplained heterogeneity. While interpretations of regression parameters from traditional finite mixture models are specific to unobserved subpopulations or latent classes, investigators are often intereste...

Descripción completa

Detalles Bibliográficos
Autores principales: Benecha, Habtamu K., Neelon, Brian, Divaris, Kimon, Preisser, John S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5384970/
https://www.ncbi.nlm.nih.gov/pubmed/28446995
http://dx.doi.org/10.1186/s40488-017-0057-4
_version_ 1782520519136903168
author Benecha, Habtamu K.
Neelon, Brian
Divaris, Kimon
Preisser, John S.
author_facet Benecha, Habtamu K.
Neelon, Brian
Divaris, Kimon
Preisser, John S.
author_sort Benecha, Habtamu K.
collection PubMed
description Mixture distributions provide flexibility in modeling data collected from populations having unexplained heterogeneity. While interpretations of regression parameters from traditional finite mixture models are specific to unobserved subpopulations or latent classes, investigators are often interested in making inferences about the marginal mean of a count variable in the overall population. Recently, marginal mean regression modeling procedures for zero-inflated count outcomes have been introduced within the framework of maximum likelihood estimation of zero-inflated Poisson and negative binomial regression models. In this article, we propose marginalized mixture regression models based on two-component mixtures of non-degenerate count data distributions that provide directly interpretable estimates of exposure effects on the overall population mean of a count outcome. The models are examined using simulations and applied to two datasets, one from a double-blind dental caries incidence trial, and the other from a horticultural experiment. The finite sample performance of the proposed models are compared with each other and with marginalized zero-inflated count models, as well as ordinary Poisson and negative binomial regression. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s40488-017-0057-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5384970
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-53849702017-04-24 Marginalized mixture models for count data from multiple source populations Benecha, Habtamu K. Neelon, Brian Divaris, Kimon Preisser, John S. J Stat Distrib Appl Research Mixture distributions provide flexibility in modeling data collected from populations having unexplained heterogeneity. While interpretations of regression parameters from traditional finite mixture models are specific to unobserved subpopulations or latent classes, investigators are often interested in making inferences about the marginal mean of a count variable in the overall population. Recently, marginal mean regression modeling procedures for zero-inflated count outcomes have been introduced within the framework of maximum likelihood estimation of zero-inflated Poisson and negative binomial regression models. In this article, we propose marginalized mixture regression models based on two-component mixtures of non-degenerate count data distributions that provide directly interpretable estimates of exposure effects on the overall population mean of a count outcome. The models are examined using simulations and applied to two datasets, one from a double-blind dental caries incidence trial, and the other from a horticultural experiment. The finite sample performance of the proposed models are compared with each other and with marginalized zero-inflated count models, as well as ordinary Poisson and negative binomial regression. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s40488-017-0057-4) contains supplementary material, which is available to authorized users. Springer Berlin Heidelberg 2017-04-07 2017 /pmc/articles/PMC5384970/ /pubmed/28446995 http://dx.doi.org/10.1186/s40488-017-0057-4 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Research
Benecha, Habtamu K.
Neelon, Brian
Divaris, Kimon
Preisser, John S.
Marginalized mixture models for count data from multiple source populations
title Marginalized mixture models for count data from multiple source populations
title_full Marginalized mixture models for count data from multiple source populations
title_fullStr Marginalized mixture models for count data from multiple source populations
title_full_unstemmed Marginalized mixture models for count data from multiple source populations
title_short Marginalized mixture models for count data from multiple source populations
title_sort marginalized mixture models for count data from multiple source populations
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5384970/
https://www.ncbi.nlm.nih.gov/pubmed/28446995
http://dx.doi.org/10.1186/s40488-017-0057-4
work_keys_str_mv AT benechahabtamuk marginalizedmixturemodelsforcountdatafrommultiplesourcepopulations
AT neelonbrian marginalizedmixturemodelsforcountdatafrommultiplesourcepopulations
AT divariskimon marginalizedmixturemodelsforcountdatafrommultiplesourcepopulations
AT preisserjohns marginalizedmixturemodelsforcountdatafrommultiplesourcepopulations