Cargando…

Models for Count Data With an Application to Healthy Days Measures: Are You Driving in Screws With a Hammer?

INTRODUCTION: Count data are often collected in chronic disease research, and sometimes these data have a skewed distribution. The number of unhealthy days reported in the Behavioral Risk Factor Surveillance System (BRFSS) is an example of such data: most respondents report zero days. Studies have e...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Hong, Siegel, Paul Z., Barile, John, Njai, Rashid S., Thompson, William W., Kent, Charlotte, Liao, Youlian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Centers for Disease Control and Prevention 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3970772/
https://www.ncbi.nlm.nih.gov/pubmed/24674632
http://dx.doi.org/10.5888/pcd11.130252
_version_ 1782309427654688768
author Zhou, Hong
Siegel, Paul Z.
Barile, John
Njai, Rashid S.
Thompson, William W.
Kent, Charlotte
Liao, Youlian
author_facet Zhou, Hong
Siegel, Paul Z.
Barile, John
Njai, Rashid S.
Thompson, William W.
Kent, Charlotte
Liao, Youlian
author_sort Zhou, Hong
collection PubMed
description INTRODUCTION: Count data are often collected in chronic disease research, and sometimes these data have a skewed distribution. The number of unhealthy days reported in the Behavioral Risk Factor Surveillance System (BRFSS) is an example of such data: most respondents report zero days. Studies have either categorized the Healthy Days measure or used linear regression models. We used alternative regression models for these count data and examined the effect on statistical inference. METHODS: Using responses from participants aged 35 years or older from 12 states that included a homeownership question in their 2009 BRFSS, we compared 5 multivariate regression models — logistic, linear, Poisson, negative binomial, and zero-inflated negative binomial — with respect to 1) how well the modeled data fit the observed data and 2) how model selections affect inferences. RESULTS: Most respondents (66.8%) reported zero mentally unhealthy days. The distribution was highly skewed (variance = 58.7, mean = 3.3 d). Zero-inflated negative binomial regression provided the best-fitting model, followed by negative binomial regression. A significant independent association between homeownership and number of mentally unhealthy days was not found in the logistic, linear, or Poisson regression model but was found in the negative binomial model. The zero-inflated negative binomial model showed that homeowners were 24% more likely than nonowners to have excess zero mentally unhealthy days (adjusted odds ratio, 1.24; 95% confidence interval, 1.08–1.43), but it did not show an association between homeownership and the number of unhealthy days. CONCLUSION: Our comparison of regression models indicates the importance of examining data distribution and selecting models with appropriate assumptions. Otherwise, statistical inferences might be misleading.
format Online
Article
Text
id pubmed-3970772
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Centers for Disease Control and Prevention
record_format MEDLINE/PubMed
spelling pubmed-39707722014-04-15 Models for Count Data With an Application to Healthy Days Measures: Are You Driving in Screws With a Hammer? Zhou, Hong Siegel, Paul Z. Barile, John Njai, Rashid S. Thompson, William W. Kent, Charlotte Liao, Youlian Prev Chronic Dis Original Research INTRODUCTION: Count data are often collected in chronic disease research, and sometimes these data have a skewed distribution. The number of unhealthy days reported in the Behavioral Risk Factor Surveillance System (BRFSS) is an example of such data: most respondents report zero days. Studies have either categorized the Healthy Days measure or used linear regression models. We used alternative regression models for these count data and examined the effect on statistical inference. METHODS: Using responses from participants aged 35 years or older from 12 states that included a homeownership question in their 2009 BRFSS, we compared 5 multivariate regression models — logistic, linear, Poisson, negative binomial, and zero-inflated negative binomial — with respect to 1) how well the modeled data fit the observed data and 2) how model selections affect inferences. RESULTS: Most respondents (66.8%) reported zero mentally unhealthy days. The distribution was highly skewed (variance = 58.7, mean = 3.3 d). Zero-inflated negative binomial regression provided the best-fitting model, followed by negative binomial regression. A significant independent association between homeownership and number of mentally unhealthy days was not found in the logistic, linear, or Poisson regression model but was found in the negative binomial model. The zero-inflated negative binomial model showed that homeowners were 24% more likely than nonowners to have excess zero mentally unhealthy days (adjusted odds ratio, 1.24; 95% confidence interval, 1.08–1.43), but it did not show an association between homeownership and the number of unhealthy days. CONCLUSION: Our comparison of regression models indicates the importance of examining data distribution and selecting models with appropriate assumptions. Otherwise, statistical inferences might be misleading. Centers for Disease Control and Prevention 2014-03-27 /pmc/articles/PMC3970772/ /pubmed/24674632 http://dx.doi.org/10.5888/pcd11.130252 Text en https://creativecommons.org/licenses/by/4.0/This is a publication of the U.S. Government. This publication is in the public domain and is therefore without copyright. All text from this work may be reprinted freely. Use of these materials should be properly cited.
spellingShingle Original Research
Zhou, Hong
Siegel, Paul Z.
Barile, John
Njai, Rashid S.
Thompson, William W.
Kent, Charlotte
Liao, Youlian
Models for Count Data With an Application to Healthy Days Measures: Are You Driving in Screws With a Hammer?
title Models for Count Data With an Application to Healthy Days Measures: Are You Driving in Screws With a Hammer?
title_full Models for Count Data With an Application to Healthy Days Measures: Are You Driving in Screws With a Hammer?
title_fullStr Models for Count Data With an Application to Healthy Days Measures: Are You Driving in Screws With a Hammer?
title_full_unstemmed Models for Count Data With an Application to Healthy Days Measures: Are You Driving in Screws With a Hammer?
title_short Models for Count Data With an Application to Healthy Days Measures: Are You Driving in Screws With a Hammer?
title_sort models for count data with an application to healthy days measures: are you driving in screws with a hammer?
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3970772/
https://www.ncbi.nlm.nih.gov/pubmed/24674632
http://dx.doi.org/10.5888/pcd11.130252
work_keys_str_mv AT zhouhong modelsforcountdatawithanapplicationtohealthydaysmeasuresareyoudrivinginscrewswithahammer
AT siegelpaulz modelsforcountdatawithanapplicationtohealthydaysmeasuresareyoudrivinginscrewswithahammer
AT barilejohn modelsforcountdatawithanapplicationtohealthydaysmeasuresareyoudrivinginscrewswithahammer
AT njairashids modelsforcountdatawithanapplicationtohealthydaysmeasuresareyoudrivinginscrewswithahammer
AT thompsonwilliamw modelsforcountdatawithanapplicationtohealthydaysmeasuresareyoudrivinginscrewswithahammer
AT kentcharlotte modelsforcountdatawithanapplicationtohealthydaysmeasuresareyoudrivinginscrewswithahammer
AT liaoyoulian modelsforcountdatawithanapplicationtohealthydaysmeasuresareyoudrivinginscrewswithahammer