Cargando…
Study of depression influencing factors with zero-inflated regression models in a large-scale population survey
OBJECTIVES: The number of depression symptoms can be considered as count data in order to get complete and accurate analyses findings in studies of depression. This study aims to compare the goodness of fit of four count outcomes models by a large survey sample to identify the optimum model for a ri...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BMJ Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5719265/ https://www.ncbi.nlm.nih.gov/pubmed/29187409 http://dx.doi.org/10.1136/bmjopen-2017-016471 |
_version_ | 1783284453107302400 |
---|---|
author | Xu, Tao Zhu, Guangjin Han, Shaomei |
author_facet | Xu, Tao Zhu, Guangjin Han, Shaomei |
author_sort | Xu, Tao |
collection | PubMed |
description | OBJECTIVES: The number of depression symptoms can be considered as count data in order to get complete and accurate analyses findings in studies of depression. This study aims to compare the goodness of fit of four count outcomes models by a large survey sample to identify the optimum model for a risk factor study of the number of depression symptoms. METHODS: 15 820 subjects, aged 10 to 80 years old, who were not suffering from serious chronic diseases and had not run a high fever in the past 15 days, agreed to take part in this survey; 15 462 subjects completed all the survey scales. The number of depression symptoms was the sum of the ‘positive’ responses of seven depression questions. Four count outcomes models and a logistic model were constructed to identify the optimum model of the number of depression symptoms. RESULTS: The mean number of depression symptoms was 1.37±1.55. The over-dispersion test statistic O was 308.011. The alpha dispersion parameter was 0.475 (95% CI 0.443 to 0.508), which was significantly larger than 0. The Vuong test statistic Z was 6.782 and the P value was <0.001, which showed that there were too many zero counts to be accounted for with traditional negative binomial distribution. The zero-inflated negative binomial (ZINB) model had the largest log likelihood and smallest AIC and BIC, suggesting best goodness of fit. In addition, predictive probabilities for many counts in the ZINB model fitted the observed counts best. CONCLUSIONS: All fitting test statistics and the predictive probability curve produced the same findings that the ZINB model was the best model for fitting the number of depression symptoms, assessing both the presence or absence of depression and its severity. |
format | Online Article Text |
id | pubmed-5719265 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BMJ Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-57192652017-12-08 Study of depression influencing factors with zero-inflated regression models in a large-scale population survey Xu, Tao Zhu, Guangjin Han, Shaomei BMJ Open Epidemiology OBJECTIVES: The number of depression symptoms can be considered as count data in order to get complete and accurate analyses findings in studies of depression. This study aims to compare the goodness of fit of four count outcomes models by a large survey sample to identify the optimum model for a risk factor study of the number of depression symptoms. METHODS: 15 820 subjects, aged 10 to 80 years old, who were not suffering from serious chronic diseases and had not run a high fever in the past 15 days, agreed to take part in this survey; 15 462 subjects completed all the survey scales. The number of depression symptoms was the sum of the ‘positive’ responses of seven depression questions. Four count outcomes models and a logistic model were constructed to identify the optimum model of the number of depression symptoms. RESULTS: The mean number of depression symptoms was 1.37±1.55. The over-dispersion test statistic O was 308.011. The alpha dispersion parameter was 0.475 (95% CI 0.443 to 0.508), which was significantly larger than 0. The Vuong test statistic Z was 6.782 and the P value was <0.001, which showed that there were too many zero counts to be accounted for with traditional negative binomial distribution. The zero-inflated negative binomial (ZINB) model had the largest log likelihood and smallest AIC and BIC, suggesting best goodness of fit. In addition, predictive probabilities for many counts in the ZINB model fitted the observed counts best. CONCLUSIONS: All fitting test statistics and the predictive probability curve produced the same findings that the ZINB model was the best model for fitting the number of depression symptoms, assessing both the presence or absence of depression and its severity. BMJ Publishing Group 2017-11-28 /pmc/articles/PMC5719265/ /pubmed/29187409 http://dx.doi.org/10.1136/bmjopen-2017-016471 Text en © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted. This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ |
spellingShingle | Epidemiology Xu, Tao Zhu, Guangjin Han, Shaomei Study of depression influencing factors with zero-inflated regression models in a large-scale population survey |
title | Study of depression influencing factors with zero-inflated regression models in a large-scale population survey |
title_full | Study of depression influencing factors with zero-inflated regression models in a large-scale population survey |
title_fullStr | Study of depression influencing factors with zero-inflated regression models in a large-scale population survey |
title_full_unstemmed | Study of depression influencing factors with zero-inflated regression models in a large-scale population survey |
title_short | Study of depression influencing factors with zero-inflated regression models in a large-scale population survey |
title_sort | study of depression influencing factors with zero-inflated regression models in a large-scale population survey |
topic | Epidemiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5719265/ https://www.ncbi.nlm.nih.gov/pubmed/29187409 http://dx.doi.org/10.1136/bmjopen-2017-016471 |
work_keys_str_mv | AT xutao studyofdepressioninfluencingfactorswithzeroinflatedregressionmodelsinalargescalepopulationsurvey AT zhuguangjin studyofdepressioninfluencingfactorswithzeroinflatedregressionmodelsinalargescalepopulationsurvey AT hanshaomei studyofdepressioninfluencingfactorswithzeroinflatedregressionmodelsinalargescalepopulationsurvey |