Cargando…

Study of depression influencing factors with zero-inflated regression models in a large-scale population survey

OBJECTIVES: The number of depression symptoms can be considered as count data in order to get complete and accurate analyses findings in studies of depression. This study aims to compare the goodness of fit of four count outcomes models by a large survey sample to identify the optimum model for a ri...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Tao, Zhu, Guangjin, Han, Shaomei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5719265/
https://www.ncbi.nlm.nih.gov/pubmed/29187409
http://dx.doi.org/10.1136/bmjopen-2017-016471
_version_ 1783284453107302400
author Xu, Tao
Zhu, Guangjin
Han, Shaomei
author_facet Xu, Tao
Zhu, Guangjin
Han, Shaomei
author_sort Xu, Tao
collection PubMed
description OBJECTIVES: The number of depression symptoms can be considered as count data in order to get complete and accurate analyses findings in studies of depression. This study aims to compare the goodness of fit of four count outcomes models by a large survey sample to identify the optimum model for a risk factor study of the number of depression symptoms. METHODS: 15 820 subjects, aged 10 to 80 years old, who were not suffering from serious chronic diseases and had not run a high fever in the past 15 days, agreed to take part in this survey; 15 462 subjects completed all the survey scales. The number of depression symptoms was the sum of the ‘positive’ responses of seven depression questions. Four count outcomes models and a logistic model were constructed to identify the optimum model of the number of depression symptoms. RESULTS: The mean number of depression symptoms was 1.37±1.55. The over-dispersion test statistic O was 308.011. The alpha dispersion parameter was 0.475 (95% CI 0.443 to 0.508), which was significantly larger than 0. The Vuong test statistic Z was 6.782 and the P value was <0.001, which showed that there were too many zero counts to be accounted for with traditional negative binomial distribution. The zero-inflated negative binomial (ZINB) model had the largest log likelihood and smallest AIC and BIC, suggesting best goodness of fit. In addition, predictive probabilities for many counts in the ZINB model fitted the observed counts best. CONCLUSIONS: All fitting test statistics and the predictive probability curve produced the same findings that the ZINB model was the best model for fitting the number of depression symptoms, assessing both the presence or absence of depression and its severity.
format Online
Article
Text
id pubmed-5719265
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-57192652017-12-08 Study of depression influencing factors with zero-inflated regression models in a large-scale population survey Xu, Tao Zhu, Guangjin Han, Shaomei BMJ Open Epidemiology OBJECTIVES: The number of depression symptoms can be considered as count data in order to get complete and accurate analyses findings in studies of depression. This study aims to compare the goodness of fit of four count outcomes models by a large survey sample to identify the optimum model for a risk factor study of the number of depression symptoms. METHODS: 15 820 subjects, aged 10 to 80 years old, who were not suffering from serious chronic diseases and had not run a high fever in the past 15 days, agreed to take part in this survey; 15 462 subjects completed all the survey scales. The number of depression symptoms was the sum of the ‘positive’ responses of seven depression questions. Four count outcomes models and a logistic model were constructed to identify the optimum model of the number of depression symptoms. RESULTS: The mean number of depression symptoms was 1.37±1.55. The over-dispersion test statistic O was 308.011. The alpha dispersion parameter was 0.475 (95% CI 0.443 to 0.508), which was significantly larger than 0. The Vuong test statistic Z was 6.782 and the P value was <0.001, which showed that there were too many zero counts to be accounted for with traditional negative binomial distribution. The zero-inflated negative binomial (ZINB) model had the largest log likelihood and smallest AIC and BIC, suggesting best goodness of fit. In addition, predictive probabilities for many counts in the ZINB model fitted the observed counts best. CONCLUSIONS: All fitting test statistics and the predictive probability curve produced the same findings that the ZINB model was the best model for fitting the number of depression symptoms, assessing both the presence or absence of depression and its severity. BMJ Publishing Group 2017-11-28 /pmc/articles/PMC5719265/ /pubmed/29187409 http://dx.doi.org/10.1136/bmjopen-2017-016471 Text en © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted. This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/
spellingShingle Epidemiology
Xu, Tao
Zhu, Guangjin
Han, Shaomei
Study of depression influencing factors with zero-inflated regression models in a large-scale population survey
title Study of depression influencing factors with zero-inflated regression models in a large-scale population survey
title_full Study of depression influencing factors with zero-inflated regression models in a large-scale population survey
title_fullStr Study of depression influencing factors with zero-inflated regression models in a large-scale population survey
title_full_unstemmed Study of depression influencing factors with zero-inflated regression models in a large-scale population survey
title_short Study of depression influencing factors with zero-inflated regression models in a large-scale population survey
title_sort study of depression influencing factors with zero-inflated regression models in a large-scale population survey
topic Epidemiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5719265/
https://www.ncbi.nlm.nih.gov/pubmed/29187409
http://dx.doi.org/10.1136/bmjopen-2017-016471
work_keys_str_mv AT xutao studyofdepressioninfluencingfactorswithzeroinflatedregressionmodelsinalargescalepopulationsurvey
AT zhuguangjin studyofdepressioninfluencingfactorswithzeroinflatedregressionmodelsinalargescalepopulationsurvey
AT hanshaomei studyofdepressioninfluencingfactorswithzeroinflatedregressionmodelsinalargescalepopulationsurvey