Cargando…

Comparison of Regression and Machine Learning Methods in Depression Forecasting Among Home-Based Elderly Chinese: A Community Based Study

BACKGROUND: Depression is highly prevalent and considered as the most common psychiatric disorder in home-based elderly, while study on forecasting depression risk in the elderly is still limited. In an endeavor to improve accuracy of depression forecasting, machine learning (ML) approaches have bee...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Shaowu, Wu, Yafei, Fang, Ya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8801448/
https://www.ncbi.nlm.nih.gov/pubmed/35111085
http://dx.doi.org/10.3389/fpsyt.2021.764806
_version_ 1784642460979298304
author Lin, Shaowu
Wu, Yafei
Fang, Ya
author_facet Lin, Shaowu
Wu, Yafei
Fang, Ya
author_sort Lin, Shaowu
collection PubMed
description BACKGROUND: Depression is highly prevalent and considered as the most common psychiatric disorder in home-based elderly, while study on forecasting depression risk in the elderly is still limited. In an endeavor to improve accuracy of depression forecasting, machine learning (ML) approaches have been recommended, in addition to the application of more traditional regression approaches. METHODS: A prospective study was employed in home-based elderly Chinese, using baseline (2011) and follow-up (2013) data of the China Health and Retirement Longitudinal Study (CHARLS), a nationally representative cohort study. We compared four algorithms, including the regression-based models (logistic regression, lasso, ridge) and ML method (random forest). Model performance was assessed using repeated nested 10-fold cross-validation. As the main measure of predictive performance, we used the area under the receiver operating characteristic curve (AUC). RESULTS: The mean AUCs of the four predictive models, logistic regression, lasso, ridge, and random forest, were 0.795, 0.794, 0.794, and 0.769, respectively. The main determinants were life satisfaction, self-reported memory, cognitive ability, ADL (activities of daily living) impairment, CESD-10 score. Life satisfaction increased the odds ratio of a future depression by 128.6% (logistic), 13.8% (lasso), and 13.2% (ridge), and cognitive ability was the most important predictor in random forest. CONCLUSIONS: The three regression-based models and one ML algorithm performed equally well in differentiating between a future depression case and a non-depression case in home-based elderly. When choosing a model, different considerations, however, such as easy operating, might in some instances lead to one model being prioritized over another.
format Online
Article
Text
id pubmed-8801448
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-88014482022-02-01 Comparison of Regression and Machine Learning Methods in Depression Forecasting Among Home-Based Elderly Chinese: A Community Based Study Lin, Shaowu Wu, Yafei Fang, Ya Front Psychiatry Psychiatry BACKGROUND: Depression is highly prevalent and considered as the most common psychiatric disorder in home-based elderly, while study on forecasting depression risk in the elderly is still limited. In an endeavor to improve accuracy of depression forecasting, machine learning (ML) approaches have been recommended, in addition to the application of more traditional regression approaches. METHODS: A prospective study was employed in home-based elderly Chinese, using baseline (2011) and follow-up (2013) data of the China Health and Retirement Longitudinal Study (CHARLS), a nationally representative cohort study. We compared four algorithms, including the regression-based models (logistic regression, lasso, ridge) and ML method (random forest). Model performance was assessed using repeated nested 10-fold cross-validation. As the main measure of predictive performance, we used the area under the receiver operating characteristic curve (AUC). RESULTS: The mean AUCs of the four predictive models, logistic regression, lasso, ridge, and random forest, were 0.795, 0.794, 0.794, and 0.769, respectively. The main determinants were life satisfaction, self-reported memory, cognitive ability, ADL (activities of daily living) impairment, CESD-10 score. Life satisfaction increased the odds ratio of a future depression by 128.6% (logistic), 13.8% (lasso), and 13.2% (ridge), and cognitive ability was the most important predictor in random forest. CONCLUSIONS: The three regression-based models and one ML algorithm performed equally well in differentiating between a future depression case and a non-depression case in home-based elderly. When choosing a model, different considerations, however, such as easy operating, might in some instances lead to one model being prioritized over another. Frontiers Media S.A. 2022-01-17 /pmc/articles/PMC8801448/ /pubmed/35111085 http://dx.doi.org/10.3389/fpsyt.2021.764806 Text en Copyright © 2022 Lin, Wu and Fang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychiatry
Lin, Shaowu
Wu, Yafei
Fang, Ya
Comparison of Regression and Machine Learning Methods in Depression Forecasting Among Home-Based Elderly Chinese: A Community Based Study
title Comparison of Regression and Machine Learning Methods in Depression Forecasting Among Home-Based Elderly Chinese: A Community Based Study
title_full Comparison of Regression and Machine Learning Methods in Depression Forecasting Among Home-Based Elderly Chinese: A Community Based Study
title_fullStr Comparison of Regression and Machine Learning Methods in Depression Forecasting Among Home-Based Elderly Chinese: A Community Based Study
title_full_unstemmed Comparison of Regression and Machine Learning Methods in Depression Forecasting Among Home-Based Elderly Chinese: A Community Based Study
title_short Comparison of Regression and Machine Learning Methods in Depression Forecasting Among Home-Based Elderly Chinese: A Community Based Study
title_sort comparison of regression and machine learning methods in depression forecasting among home-based elderly chinese: a community based study
topic Psychiatry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8801448/
https://www.ncbi.nlm.nih.gov/pubmed/35111085
http://dx.doi.org/10.3389/fpsyt.2021.764806
work_keys_str_mv AT linshaowu comparisonofregressionandmachinelearningmethodsindepressionforecastingamonghomebasedelderlychineseacommunitybasedstudy
AT wuyafei comparisonofregressionandmachinelearningmethodsindepressionforecastingamonghomebasedelderlychineseacommunitybasedstudy
AT fangya comparisonofregressionandmachinelearningmethodsindepressionforecastingamonghomebasedelderlychineseacommunitybasedstudy