Cargando…

Optimizing the predictive power of depression screenings using machine learning

OBJECTIVE: Mental health self-report and clinician-rating scales with diagnoses defined by sum-score cut-offs are often used for depression screening. This study investigates whether machine learning (ML) can detect major depressive episodes (MDE) based on screening scales with higher accuracy than...

Descripción completa

Detalles Bibliográficos
Autores principales:	Terhorst, Yannik, Sander, Lasse B, Ebert, David D, Baumeister, Harald
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	SAGE Publications 2023
Materias:	Original Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10467308/ https://www.ncbi.nlm.nih.gov/pubmed/37654715 http://dx.doi.org/10.1177/20552076231194939

_version_	1785099088582148096
author	Terhorst, Yannik Sander, Lasse B Ebert, David D Baumeister, Harald
author_facet	Terhorst, Yannik Sander, Lasse B Ebert, David D Baumeister, Harald
author_sort	Terhorst, Yannik
collection	PubMed
description	OBJECTIVE: Mental health self-report and clinician-rating scales with diagnoses defined by sum-score cut-offs are often used for depression screening. This study investigates whether machine learning (ML) can detect major depressive episodes (MDE) based on screening scales with higher accuracy than best-practice clinical sum-score approaches. METHODS: Primary data was obtained from two RCTs on the treatment of depression. Ground truth were DSM 5 MDE diagnoses based on structured clinical interviews (SCID) and PHQ-9 self-report, clinician-rated QIDS-16, and HAM-D-17 were predictors. ML models were trained using 10-fold cross-validation. Performance was compared against best-practice sum-score cut-offs. Primary outcome was the Area Under the Curve (AUC) of the Receiver Operating Characteristic curve. DeLong's test with bootstrapping was used to test for differences in AUC. Secondary outcomes were balanced accuracy, precision, recall, F1-score, and number needed to diagnose (NND). RESULTS: A total of k = 1030 diagnoses (no diagnosis: k = 775; MDE: k = 255) were included. ML models achieved an AUC(QIDS-16) = 0.94, AUC(HAM-D-17) = 0.88, and AUC(PHQ-9) = 0.83 in the testing set. ML AUC was significantly higher than sum-score cut-offs for QIDS-16 and PHQ-9 (ps ≤ 0.01; HAM_D-17: p = 0.847). Applying optimal prediction thresholds, QIDS-16 classifier achieved clinically relevant improvements (Δbalanced accuracy = 8%, ΔF1-score = 14%, ΔNND = 21%). Differences for PHQ_9 and HAM-D-17 were marginal. CONCLUSIONS: ML augmented depression screenings could potentially make a major contribution to improving MDE diagnosis depending on questionnaire (e.g., QIDS-16). Confirmatory studies are needed before ML enhanced screening can be implemented into routine care practice.
format	Online Article Text
id	pubmed-10467308
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	SAGE Publications
record_format	MEDLINE/PubMed
spelling	pubmed-104673082023-08-31 Optimizing the predictive power of depression screenings using machine learning Terhorst, Yannik Sander, Lasse B Ebert, David D Baumeister, Harald Digit Health Original Research OBJECTIVE: Mental health self-report and clinician-rating scales with diagnoses defined by sum-score cut-offs are often used for depression screening. This study investigates whether machine learning (ML) can detect major depressive episodes (MDE) based on screening scales with higher accuracy than best-practice clinical sum-score approaches. METHODS: Primary data was obtained from two RCTs on the treatment of depression. Ground truth were DSM 5 MDE diagnoses based on structured clinical interviews (SCID) and PHQ-9 self-report, clinician-rated QIDS-16, and HAM-D-17 were predictors. ML models were trained using 10-fold cross-validation. Performance was compared against best-practice sum-score cut-offs. Primary outcome was the Area Under the Curve (AUC) of the Receiver Operating Characteristic curve. DeLong's test with bootstrapping was used to test for differences in AUC. Secondary outcomes were balanced accuracy, precision, recall, F1-score, and number needed to diagnose (NND). RESULTS: A total of k = 1030 diagnoses (no diagnosis: k = 775; MDE: k = 255) were included. ML models achieved an AUC(QIDS-16) = 0.94, AUC(HAM-D-17) = 0.88, and AUC(PHQ-9) = 0.83 in the testing set. ML AUC was significantly higher than sum-score cut-offs for QIDS-16 and PHQ-9 (ps ≤ 0.01; HAM_D-17: p = 0.847). Applying optimal prediction thresholds, QIDS-16 classifier achieved clinically relevant improvements (Δbalanced accuracy = 8%, ΔF1-score = 14%, ΔNND = 21%). Differences for PHQ_9 and HAM-D-17 were marginal. CONCLUSIONS: ML augmented depression screenings could potentially make a major contribution to improving MDE diagnosis depending on questionnaire (e.g., QIDS-16). Confirmatory studies are needed before ML enhanced screening can be implemented into routine care practice. SAGE Publications 2023-08-29 /pmc/articles/PMC10467308/ /pubmed/37654715 http://dx.doi.org/10.1177/20552076231194939 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle	Original Research Terhorst, Yannik Sander, Lasse B Ebert, David D Baumeister, Harald Optimizing the predictive power of depression screenings using machine learning
title	Optimizing the predictive power of depression screenings using machine learning
title_full	Optimizing the predictive power of depression screenings using machine learning
title_fullStr	Optimizing the predictive power of depression screenings using machine learning
title_full_unstemmed	Optimizing the predictive power of depression screenings using machine learning
title_short	Optimizing the predictive power of depression screenings using machine learning
title_sort	optimizing the predictive power of depression screenings using machine learning
topic	Original Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10467308/ https://www.ncbi.nlm.nih.gov/pubmed/37654715 http://dx.doi.org/10.1177/20552076231194939
work_keys_str_mv	AT terhorstyannik optimizingthepredictivepowerofdepressionscreeningsusingmachinelearning AT sanderlasseb optimizingthepredictivepowerofdepressionscreeningsusingmachinelearning AT ebertdavidd optimizingthepredictivepowerofdepressionscreeningsusingmachinelearning AT baumeisterharald optimizingthepredictivepowerofdepressionscreeningsusingmachinelearning

Optimizing the predictive power of depression screenings using machine learning

Ejemplares similares