Cargando…

Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank

BACKGROUND: Suicidal behaviors, including suicide deaths and attempts, are major public health concerns. However, previous suicide models required a huge amount of input features, resulting in limited applicability in clinical practice. OBJECTIVE: We aimed to construct applicable models (ie, with li...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Junren, Qiu, Jiajun, Zhu, Ting, Zeng, Yu, Yang, Huazhen, Shang, Yanan, Yin, Jin, Sun, Yajing, Qu, Yuanyuan, Valdimarsdóttir, Unnur A, Song, Huan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9989910/
https://www.ncbi.nlm.nih.gov/pubmed/36805366
http://dx.doi.org/10.2196/43419
_version_ 1784901849946521600
author Wang, Junren
Qiu, Jiajun
Zhu, Ting
Zeng, Yu
Yang, Huazhen
Shang, Yanan
Yin, Jin
Sun, Yajing
Qu, Yuanyuan
Valdimarsdóttir, Unnur A
Song, Huan
author_facet Wang, Junren
Qiu, Jiajun
Zhu, Ting
Zeng, Yu
Yang, Huazhen
Shang, Yanan
Yin, Jin
Sun, Yajing
Qu, Yuanyuan
Valdimarsdóttir, Unnur A
Song, Huan
author_sort Wang, Junren
collection PubMed
description BACKGROUND: Suicidal behaviors, including suicide deaths and attempts, are major public health concerns. However, previous suicide models required a huge amount of input features, resulting in limited applicability in clinical practice. OBJECTIVE: We aimed to construct applicable models (ie, with limited features) for short- and long-term suicidal behavior prediction. We further validated these models among individuals with different genetic risks of suicide. METHODS: Based on the prospective cohort of UK Biobank, we included 223 (0.06%) eligible cases of suicide attempts or deaths, according to hospital inpatient or death register data within 1 year from baseline and randomly selected 4460 (1.18%) controls (1:20) without such records. We similarly identified 833 (0.22%) cases of suicidal behaviors 1 to 6 years from baseline and 16,660 (4.42%) corresponding controls. Based on 143 input features, mainly including sociodemographic, environmental, and psychosocial factors; medical history; and polygenic risk scores (PRS) for suicidality, we applied a bagged balanced light gradient-boosting machine (LightGBM) with stratified 10-fold cross-validation and grid-search to construct the full prediction models for suicide attempts or deaths within 1 year or between 1 and 6 years. The Shapley Additive Explanations (SHAP) approach was used to quantify the importance of input features, and the top 20 features with the highest SHAP values were selected to train the applicable models. The external validity of the established models was assessed among 50,310 individuals who participated in UK Biobank repeated assessments both overall and by the level of PRS for suicidality. RESULTS: Individuals with suicidal behaviors were on average 56 years old, with equal sex distribution. The application of these full models in the external validation data set demonstrated good model performance, with the area under the receiver operating characteristic (AUROC) curves of 0.919 and 0.892 within 1 year and between 1 and 6 years, respectively. Importantly, the applicable models with the top 20 most important features showed comparable external-validated performance (AUROC curves of 0.901 and 0.885) as the full models, based on which we found that individuals in the top quintile of predicted risk accounted for 91.7% (n=11) and 80.7% (n=25) of all suicidality cases within 1 year and during 1 to 6 years, respectively. We further obtained comparable prediction accuracy when applying these models to subpopulations with different genetic susceptibilities to suicidality. For example, for the 1-year risk prediction, the AUROC curves were 0.907 and 0.885 for the high (>2nd tertile of PRS) and low (<1st) genetic susceptibilities groups, respectively. CONCLUSIONS: We established applicable machine learning–based models for predicting both the short- and long-term risk of suicidality with high accuracy across populations of varying genetic risk for suicide, highlighting a cost-effective method of identifying individuals with a high risk of suicidality.
format Online
Article
Text
id pubmed-9989910
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-99899102023-03-08 Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank Wang, Junren Qiu, Jiajun Zhu, Ting Zeng, Yu Yang, Huazhen Shang, Yanan Yin, Jin Sun, Yajing Qu, Yuanyuan Valdimarsdóttir, Unnur A Song, Huan JMIR Public Health Surveill Original Paper BACKGROUND: Suicidal behaviors, including suicide deaths and attempts, are major public health concerns. However, previous suicide models required a huge amount of input features, resulting in limited applicability in clinical practice. OBJECTIVE: We aimed to construct applicable models (ie, with limited features) for short- and long-term suicidal behavior prediction. We further validated these models among individuals with different genetic risks of suicide. METHODS: Based on the prospective cohort of UK Biobank, we included 223 (0.06%) eligible cases of suicide attempts or deaths, according to hospital inpatient or death register data within 1 year from baseline and randomly selected 4460 (1.18%) controls (1:20) without such records. We similarly identified 833 (0.22%) cases of suicidal behaviors 1 to 6 years from baseline and 16,660 (4.42%) corresponding controls. Based on 143 input features, mainly including sociodemographic, environmental, and psychosocial factors; medical history; and polygenic risk scores (PRS) for suicidality, we applied a bagged balanced light gradient-boosting machine (LightGBM) with stratified 10-fold cross-validation and grid-search to construct the full prediction models for suicide attempts or deaths within 1 year or between 1 and 6 years. The Shapley Additive Explanations (SHAP) approach was used to quantify the importance of input features, and the top 20 features with the highest SHAP values were selected to train the applicable models. The external validity of the established models was assessed among 50,310 individuals who participated in UK Biobank repeated assessments both overall and by the level of PRS for suicidality. RESULTS: Individuals with suicidal behaviors were on average 56 years old, with equal sex distribution. The application of these full models in the external validation data set demonstrated good model performance, with the area under the receiver operating characteristic (AUROC) curves of 0.919 and 0.892 within 1 year and between 1 and 6 years, respectively. Importantly, the applicable models with the top 20 most important features showed comparable external-validated performance (AUROC curves of 0.901 and 0.885) as the full models, based on which we found that individuals in the top quintile of predicted risk accounted for 91.7% (n=11) and 80.7% (n=25) of all suicidality cases within 1 year and during 1 to 6 years, respectively. We further obtained comparable prediction accuracy when applying these models to subpopulations with different genetic susceptibilities to suicidality. For example, for the 1-year risk prediction, the AUROC curves were 0.907 and 0.885 for the high (>2nd tertile of PRS) and low (<1st) genetic susceptibilities groups, respectively. CONCLUSIONS: We established applicable machine learning–based models for predicting both the short- and long-term risk of suicidality with high accuracy across populations of varying genetic risk for suicide, highlighting a cost-effective method of identifying individuals with a high risk of suicidality. JMIR Publications 2023-02-20 /pmc/articles/PMC9989910/ /pubmed/36805366 http://dx.doi.org/10.2196/43419 Text en ©Junren Wang, Jiajun Qiu, Ting Zhu, Yu Zeng, Huazhen Yang, Yanan Shang, Jin Yin, Yajing Sun, Yuanyuan Qu, Unnur A Valdimarsdóttir, Huan Song. Originally published in JMIR Public Health and Surveillance (https://publichealth.jmir.org), 20.02.2023. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on https://publichealth.jmir.org, as well as this copyright and license information must be included.
spellingShingle Original Paper
Wang, Junren
Qiu, Jiajun
Zhu, Ting
Zeng, Yu
Yang, Huazhen
Shang, Yanan
Yin, Jin
Sun, Yajing
Qu, Yuanyuan
Valdimarsdóttir, Unnur A
Song, Huan
Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank
title Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank
title_full Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank
title_fullStr Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank
title_full_unstemmed Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank
title_short Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank
title_sort prediction of suicidal behaviors in the middle-aged population: machine learning analyses of uk biobank
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9989910/
https://www.ncbi.nlm.nih.gov/pubmed/36805366
http://dx.doi.org/10.2196/43419
work_keys_str_mv AT wangjunren predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank
AT qiujiajun predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank
AT zhuting predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank
AT zengyu predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank
AT yanghuazhen predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank
AT shangyanan predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank
AT yinjin predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank
AT sunyajing predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank
AT quyuanyuan predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank
AT valdimarsdottirunnura predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank
AT songhuan predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank