Cargando…
Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank
BACKGROUND: Suicidal behaviors, including suicide deaths and attempts, are major public health concerns. However, previous suicide models required a huge amount of input features, resulting in limited applicability in clinical practice. OBJECTIVE: We aimed to construct applicable models (ie, with li...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9989910/ https://www.ncbi.nlm.nih.gov/pubmed/36805366 http://dx.doi.org/10.2196/43419 |
_version_ | 1784901849946521600 |
---|---|
author | Wang, Junren Qiu, Jiajun Zhu, Ting Zeng, Yu Yang, Huazhen Shang, Yanan Yin, Jin Sun, Yajing Qu, Yuanyuan Valdimarsdóttir, Unnur A Song, Huan |
author_facet | Wang, Junren Qiu, Jiajun Zhu, Ting Zeng, Yu Yang, Huazhen Shang, Yanan Yin, Jin Sun, Yajing Qu, Yuanyuan Valdimarsdóttir, Unnur A Song, Huan |
author_sort | Wang, Junren |
collection | PubMed |
description | BACKGROUND: Suicidal behaviors, including suicide deaths and attempts, are major public health concerns. However, previous suicide models required a huge amount of input features, resulting in limited applicability in clinical practice. OBJECTIVE: We aimed to construct applicable models (ie, with limited features) for short- and long-term suicidal behavior prediction. We further validated these models among individuals with different genetic risks of suicide. METHODS: Based on the prospective cohort of UK Biobank, we included 223 (0.06%) eligible cases of suicide attempts or deaths, according to hospital inpatient or death register data within 1 year from baseline and randomly selected 4460 (1.18%) controls (1:20) without such records. We similarly identified 833 (0.22%) cases of suicidal behaviors 1 to 6 years from baseline and 16,660 (4.42%) corresponding controls. Based on 143 input features, mainly including sociodemographic, environmental, and psychosocial factors; medical history; and polygenic risk scores (PRS) for suicidality, we applied a bagged balanced light gradient-boosting machine (LightGBM) with stratified 10-fold cross-validation and grid-search to construct the full prediction models for suicide attempts or deaths within 1 year or between 1 and 6 years. The Shapley Additive Explanations (SHAP) approach was used to quantify the importance of input features, and the top 20 features with the highest SHAP values were selected to train the applicable models. The external validity of the established models was assessed among 50,310 individuals who participated in UK Biobank repeated assessments both overall and by the level of PRS for suicidality. RESULTS: Individuals with suicidal behaviors were on average 56 years old, with equal sex distribution. The application of these full models in the external validation data set demonstrated good model performance, with the area under the receiver operating characteristic (AUROC) curves of 0.919 and 0.892 within 1 year and between 1 and 6 years, respectively. Importantly, the applicable models with the top 20 most important features showed comparable external-validated performance (AUROC curves of 0.901 and 0.885) as the full models, based on which we found that individuals in the top quintile of predicted risk accounted for 91.7% (n=11) and 80.7% (n=25) of all suicidality cases within 1 year and during 1 to 6 years, respectively. We further obtained comparable prediction accuracy when applying these models to subpopulations with different genetic susceptibilities to suicidality. For example, for the 1-year risk prediction, the AUROC curves were 0.907 and 0.885 for the high (>2nd tertile of PRS) and low (<1st) genetic susceptibilities groups, respectively. CONCLUSIONS: We established applicable machine learning–based models for predicting both the short- and long-term risk of suicidality with high accuracy across populations of varying genetic risk for suicide, highlighting a cost-effective method of identifying individuals with a high risk of suicidality. |
format | Online Article Text |
id | pubmed-9989910 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-99899102023-03-08 Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank Wang, Junren Qiu, Jiajun Zhu, Ting Zeng, Yu Yang, Huazhen Shang, Yanan Yin, Jin Sun, Yajing Qu, Yuanyuan Valdimarsdóttir, Unnur A Song, Huan JMIR Public Health Surveill Original Paper BACKGROUND: Suicidal behaviors, including suicide deaths and attempts, are major public health concerns. However, previous suicide models required a huge amount of input features, resulting in limited applicability in clinical practice. OBJECTIVE: We aimed to construct applicable models (ie, with limited features) for short- and long-term suicidal behavior prediction. We further validated these models among individuals with different genetic risks of suicide. METHODS: Based on the prospective cohort of UK Biobank, we included 223 (0.06%) eligible cases of suicide attempts or deaths, according to hospital inpatient or death register data within 1 year from baseline and randomly selected 4460 (1.18%) controls (1:20) without such records. We similarly identified 833 (0.22%) cases of suicidal behaviors 1 to 6 years from baseline and 16,660 (4.42%) corresponding controls. Based on 143 input features, mainly including sociodemographic, environmental, and psychosocial factors; medical history; and polygenic risk scores (PRS) for suicidality, we applied a bagged balanced light gradient-boosting machine (LightGBM) with stratified 10-fold cross-validation and grid-search to construct the full prediction models for suicide attempts or deaths within 1 year or between 1 and 6 years. The Shapley Additive Explanations (SHAP) approach was used to quantify the importance of input features, and the top 20 features with the highest SHAP values were selected to train the applicable models. The external validity of the established models was assessed among 50,310 individuals who participated in UK Biobank repeated assessments both overall and by the level of PRS for suicidality. RESULTS: Individuals with suicidal behaviors were on average 56 years old, with equal sex distribution. The application of these full models in the external validation data set demonstrated good model performance, with the area under the receiver operating characteristic (AUROC) curves of 0.919 and 0.892 within 1 year and between 1 and 6 years, respectively. Importantly, the applicable models with the top 20 most important features showed comparable external-validated performance (AUROC curves of 0.901 and 0.885) as the full models, based on which we found that individuals in the top quintile of predicted risk accounted for 91.7% (n=11) and 80.7% (n=25) of all suicidality cases within 1 year and during 1 to 6 years, respectively. We further obtained comparable prediction accuracy when applying these models to subpopulations with different genetic susceptibilities to suicidality. For example, for the 1-year risk prediction, the AUROC curves were 0.907 and 0.885 for the high (>2nd tertile of PRS) and low (<1st) genetic susceptibilities groups, respectively. CONCLUSIONS: We established applicable machine learning–based models for predicting both the short- and long-term risk of suicidality with high accuracy across populations of varying genetic risk for suicide, highlighting a cost-effective method of identifying individuals with a high risk of suicidality. JMIR Publications 2023-02-20 /pmc/articles/PMC9989910/ /pubmed/36805366 http://dx.doi.org/10.2196/43419 Text en ©Junren Wang, Jiajun Qiu, Ting Zhu, Yu Zeng, Huazhen Yang, Yanan Shang, Jin Yin, Yajing Sun, Yuanyuan Qu, Unnur A Valdimarsdóttir, Huan Song. Originally published in JMIR Public Health and Surveillance (https://publichealth.jmir.org), 20.02.2023. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on https://publichealth.jmir.org, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Wang, Junren Qiu, Jiajun Zhu, Ting Zeng, Yu Yang, Huazhen Shang, Yanan Yin, Jin Sun, Yajing Qu, Yuanyuan Valdimarsdóttir, Unnur A Song, Huan Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank |
title | Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank |
title_full | Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank |
title_fullStr | Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank |
title_full_unstemmed | Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank |
title_short | Prediction of Suicidal Behaviors in the Middle-aged Population: Machine Learning Analyses of UK Biobank |
title_sort | prediction of suicidal behaviors in the middle-aged population: machine learning analyses of uk biobank |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9989910/ https://www.ncbi.nlm.nih.gov/pubmed/36805366 http://dx.doi.org/10.2196/43419 |
work_keys_str_mv | AT wangjunren predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank AT qiujiajun predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank AT zhuting predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank AT zengyu predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank AT yanghuazhen predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank AT shangyanan predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank AT yinjin predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank AT sunyajing predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank AT quyuanyuan predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank AT valdimarsdottirunnura predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank AT songhuan predictionofsuicidalbehaviorsinthemiddleagedpopulationmachinelearninganalysesofukbiobank |