Cargando…
Prediction of mortality risk of health checkup participants using machine learning-based models: the J-SHC study
Early detection and treatment of diseases through health checkups are effective in improving life expectancy. In this study, we compared the predictive ability for 5-year mortality between two machine learning-based models (gradient boosting decision tree [XGBoost] and neural network) and a conventi...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9391467/ https://www.ncbi.nlm.nih.gov/pubmed/35986034 http://dx.doi.org/10.1038/s41598-022-18276-8 |
_version_ | 1784770859983962112 |
---|---|
author | Kawano, Kazuharu Otaki, Yoichiro Suzuki, Natsuko Fujimoto, Shouichi Iseki, Kunitoshi Moriyama, Toshiki Yamagata, Kunihiro Tsuruya, Kazuhiko Narita, Ichiei Kondo, Masahide Shibagaki, Yugo Kasahara, Masato Asahi, Koichi Watanabe, Tsuyoshi Konta, Tsuneo |
author_facet | Kawano, Kazuharu Otaki, Yoichiro Suzuki, Natsuko Fujimoto, Shouichi Iseki, Kunitoshi Moriyama, Toshiki Yamagata, Kunihiro Tsuruya, Kazuhiko Narita, Ichiei Kondo, Masahide Shibagaki, Yugo Kasahara, Masato Asahi, Koichi Watanabe, Tsuyoshi Konta, Tsuneo |
author_sort | Kawano, Kazuharu |
collection | PubMed |
description | Early detection and treatment of diseases through health checkups are effective in improving life expectancy. In this study, we compared the predictive ability for 5-year mortality between two machine learning-based models (gradient boosting decision tree [XGBoost] and neural network) and a conventional logistic regression model in 116,749 health checkup participants. We built prediction models using a training dataset consisting of 85,361 participants in 2008 and evaluated the models using a test dataset consisting of 31,388 participants from 2009 to 2014. The predictive ability was evaluated by the values of the area under the receiver operating characteristic curve (AUC) in the test dataset. The AUC values were 0.811 for XGBoost, 0.774 for neural network, and 0.772 for logistic regression models, indicating that the predictive ability of XGBoost was the highest. The importance rating of each explanatory variable was evaluated using the SHapley Additive exPlanations (SHAP) values, which were similar among these models. This study showed that the machine learning-based model has a higher predictive ability than the conventional logistic regression model and may be useful for risk assessment and health guidance for health checkup participants. |
format | Online Article Text |
id | pubmed-9391467 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-93914672022-08-21 Prediction of mortality risk of health checkup participants using machine learning-based models: the J-SHC study Kawano, Kazuharu Otaki, Yoichiro Suzuki, Natsuko Fujimoto, Shouichi Iseki, Kunitoshi Moriyama, Toshiki Yamagata, Kunihiro Tsuruya, Kazuhiko Narita, Ichiei Kondo, Masahide Shibagaki, Yugo Kasahara, Masato Asahi, Koichi Watanabe, Tsuyoshi Konta, Tsuneo Sci Rep Article Early detection and treatment of diseases through health checkups are effective in improving life expectancy. In this study, we compared the predictive ability for 5-year mortality between two machine learning-based models (gradient boosting decision tree [XGBoost] and neural network) and a conventional logistic regression model in 116,749 health checkup participants. We built prediction models using a training dataset consisting of 85,361 participants in 2008 and evaluated the models using a test dataset consisting of 31,388 participants from 2009 to 2014. The predictive ability was evaluated by the values of the area under the receiver operating characteristic curve (AUC) in the test dataset. The AUC values were 0.811 for XGBoost, 0.774 for neural network, and 0.772 for logistic regression models, indicating that the predictive ability of XGBoost was the highest. The importance rating of each explanatory variable was evaluated using the SHapley Additive exPlanations (SHAP) values, which were similar among these models. This study showed that the machine learning-based model has a higher predictive ability than the conventional logistic regression model and may be useful for risk assessment and health guidance for health checkup participants. Nature Publishing Group UK 2022-08-19 /pmc/articles/PMC9391467/ /pubmed/35986034 http://dx.doi.org/10.1038/s41598-022-18276-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Kawano, Kazuharu Otaki, Yoichiro Suzuki, Natsuko Fujimoto, Shouichi Iseki, Kunitoshi Moriyama, Toshiki Yamagata, Kunihiro Tsuruya, Kazuhiko Narita, Ichiei Kondo, Masahide Shibagaki, Yugo Kasahara, Masato Asahi, Koichi Watanabe, Tsuyoshi Konta, Tsuneo Prediction of mortality risk of health checkup participants using machine learning-based models: the J-SHC study |
title | Prediction of mortality risk of health checkup participants using machine learning-based models: the J-SHC study |
title_full | Prediction of mortality risk of health checkup participants using machine learning-based models: the J-SHC study |
title_fullStr | Prediction of mortality risk of health checkup participants using machine learning-based models: the J-SHC study |
title_full_unstemmed | Prediction of mortality risk of health checkup participants using machine learning-based models: the J-SHC study |
title_short | Prediction of mortality risk of health checkup participants using machine learning-based models: the J-SHC study |
title_sort | prediction of mortality risk of health checkup participants using machine learning-based models: the j-shc study |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9391467/ https://www.ncbi.nlm.nih.gov/pubmed/35986034 http://dx.doi.org/10.1038/s41598-022-18276-8 |
work_keys_str_mv | AT kawanokazuharu predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy AT otakiyoichiro predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy AT suzukinatsuko predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy AT fujimotoshouichi predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy AT isekikunitoshi predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy AT moriyamatoshiki predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy AT yamagatakunihiro predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy AT tsuruyakazuhiko predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy AT naritaichiei predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy AT kondomasahide predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy AT shibagakiyugo predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy AT kasaharamasato predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy AT asahikoichi predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy AT watanabetsuyoshi predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy AT kontatsuneo predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy |