Cargando…

Prediction of mortality risk of health checkup participants using machine learning-based models: the J-SHC study

Early detection and treatment of diseases through health checkups are effective in improving life expectancy. In this study, we compared the predictive ability for 5-year mortality between two machine learning-based models (gradient boosting decision tree [XGBoost] and neural network) and a conventi...

Descripción completa

Detalles Bibliográficos
Autores principales: Kawano, Kazuharu, Otaki, Yoichiro, Suzuki, Natsuko, Fujimoto, Shouichi, Iseki, Kunitoshi, Moriyama, Toshiki, Yamagata, Kunihiro, Tsuruya, Kazuhiko, Narita, Ichiei, Kondo, Masahide, Shibagaki, Yugo, Kasahara, Masato, Asahi, Koichi, Watanabe, Tsuyoshi, Konta, Tsuneo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9391467/
https://www.ncbi.nlm.nih.gov/pubmed/35986034
http://dx.doi.org/10.1038/s41598-022-18276-8
_version_ 1784770859983962112
author Kawano, Kazuharu
Otaki, Yoichiro
Suzuki, Natsuko
Fujimoto, Shouichi
Iseki, Kunitoshi
Moriyama, Toshiki
Yamagata, Kunihiro
Tsuruya, Kazuhiko
Narita, Ichiei
Kondo, Masahide
Shibagaki, Yugo
Kasahara, Masato
Asahi, Koichi
Watanabe, Tsuyoshi
Konta, Tsuneo
author_facet Kawano, Kazuharu
Otaki, Yoichiro
Suzuki, Natsuko
Fujimoto, Shouichi
Iseki, Kunitoshi
Moriyama, Toshiki
Yamagata, Kunihiro
Tsuruya, Kazuhiko
Narita, Ichiei
Kondo, Masahide
Shibagaki, Yugo
Kasahara, Masato
Asahi, Koichi
Watanabe, Tsuyoshi
Konta, Tsuneo
author_sort Kawano, Kazuharu
collection PubMed
description Early detection and treatment of diseases through health checkups are effective in improving life expectancy. In this study, we compared the predictive ability for 5-year mortality between two machine learning-based models (gradient boosting decision tree [XGBoost] and neural network) and a conventional logistic regression model in 116,749 health checkup participants. We built prediction models using a training dataset consisting of 85,361 participants in 2008 and evaluated the models using a test dataset consisting of 31,388 participants from 2009 to 2014. The predictive ability was evaluated by the values of the area under the receiver operating characteristic curve (AUC) in the test dataset. The AUC values were 0.811 for XGBoost, 0.774 for neural network, and 0.772 for logistic regression models, indicating that the predictive ability of XGBoost was the highest. The importance rating of each explanatory variable was evaluated using the SHapley Additive exPlanations (SHAP) values, which were similar among these models. This study showed that the machine learning-based model has a higher predictive ability than the conventional logistic regression model and may be useful for risk assessment and health guidance for health checkup participants.
format Online
Article
Text
id pubmed-9391467
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-93914672022-08-21 Prediction of mortality risk of health checkup participants using machine learning-based models: the J-SHC study Kawano, Kazuharu Otaki, Yoichiro Suzuki, Natsuko Fujimoto, Shouichi Iseki, Kunitoshi Moriyama, Toshiki Yamagata, Kunihiro Tsuruya, Kazuhiko Narita, Ichiei Kondo, Masahide Shibagaki, Yugo Kasahara, Masato Asahi, Koichi Watanabe, Tsuyoshi Konta, Tsuneo Sci Rep Article Early detection and treatment of diseases through health checkups are effective in improving life expectancy. In this study, we compared the predictive ability for 5-year mortality between two machine learning-based models (gradient boosting decision tree [XGBoost] and neural network) and a conventional logistic regression model in 116,749 health checkup participants. We built prediction models using a training dataset consisting of 85,361 participants in 2008 and evaluated the models using a test dataset consisting of 31,388 participants from 2009 to 2014. The predictive ability was evaluated by the values of the area under the receiver operating characteristic curve (AUC) in the test dataset. The AUC values were 0.811 for XGBoost, 0.774 for neural network, and 0.772 for logistic regression models, indicating that the predictive ability of XGBoost was the highest. The importance rating of each explanatory variable was evaluated using the SHapley Additive exPlanations (SHAP) values, which were similar among these models. This study showed that the machine learning-based model has a higher predictive ability than the conventional logistic regression model and may be useful for risk assessment and health guidance for health checkup participants. Nature Publishing Group UK 2022-08-19 /pmc/articles/PMC9391467/ /pubmed/35986034 http://dx.doi.org/10.1038/s41598-022-18276-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Kawano, Kazuharu
Otaki, Yoichiro
Suzuki, Natsuko
Fujimoto, Shouichi
Iseki, Kunitoshi
Moriyama, Toshiki
Yamagata, Kunihiro
Tsuruya, Kazuhiko
Narita, Ichiei
Kondo, Masahide
Shibagaki, Yugo
Kasahara, Masato
Asahi, Koichi
Watanabe, Tsuyoshi
Konta, Tsuneo
Prediction of mortality risk of health checkup participants using machine learning-based models: the J-SHC study
title Prediction of mortality risk of health checkup participants using machine learning-based models: the J-SHC study
title_full Prediction of mortality risk of health checkup participants using machine learning-based models: the J-SHC study
title_fullStr Prediction of mortality risk of health checkup participants using machine learning-based models: the J-SHC study
title_full_unstemmed Prediction of mortality risk of health checkup participants using machine learning-based models: the J-SHC study
title_short Prediction of mortality risk of health checkup participants using machine learning-based models: the J-SHC study
title_sort prediction of mortality risk of health checkup participants using machine learning-based models: the j-shc study
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9391467/
https://www.ncbi.nlm.nih.gov/pubmed/35986034
http://dx.doi.org/10.1038/s41598-022-18276-8
work_keys_str_mv AT kawanokazuharu predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy
AT otakiyoichiro predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy
AT suzukinatsuko predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy
AT fujimotoshouichi predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy
AT isekikunitoshi predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy
AT moriyamatoshiki predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy
AT yamagatakunihiro predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy
AT tsuruyakazuhiko predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy
AT naritaichiei predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy
AT kondomasahide predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy
AT shibagakiyugo predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy
AT kasaharamasato predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy
AT asahikoichi predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy
AT watanabetsuyoshi predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy
AT kontatsuneo predictionofmortalityriskofhealthcheckupparticipantsusingmachinelearningbasedmodelsthejshcstudy