Cargando…

Health Data-Driven Machine Learning Algorithms Applied to Risk Indicators Assessment for Chronic Kidney Disease

PURPOSE: As global aging progresses, the health management of chronic diseases has become an important issue of concern to governments. Influenced by the aging of its population and improvements in the medical system and healthcare in general, Taiwan’s population of patients with chronic kidney dise...

Descripción completa

Detalles Bibliográficos
Autores principales: Chiu, Yen-Ling, Jhou, Mao-Jhen, Lee, Tian-Shyug, Lu, Chi-Jie, Chen, Ming-Shu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Dove 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8558038/
https://www.ncbi.nlm.nih.gov/pubmed/34737657
http://dx.doi.org/10.2147/RMHP.S319405
_version_ 1784592480332677120
author Chiu, Yen-Ling
Jhou, Mao-Jhen
Lee, Tian-Shyug
Lu, Chi-Jie
Chen, Ming-Shu
author_facet Chiu, Yen-Ling
Jhou, Mao-Jhen
Lee, Tian-Shyug
Lu, Chi-Jie
Chen, Ming-Shu
author_sort Chiu, Yen-Ling
collection PubMed
description PURPOSE: As global aging progresses, the health management of chronic diseases has become an important issue of concern to governments. Influenced by the aging of its population and improvements in the medical system and healthcare in general, Taiwan’s population of patients with chronic kidney disease (CKD) has tended to grow year by year, including the incidence of high-risk cases that pose major health hazards to the elderly and middle-aged populations. METHODS: This study analyzed the annual health screening data for 65,394 people from 2010 to 2015 sourced from the MJ Group – a major health screening center in Taiwan – including data for 18 risk indicators. We used five prediction model analysis methods, namely, logistic regression (LR) analysis, C5.0 decision tree (C5.0) analysis, stochastic gradient boosting (SGB) analysis, multivariate adaptive regression splines (MARS), and eXtreme gradient boosting (XGboost), with estimated glomerular filtration rate (e-GFR) data to determine G3a, G3b & G4 stage CKD risk factors. RESULTS: The LR analysis (AUC=0.848), SGB analysis (AUC=0.855), and XGboost (AUC=0.858) generated similar classification performance levels and all outperformed the C5.0 and MARS methods. The study results showed that in terms of CKD risk factors, blood urea nitrogen (BUN) and uric acid (UA) were identified as the first and second most important indicators in the models of all five analysis methods, and they were also clinically recognized as the major risk factors. The results for systolic blood pressure (SBP), SGPT, SGOT, and LDL were similar to those of a related study. Interestingly, however, socioeconomic status-related education was found to be the third important indicator in all three of the better performing analysis methods, indicating that it is more important than the other risk indicators of this study, which had different levels of importance according to the different methods. CONCLUSION: The five prediction model methods can provide high and similar classification performance in this study. Based on the results of this study, it is recommended that education as the socioeconomic status should be an important factor for CKD, as high educational level showed a negative and highly significant correlation with CKD. The findings of this study should also be of value for further discussions and follow-up research.
format Online
Article
Text
id pubmed-8558038
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Dove
record_format MEDLINE/PubMed
spelling pubmed-85580382021-11-03 Health Data-Driven Machine Learning Algorithms Applied to Risk Indicators Assessment for Chronic Kidney Disease Chiu, Yen-Ling Jhou, Mao-Jhen Lee, Tian-Shyug Lu, Chi-Jie Chen, Ming-Shu Risk Manag Healthc Policy Original Research PURPOSE: As global aging progresses, the health management of chronic diseases has become an important issue of concern to governments. Influenced by the aging of its population and improvements in the medical system and healthcare in general, Taiwan’s population of patients with chronic kidney disease (CKD) has tended to grow year by year, including the incidence of high-risk cases that pose major health hazards to the elderly and middle-aged populations. METHODS: This study analyzed the annual health screening data for 65,394 people from 2010 to 2015 sourced from the MJ Group – a major health screening center in Taiwan – including data for 18 risk indicators. We used five prediction model analysis methods, namely, logistic regression (LR) analysis, C5.0 decision tree (C5.0) analysis, stochastic gradient boosting (SGB) analysis, multivariate adaptive regression splines (MARS), and eXtreme gradient boosting (XGboost), with estimated glomerular filtration rate (e-GFR) data to determine G3a, G3b & G4 stage CKD risk factors. RESULTS: The LR analysis (AUC=0.848), SGB analysis (AUC=0.855), and XGboost (AUC=0.858) generated similar classification performance levels and all outperformed the C5.0 and MARS methods. The study results showed that in terms of CKD risk factors, blood urea nitrogen (BUN) and uric acid (UA) were identified as the first and second most important indicators in the models of all five analysis methods, and they were also clinically recognized as the major risk factors. The results for systolic blood pressure (SBP), SGPT, SGOT, and LDL were similar to those of a related study. Interestingly, however, socioeconomic status-related education was found to be the third important indicator in all three of the better performing analysis methods, indicating that it is more important than the other risk indicators of this study, which had different levels of importance according to the different methods. CONCLUSION: The five prediction model methods can provide high and similar classification performance in this study. Based on the results of this study, it is recommended that education as the socioeconomic status should be an important factor for CKD, as high educational level showed a negative and highly significant correlation with CKD. The findings of this study should also be of value for further discussions and follow-up research. Dove 2021-10-27 /pmc/articles/PMC8558038/ /pubmed/34737657 http://dx.doi.org/10.2147/RMHP.S319405 Text en © 2021 Chiu et al. https://creativecommons.org/licenses/by-nc/3.0/This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/ (https://creativecommons.org/licenses/by-nc/3.0/) ). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms (https://www.dovepress.com/terms.php).
spellingShingle Original Research
Chiu, Yen-Ling
Jhou, Mao-Jhen
Lee, Tian-Shyug
Lu, Chi-Jie
Chen, Ming-Shu
Health Data-Driven Machine Learning Algorithms Applied to Risk Indicators Assessment for Chronic Kidney Disease
title Health Data-Driven Machine Learning Algorithms Applied to Risk Indicators Assessment for Chronic Kidney Disease
title_full Health Data-Driven Machine Learning Algorithms Applied to Risk Indicators Assessment for Chronic Kidney Disease
title_fullStr Health Data-Driven Machine Learning Algorithms Applied to Risk Indicators Assessment for Chronic Kidney Disease
title_full_unstemmed Health Data-Driven Machine Learning Algorithms Applied to Risk Indicators Assessment for Chronic Kidney Disease
title_short Health Data-Driven Machine Learning Algorithms Applied to Risk Indicators Assessment for Chronic Kidney Disease
title_sort health data-driven machine learning algorithms applied to risk indicators assessment for chronic kidney disease
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8558038/
https://www.ncbi.nlm.nih.gov/pubmed/34737657
http://dx.doi.org/10.2147/RMHP.S319405
work_keys_str_mv AT chiuyenling healthdatadrivenmachinelearningalgorithmsappliedtoriskindicatorsassessmentforchronickidneydisease
AT jhoumaojhen healthdatadrivenmachinelearningalgorithmsappliedtoriskindicatorsassessmentforchronickidneydisease
AT leetianshyug healthdatadrivenmachinelearningalgorithmsappliedtoriskindicatorsassessmentforchronickidneydisease
AT luchijie healthdatadrivenmachinelearningalgorithmsappliedtoriskindicatorsassessmentforchronickidneydisease
AT chenmingshu healthdatadrivenmachinelearningalgorithmsappliedtoriskindicatorsassessmentforchronickidneydisease