Cargando…

Individualized prediction of chronic kidney disease for the elderly in longevity areas in China: Machine learning approaches

BACKGROUND: Chronic kidney disease (CKD) has become a major public health problem worldwide and has caused a huge social and economic burden, especially in developing countries. No previous study has used machine learning (ML) methods combined with longitudinal data to predict the risk of CKD develo...

Descripción completa

Detalles Bibliográficos
Autores principales: Su, Dai, Zhang, Xingyu, He, Kevin, Chen, Yingchun, Wu, Nina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9634246/
https://www.ncbi.nlm.nih.gov/pubmed/36339144
http://dx.doi.org/10.3389/fpubh.2022.998549
_version_ 1784824425873408000
author Su, Dai
Zhang, Xingyu
He, Kevin
Chen, Yingchun
Wu, Nina
author_facet Su, Dai
Zhang, Xingyu
He, Kevin
Chen, Yingchun
Wu, Nina
author_sort Su, Dai
collection PubMed
description BACKGROUND: Chronic kidney disease (CKD) has become a major public health problem worldwide and has caused a huge social and economic burden, especially in developing countries. No previous study has used machine learning (ML) methods combined with longitudinal data to predict the risk of CKD development in 2 years amongst the elderly in China. METHODS: This study was based on the panel data of 925 elderly individuals in the 2012 baseline survey and 2014 follow-up survey of the Healthy Aging and Biomarkers Cohort Study (HABCS) database. Six ML models, logistic regression (LR), lasso regression, random forests (RF), gradient-boosted decision tree (GBDT), support vector machine (SVM), and deep neural network (DNN), were developed to predict the probability of CKD amongst the elderly in 2 years (the year of 2014). The decision curve analysis (DCA) provided a range of threshold probability of the outcome and the net benefit of each ML model. RESULTS: Amongst the 925 elderly in the HABCS 2014 survey, 289 (18.8%) had CKD. Compared with the other models, LR, lasso regression, RF, GBDT, and DNN had no statistical significance of the area under the receiver operating curve (AUC) value (>0.7), and SVM exhibited the lowest predictive performance (AUC = 0.633, p-value = 0.057). DNN had the highest positive predictive value (PPV) (0.328), whereas LR had the lowest (0.287). DCA results indicated that within the threshold ranges of ~0–0.03 and 0.37–0.40, the net benefit of GBDT was the largest. Within the threshold ranges of ~0.03–0.10 and 0.26–0.30, the net benefit of RF was the largest. Age was the most important predictor variable in the RF and GBDT models. Blood urea nitrogen, serum albumin, uric acid, body mass index (BMI), marital status, activities of daily living (ADL)/instrumental activities of daily living (IADL) and gender were crucial in predicting CKD in the elderly. CONCLUSION: The ML model could successfully capture the linear and nonlinear relationships of risk factors for CKD in the elderly. The decision support system based on the predictive model in this research can help medical staff detect and intervene in the health of the elderly early.
format Online
Article
Text
id pubmed-9634246
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-96342462022-11-05 Individualized prediction of chronic kidney disease for the elderly in longevity areas in China: Machine learning approaches Su, Dai Zhang, Xingyu He, Kevin Chen, Yingchun Wu, Nina Front Public Health Public Health BACKGROUND: Chronic kidney disease (CKD) has become a major public health problem worldwide and has caused a huge social and economic burden, especially in developing countries. No previous study has used machine learning (ML) methods combined with longitudinal data to predict the risk of CKD development in 2 years amongst the elderly in China. METHODS: This study was based on the panel data of 925 elderly individuals in the 2012 baseline survey and 2014 follow-up survey of the Healthy Aging and Biomarkers Cohort Study (HABCS) database. Six ML models, logistic regression (LR), lasso regression, random forests (RF), gradient-boosted decision tree (GBDT), support vector machine (SVM), and deep neural network (DNN), were developed to predict the probability of CKD amongst the elderly in 2 years (the year of 2014). The decision curve analysis (DCA) provided a range of threshold probability of the outcome and the net benefit of each ML model. RESULTS: Amongst the 925 elderly in the HABCS 2014 survey, 289 (18.8%) had CKD. Compared with the other models, LR, lasso regression, RF, GBDT, and DNN had no statistical significance of the area under the receiver operating curve (AUC) value (>0.7), and SVM exhibited the lowest predictive performance (AUC = 0.633, p-value = 0.057). DNN had the highest positive predictive value (PPV) (0.328), whereas LR had the lowest (0.287). DCA results indicated that within the threshold ranges of ~0–0.03 and 0.37–0.40, the net benefit of GBDT was the largest. Within the threshold ranges of ~0.03–0.10 and 0.26–0.30, the net benefit of RF was the largest. Age was the most important predictor variable in the RF and GBDT models. Blood urea nitrogen, serum albumin, uric acid, body mass index (BMI), marital status, activities of daily living (ADL)/instrumental activities of daily living (IADL) and gender were crucial in predicting CKD in the elderly. CONCLUSION: The ML model could successfully capture the linear and nonlinear relationships of risk factors for CKD in the elderly. The decision support system based on the predictive model in this research can help medical staff detect and intervene in the health of the elderly early. Frontiers Media S.A. 2022-10-21 /pmc/articles/PMC9634246/ /pubmed/36339144 http://dx.doi.org/10.3389/fpubh.2022.998549 Text en Copyright © 2022 Su, Zhang, He, Chen and Wu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Public Health
Su, Dai
Zhang, Xingyu
He, Kevin
Chen, Yingchun
Wu, Nina
Individualized prediction of chronic kidney disease for the elderly in longevity areas in China: Machine learning approaches
title Individualized prediction of chronic kidney disease for the elderly in longevity areas in China: Machine learning approaches
title_full Individualized prediction of chronic kidney disease for the elderly in longevity areas in China: Machine learning approaches
title_fullStr Individualized prediction of chronic kidney disease for the elderly in longevity areas in China: Machine learning approaches
title_full_unstemmed Individualized prediction of chronic kidney disease for the elderly in longevity areas in China: Machine learning approaches
title_short Individualized prediction of chronic kidney disease for the elderly in longevity areas in China: Machine learning approaches
title_sort individualized prediction of chronic kidney disease for the elderly in longevity areas in china: machine learning approaches
topic Public Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9634246/
https://www.ncbi.nlm.nih.gov/pubmed/36339144
http://dx.doi.org/10.3389/fpubh.2022.998549
work_keys_str_mv AT sudai individualizedpredictionofchronickidneydiseasefortheelderlyinlongevityareasinchinamachinelearningapproaches
AT zhangxingyu individualizedpredictionofchronickidneydiseasefortheelderlyinlongevityareasinchinamachinelearningapproaches
AT hekevin individualizedpredictionofchronickidneydiseasefortheelderlyinlongevityareasinchinamachinelearningapproaches
AT chenyingchun individualizedpredictionofchronickidneydiseasefortheelderlyinlongevityareasinchinamachinelearningapproaches
AT wunina individualizedpredictionofchronickidneydiseasefortheelderlyinlongevityareasinchinamachinelearningapproaches