Cargando…

Risk factor mining and prediction of urine protein progression in chronic kidney disease: a machine learning- based study

BACKGROUND: Chronic kidney disease (CKD) is a global public health concern. Therefore, to provide timely intervention for non-hospitalized high-risk patients and rationally allocate limited clinical resources is important to mine the key factors when designing a CKD prediction model. METHODS: This s...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Yufei, Ning, Yichun, Li, Yang, Zhu, Bowen, Zhang, Jian, Yang, Yan, Chen, Weize, Yan, Zhixin, Chen, Annan, Shen, Bo, Fang, Yi, Wang, Dong, Song, Nana, Ding, Xiaoqiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10472702/
https://www.ncbi.nlm.nih.gov/pubmed/37653403
http://dx.doi.org/10.1186/s12911-023-02269-2
_version_ 1785100137346891776
author Lu, Yufei
Ning, Yichun
Li, Yang
Zhu, Bowen
Zhang, Jian
Yang, Yan
Chen, Weize
Yan, Zhixin
Chen, Annan
Shen, Bo
Fang, Yi
Wang, Dong
Song, Nana
Ding, Xiaoqiang
author_facet Lu, Yufei
Ning, Yichun
Li, Yang
Zhu, Bowen
Zhang, Jian
Yang, Yan
Chen, Weize
Yan, Zhixin
Chen, Annan
Shen, Bo
Fang, Yi
Wang, Dong
Song, Nana
Ding, Xiaoqiang
author_sort Lu, Yufei
collection PubMed
description BACKGROUND: Chronic kidney disease (CKD) is a global public health concern. Therefore, to provide timely intervention for non-hospitalized high-risk patients and rationally allocate limited clinical resources is important to mine the key factors when designing a CKD prediction model. METHODS: This study included data from 1,358 patients with CKD pathologically confirmed during the period from December 2017 to September 2020 at Zhongshan Hospital. A CKD prediction interpretation framework based on machine learning was proposed. From among 100 variables, 17 were selected for the model construction through a recursive feature elimination with logistic regression feature screening. Several machine learning classifiers, including extreme gradient boosting, gaussian-based naive bayes, a neural network, ridge regression, and linear model logistic regression (LR), were trained, and an ensemble model was developed to predict 24-hour urine protein. The detailed relationship between the risk of CKD progression and these predictors was determined using a global interpretation. A patient-specific analysis was conducted using a local interpretation. RESULTS: The results showed that LR achieved the best performance, with an area under the curve (AUC) of 0.850 in a single machine learning model. The ensemble model constructed using the voting integration method further improved the AUC to 0.856. The major predictors of moderate-to-severe severity included lower levels of 25-OH-vitamin, albumin, transferrin in males, and higher levels of cystatin C. CONCLUSIONS: Compared with the clinical single kidney function evaluation indicators (eGFR, Scr), the machine learning model proposed in this study improved the prediction accuracy of CKD progression by 17.6% and 24.6%, respectively, and the AUC was improved by 0.250 and 0.236, respectively. Our framework can achieve a good predictive interpretation and provide effective clinical decision support. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02269-2.
format Online
Article
Text
id pubmed-10472702
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-104727022023-09-02 Risk factor mining and prediction of urine protein progression in chronic kidney disease: a machine learning- based study Lu, Yufei Ning, Yichun Li, Yang Zhu, Bowen Zhang, Jian Yang, Yan Chen, Weize Yan, Zhixin Chen, Annan Shen, Bo Fang, Yi Wang, Dong Song, Nana Ding, Xiaoqiang BMC Med Inform Decis Mak Research BACKGROUND: Chronic kidney disease (CKD) is a global public health concern. Therefore, to provide timely intervention for non-hospitalized high-risk patients and rationally allocate limited clinical resources is important to mine the key factors when designing a CKD prediction model. METHODS: This study included data from 1,358 patients with CKD pathologically confirmed during the period from December 2017 to September 2020 at Zhongshan Hospital. A CKD prediction interpretation framework based on machine learning was proposed. From among 100 variables, 17 were selected for the model construction through a recursive feature elimination with logistic regression feature screening. Several machine learning classifiers, including extreme gradient boosting, gaussian-based naive bayes, a neural network, ridge regression, and linear model logistic regression (LR), were trained, and an ensemble model was developed to predict 24-hour urine protein. The detailed relationship between the risk of CKD progression and these predictors was determined using a global interpretation. A patient-specific analysis was conducted using a local interpretation. RESULTS: The results showed that LR achieved the best performance, with an area under the curve (AUC) of 0.850 in a single machine learning model. The ensemble model constructed using the voting integration method further improved the AUC to 0.856. The major predictors of moderate-to-severe severity included lower levels of 25-OH-vitamin, albumin, transferrin in males, and higher levels of cystatin C. CONCLUSIONS: Compared with the clinical single kidney function evaluation indicators (eGFR, Scr), the machine learning model proposed in this study improved the prediction accuracy of CKD progression by 17.6% and 24.6%, respectively, and the AUC was improved by 0.250 and 0.236, respectively. Our framework can achieve a good predictive interpretation and provide effective clinical decision support. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02269-2. BioMed Central 2023-08-31 /pmc/articles/PMC10472702/ /pubmed/37653403 http://dx.doi.org/10.1186/s12911-023-02269-2 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Lu, Yufei
Ning, Yichun
Li, Yang
Zhu, Bowen
Zhang, Jian
Yang, Yan
Chen, Weize
Yan, Zhixin
Chen, Annan
Shen, Bo
Fang, Yi
Wang, Dong
Song, Nana
Ding, Xiaoqiang
Risk factor mining and prediction of urine protein progression in chronic kidney disease: a machine learning- based study
title Risk factor mining and prediction of urine protein progression in chronic kidney disease: a machine learning- based study
title_full Risk factor mining and prediction of urine protein progression in chronic kidney disease: a machine learning- based study
title_fullStr Risk factor mining and prediction of urine protein progression in chronic kidney disease: a machine learning- based study
title_full_unstemmed Risk factor mining and prediction of urine protein progression in chronic kidney disease: a machine learning- based study
title_short Risk factor mining and prediction of urine protein progression in chronic kidney disease: a machine learning- based study
title_sort risk factor mining and prediction of urine protein progression in chronic kidney disease: a machine learning- based study
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10472702/
https://www.ncbi.nlm.nih.gov/pubmed/37653403
http://dx.doi.org/10.1186/s12911-023-02269-2
work_keys_str_mv AT luyufei riskfactorminingandpredictionofurineproteinprogressioninchronickidneydiseaseamachinelearningbasedstudy
AT ningyichun riskfactorminingandpredictionofurineproteinprogressioninchronickidneydiseaseamachinelearningbasedstudy
AT liyang riskfactorminingandpredictionofurineproteinprogressioninchronickidneydiseaseamachinelearningbasedstudy
AT zhubowen riskfactorminingandpredictionofurineproteinprogressioninchronickidneydiseaseamachinelearningbasedstudy
AT zhangjian riskfactorminingandpredictionofurineproteinprogressioninchronickidneydiseaseamachinelearningbasedstudy
AT yangyan riskfactorminingandpredictionofurineproteinprogressioninchronickidneydiseaseamachinelearningbasedstudy
AT chenweize riskfactorminingandpredictionofurineproteinprogressioninchronickidneydiseaseamachinelearningbasedstudy
AT yanzhixin riskfactorminingandpredictionofurineproteinprogressioninchronickidneydiseaseamachinelearningbasedstudy
AT chenannan riskfactorminingandpredictionofurineproteinprogressioninchronickidneydiseaseamachinelearningbasedstudy
AT shenbo riskfactorminingandpredictionofurineproteinprogressioninchronickidneydiseaseamachinelearningbasedstudy
AT fangyi riskfactorminingandpredictionofurineproteinprogressioninchronickidneydiseaseamachinelearningbasedstudy
AT wangdong riskfactorminingandpredictionofurineproteinprogressioninchronickidneydiseaseamachinelearningbasedstudy
AT songnana riskfactorminingandpredictionofurineproteinprogressioninchronickidneydiseaseamachinelearningbasedstudy
AT dingxiaoqiang riskfactorminingandpredictionofurineproteinprogressioninchronickidneydiseaseamachinelearningbasedstudy