Cargando…

Comparison and development of machine learning tools in the prediction of chronic kidney disease progression

BACKGROUND: Urinary protein quantification is critical for assessing the severity of chronic kidney disease (CKD). However, the current procedure for determining the severity of CKD is completed through evaluating 24-h urinary protein, which is inconvenient during follow-up. OBJECTIVE: To quickly pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiao, Jing, Ding, Ruifeng, Xu, Xiulin, Guan, Haochen, Feng, Xinhui, Sun, Tao, Zhu, Sibo, Ye, Zhibin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6458616/
https://www.ncbi.nlm.nih.gov/pubmed/30971285
http://dx.doi.org/10.1186/s12967-019-1860-0
_version_ 1783410043102691328
author Xiao, Jing
Ding, Ruifeng
Xu, Xiulin
Guan, Haochen
Feng, Xinhui
Sun, Tao
Zhu, Sibo
Ye, Zhibin
author_facet Xiao, Jing
Ding, Ruifeng
Xu, Xiulin
Guan, Haochen
Feng, Xinhui
Sun, Tao
Zhu, Sibo
Ye, Zhibin
author_sort Xiao, Jing
collection PubMed
description BACKGROUND: Urinary protein quantification is critical for assessing the severity of chronic kidney disease (CKD). However, the current procedure for determining the severity of CKD is completed through evaluating 24-h urinary protein, which is inconvenient during follow-up. OBJECTIVE: To quickly predict the severity of CKD using more easily available demographic and blood biochemical features during follow-up, we developed and compared several predictive models using statistical, machine learning and neural network approaches. METHODS: The clinical and blood biochemical results from 551 patients with proteinuria were collected. Thirteen blood-derived tests and 5 demographic features were used as non-urinary clinical variables to predict the 24-h urinary protein outcome response. Nine predictive models were established and compared, including logistic regression, Elastic Net, lasso regression, ridge regression, support vector machine, random forest, XGBoost, neural network and k-nearest neighbor. The AU-ROC, sensitivity (recall), specificity, accuracy, log-loss and precision of each of the models were evaluated. The effect sizes of each variable were analysed and ranked. RESULTS: The linear models including Elastic Net, lasso regression, ridge regression and logistic regression showed the highest overall predictive power, with an average AUC and a precision above 0.87 and 0.8, respectively. Logistic regression ranked first, reaching an AUC of 0.873, with a sensitivity and specificity of 0.83 and 0.82, respectively. The model with the highest sensitivity was Elastic Net (0.85), while XGBoost showed the highest specificity (0.83). In the effect size analyses, we identified that ALB, Scr, TG, LDL and EGFR had important impacts on the predictability of the models, while other predictors such as CRP, HDL and SNA were less important. CONCLUSIONS: Blood-derived tests could be applied as non-urinary predictors during outpatient follow-up. Features in routine blood tests, including ALB, Scr, TG, LDL and EGFR levels, showed predictive ability for CKD severity. The developed online tool can facilitate the prediction of proteinuria progress during follow-up in clinical practice. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12967-019-1860-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6458616
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64586162019-04-19 Comparison and development of machine learning tools in the prediction of chronic kidney disease progression Xiao, Jing Ding, Ruifeng Xu, Xiulin Guan, Haochen Feng, Xinhui Sun, Tao Zhu, Sibo Ye, Zhibin J Transl Med Research BACKGROUND: Urinary protein quantification is critical for assessing the severity of chronic kidney disease (CKD). However, the current procedure for determining the severity of CKD is completed through evaluating 24-h urinary protein, which is inconvenient during follow-up. OBJECTIVE: To quickly predict the severity of CKD using more easily available demographic and blood biochemical features during follow-up, we developed and compared several predictive models using statistical, machine learning and neural network approaches. METHODS: The clinical and blood biochemical results from 551 patients with proteinuria were collected. Thirteen blood-derived tests and 5 demographic features were used as non-urinary clinical variables to predict the 24-h urinary protein outcome response. Nine predictive models were established and compared, including logistic regression, Elastic Net, lasso regression, ridge regression, support vector machine, random forest, XGBoost, neural network and k-nearest neighbor. The AU-ROC, sensitivity (recall), specificity, accuracy, log-loss and precision of each of the models were evaluated. The effect sizes of each variable were analysed and ranked. RESULTS: The linear models including Elastic Net, lasso regression, ridge regression and logistic regression showed the highest overall predictive power, with an average AUC and a precision above 0.87 and 0.8, respectively. Logistic regression ranked first, reaching an AUC of 0.873, with a sensitivity and specificity of 0.83 and 0.82, respectively. The model with the highest sensitivity was Elastic Net (0.85), while XGBoost showed the highest specificity (0.83). In the effect size analyses, we identified that ALB, Scr, TG, LDL and EGFR had important impacts on the predictability of the models, while other predictors such as CRP, HDL and SNA were less important. CONCLUSIONS: Blood-derived tests could be applied as non-urinary predictors during outpatient follow-up. Features in routine blood tests, including ALB, Scr, TG, LDL and EGFR levels, showed predictive ability for CKD severity. The developed online tool can facilitate the prediction of proteinuria progress during follow-up in clinical practice. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12967-019-1860-0) contains supplementary material, which is available to authorized users. BioMed Central 2019-04-11 /pmc/articles/PMC6458616/ /pubmed/30971285 http://dx.doi.org/10.1186/s12967-019-1860-0 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Xiao, Jing
Ding, Ruifeng
Xu, Xiulin
Guan, Haochen
Feng, Xinhui
Sun, Tao
Zhu, Sibo
Ye, Zhibin
Comparison and development of machine learning tools in the prediction of chronic kidney disease progression
title Comparison and development of machine learning tools in the prediction of chronic kidney disease progression
title_full Comparison and development of machine learning tools in the prediction of chronic kidney disease progression
title_fullStr Comparison and development of machine learning tools in the prediction of chronic kidney disease progression
title_full_unstemmed Comparison and development of machine learning tools in the prediction of chronic kidney disease progression
title_short Comparison and development of machine learning tools in the prediction of chronic kidney disease progression
title_sort comparison and development of machine learning tools in the prediction of chronic kidney disease progression
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6458616/
https://www.ncbi.nlm.nih.gov/pubmed/30971285
http://dx.doi.org/10.1186/s12967-019-1860-0
work_keys_str_mv AT xiaojing comparisonanddevelopmentofmachinelearningtoolsinthepredictionofchronickidneydiseaseprogression
AT dingruifeng comparisonanddevelopmentofmachinelearningtoolsinthepredictionofchronickidneydiseaseprogression
AT xuxiulin comparisonanddevelopmentofmachinelearningtoolsinthepredictionofchronickidneydiseaseprogression
AT guanhaochen comparisonanddevelopmentofmachinelearningtoolsinthepredictionofchronickidneydiseaseprogression
AT fengxinhui comparisonanddevelopmentofmachinelearningtoolsinthepredictionofchronickidneydiseaseprogression
AT suntao comparisonanddevelopmentofmachinelearningtoolsinthepredictionofchronickidneydiseaseprogression
AT zhusibo comparisonanddevelopmentofmachinelearningtoolsinthepredictionofchronickidneydiseaseprogression
AT yezhibin comparisonanddevelopmentofmachinelearningtoolsinthepredictionofchronickidneydiseaseprogression