Cargando…

Predicting diabetic kidney disease for type 2 diabetes mellitus by machine learning in the real world: a multicenter retrospective study

OBJECTIVE: Diabetic kidney disease (DKD) has been reported as a main microvascular complication of diabetes mellitus. Although renal biopsy is capable of distinguishing DKD from Non Diabetic kidney disease(NDKD), no gold standard has been validated to assess the development of DKD.This study aimed t...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Xiao zhu, Duan, Minjie, Huang, Hao dong, Zhang, Yang, Xiang, Tian yu, Niu, Wu ceng, Zhou, Bei, Wang, Hao lin, Zhang, Ting ting
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10352831/
https://www.ncbi.nlm.nih.gov/pubmed/37469989
http://dx.doi.org/10.3389/fendo.2023.1184190
_version_ 1785074594300821504
author Liu, Xiao zhu
Duan, Minjie
Huang, Hao dong
Zhang, Yang
Xiang, Tian yu
Niu, Wu ceng
Zhou, Bei
Wang, Hao lin
Zhang, Ting ting
author_facet Liu, Xiao zhu
Duan, Minjie
Huang, Hao dong
Zhang, Yang
Xiang, Tian yu
Niu, Wu ceng
Zhou, Bei
Wang, Hao lin
Zhang, Ting ting
author_sort Liu, Xiao zhu
collection PubMed
description OBJECTIVE: Diabetic kidney disease (DKD) has been reported as a main microvascular complication of diabetes mellitus. Although renal biopsy is capable of distinguishing DKD from Non Diabetic kidney disease(NDKD), no gold standard has been validated to assess the development of DKD.This study aimed to build an auxiliary diagnosis model for type 2 Diabetic kidney disease (T2DKD) based on machine learning algorithms. METHODS: Clinical data on 3624 individuals with type 2 diabetes (T2DM) was gathered from January 1, 2019 to December 31, 2019 using a multi-center retrospective database. The data fell into a training set and a validation set at random at a ratio of 8:2. To identify critical clinical variables, the absolute shrinkage and selection operator with the lowest number was employed. Fifteen machine learning models were built to support the diagnosis of T2DKD, and the optimal model was selected in accordance with the area under the receiver operating characteristic curve (AUC) and accuracy. The model was improved with the use of Bayesian Optimization methods. The Shapley Additive explanations (SHAP) approach was used to illustrate prediction findings. RESULTS: DKD was diagnosed in 1856 (51.2 percent) of the 3624 individuals within the final cohort. As revealed by the SHAP findings, the Categorical Boosting (CatBoost) model achieved the optimal performance 1in the prediction of the risk of T2DKD, with an AUC of 0.86 based on the top 38 characteristics. The SHAP findings suggested that a simplified CatBoost model with an AUC of 0.84 was built in accordance with the top 12 characteristics. The more basic model features consisted of systolic blood pressure (SBP), creatinine (CREA), length of stay (LOS), thrombin time (TT), Age, prothrombin time (PT), platelet large cell ratio (P-LCR), albumin (ALB), glucose (GLU), fibrinogen (FIB-C), red blood cell distribution width-standard deviation (RDW-SD), as well as hemoglobin A1C(HbA1C). CONCLUSION: A machine learning-based model for the prediction of the risk of developing T2DKD was built, and its effectiveness was verified. The CatBoost model can contribute to the diagnosis of T2DKD. Clinicians could gain more insights into the outcomes if the ML model is made interpretable.
format Online
Article
Text
id pubmed-10352831
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-103528312023-07-19 Predicting diabetic kidney disease for type 2 diabetes mellitus by machine learning in the real world: a multicenter retrospective study Liu, Xiao zhu Duan, Minjie Huang, Hao dong Zhang, Yang Xiang, Tian yu Niu, Wu ceng Zhou, Bei Wang, Hao lin Zhang, Ting ting Front Endocrinol (Lausanne) Endocrinology OBJECTIVE: Diabetic kidney disease (DKD) has been reported as a main microvascular complication of diabetes mellitus. Although renal biopsy is capable of distinguishing DKD from Non Diabetic kidney disease(NDKD), no gold standard has been validated to assess the development of DKD.This study aimed to build an auxiliary diagnosis model for type 2 Diabetic kidney disease (T2DKD) based on machine learning algorithms. METHODS: Clinical data on 3624 individuals with type 2 diabetes (T2DM) was gathered from January 1, 2019 to December 31, 2019 using a multi-center retrospective database. The data fell into a training set and a validation set at random at a ratio of 8:2. To identify critical clinical variables, the absolute shrinkage and selection operator with the lowest number was employed. Fifteen machine learning models were built to support the diagnosis of T2DKD, and the optimal model was selected in accordance with the area under the receiver operating characteristic curve (AUC) and accuracy. The model was improved with the use of Bayesian Optimization methods. The Shapley Additive explanations (SHAP) approach was used to illustrate prediction findings. RESULTS: DKD was diagnosed in 1856 (51.2 percent) of the 3624 individuals within the final cohort. As revealed by the SHAP findings, the Categorical Boosting (CatBoost) model achieved the optimal performance 1in the prediction of the risk of T2DKD, with an AUC of 0.86 based on the top 38 characteristics. The SHAP findings suggested that a simplified CatBoost model with an AUC of 0.84 was built in accordance with the top 12 characteristics. The more basic model features consisted of systolic blood pressure (SBP), creatinine (CREA), length of stay (LOS), thrombin time (TT), Age, prothrombin time (PT), platelet large cell ratio (P-LCR), albumin (ALB), glucose (GLU), fibrinogen (FIB-C), red blood cell distribution width-standard deviation (RDW-SD), as well as hemoglobin A1C(HbA1C). CONCLUSION: A machine learning-based model for the prediction of the risk of developing T2DKD was built, and its effectiveness was verified. The CatBoost model can contribute to the diagnosis of T2DKD. Clinicians could gain more insights into the outcomes if the ML model is made interpretable. Frontiers Media S.A. 2023-07-04 /pmc/articles/PMC10352831/ /pubmed/37469989 http://dx.doi.org/10.3389/fendo.2023.1184190 Text en Copyright © 2023 Liu, Duan, Huang, Zhang, Xiang, Niu, Zhou, Wang and Zhang https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Endocrinology
Liu, Xiao zhu
Duan, Minjie
Huang, Hao dong
Zhang, Yang
Xiang, Tian yu
Niu, Wu ceng
Zhou, Bei
Wang, Hao lin
Zhang, Ting ting
Predicting diabetic kidney disease for type 2 diabetes mellitus by machine learning in the real world: a multicenter retrospective study
title Predicting diabetic kidney disease for type 2 diabetes mellitus by machine learning in the real world: a multicenter retrospective study
title_full Predicting diabetic kidney disease for type 2 diabetes mellitus by machine learning in the real world: a multicenter retrospective study
title_fullStr Predicting diabetic kidney disease for type 2 diabetes mellitus by machine learning in the real world: a multicenter retrospective study
title_full_unstemmed Predicting diabetic kidney disease for type 2 diabetes mellitus by machine learning in the real world: a multicenter retrospective study
title_short Predicting diabetic kidney disease for type 2 diabetes mellitus by machine learning in the real world: a multicenter retrospective study
title_sort predicting diabetic kidney disease for type 2 diabetes mellitus by machine learning in the real world: a multicenter retrospective study
topic Endocrinology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10352831/
https://www.ncbi.nlm.nih.gov/pubmed/37469989
http://dx.doi.org/10.3389/fendo.2023.1184190
work_keys_str_mv AT liuxiaozhu predictingdiabetickidneydiseasefortype2diabetesmellitusbymachinelearningintherealworldamulticenterretrospectivestudy
AT duanminjie predictingdiabetickidneydiseasefortype2diabetesmellitusbymachinelearningintherealworldamulticenterretrospectivestudy
AT huanghaodong predictingdiabetickidneydiseasefortype2diabetesmellitusbymachinelearningintherealworldamulticenterretrospectivestudy
AT zhangyang predictingdiabetickidneydiseasefortype2diabetesmellitusbymachinelearningintherealworldamulticenterretrospectivestudy
AT xiangtianyu predictingdiabetickidneydiseasefortype2diabetesmellitusbymachinelearningintherealworldamulticenterretrospectivestudy
AT niuwuceng predictingdiabetickidneydiseasefortype2diabetesmellitusbymachinelearningintherealworldamulticenterretrospectivestudy
AT zhoubei predictingdiabetickidneydiseasefortype2diabetesmellitusbymachinelearningintherealworldamulticenterretrospectivestudy
AT wanghaolin predictingdiabetickidneydiseasefortype2diabetesmellitusbymachinelearningintherealworldamulticenterretrospectivestudy
AT zhangtingting predictingdiabetickidneydiseasefortype2diabetesmellitusbymachinelearningintherealworldamulticenterretrospectivestudy