Cargando…

Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning

BACKGROUND: Diabetic peripheral neuropathy (DPN) is a common complication of diabetes. Predicting the risk of developing DPN is important for clinical decision-making and designing clinical trials. METHODS: We retrospectively reviewed the data of 1278 patients with diabetes treated in two central ho...

Descripción completa

Detalles Bibliográficos
Autores principales: Lian, Xiaoyang, Qi, Juanzhi, Yuan, Mengqian, Li, Xiaojie, Wang, Ming, Li, Gang, Yang, Tao, Zhong, Jingchen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10394817/
https://www.ncbi.nlm.nih.gov/pubmed/37533059
http://dx.doi.org/10.1186/s12911-023-02232-1
_version_ 1785083453603053568
author Lian, Xiaoyang
Qi, Juanzhi
Yuan, Mengqian
Li, Xiaojie
Wang, Ming
Li, Gang
Yang, Tao
Zhong, Jingchen
author_facet Lian, Xiaoyang
Qi, Juanzhi
Yuan, Mengqian
Li, Xiaojie
Wang, Ming
Li, Gang
Yang, Tao
Zhong, Jingchen
author_sort Lian, Xiaoyang
collection PubMed
description BACKGROUND: Diabetic peripheral neuropathy (DPN) is a common complication of diabetes. Predicting the risk of developing DPN is important for clinical decision-making and designing clinical trials. METHODS: We retrospectively reviewed the data of 1278 patients with diabetes treated in two central hospitals from 2020 to 2022. The data included medical history, physical examination, and biochemical index test results. After feature selection and data balancing, the cohort was divided into training and internal validation datasets at a 7:3 ratio. Training was made in logistic regression, k-nearest neighbor, decision tree, naive bayes, random forest, and extreme gradient boosting (XGBoost) based on machine learning. The k-fold cross-validation was used for model assessment, and the accuracy, precision, recall, F1-score, and the area under the receiver operating characteristic curve (AUC) were adopted to validate the models’ discrimination and clinical practicality. The SHapley Additive exPlanation (SHAP) was used to interpret the best-performing model. RESULTS: The XGBoost model outperformed other models, which had an accuracy of 0·746, precision of 0·765, recall of 0·711, F1-score of 0·736, and AUC of 0·813. The SHAP results indicated that age, disease duration, glycated hemoglobin, insulin resistance index, 24-h urine protein quantification, and urine protein concentration were risk factors for DPN, while the ratio between 2-h postprandial C-peptide and fasting C-peptide(C2/C0), total cholesterol, activated partial thromboplastin time, and creatinine were protective factors. CONCLUSIONS: The machine learning approach helped established a DPN risk prediction model with good performance. The model identified the factors most closely related to DPN. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02232-1.
format Online
Article
Text
id pubmed-10394817
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-103948172023-08-03 Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning Lian, Xiaoyang Qi, Juanzhi Yuan, Mengqian Li, Xiaojie Wang, Ming Li, Gang Yang, Tao Zhong, Jingchen BMC Med Inform Decis Mak Research BACKGROUND: Diabetic peripheral neuropathy (DPN) is a common complication of diabetes. Predicting the risk of developing DPN is important for clinical decision-making and designing clinical trials. METHODS: We retrospectively reviewed the data of 1278 patients with diabetes treated in two central hospitals from 2020 to 2022. The data included medical history, physical examination, and biochemical index test results. After feature selection and data balancing, the cohort was divided into training and internal validation datasets at a 7:3 ratio. Training was made in logistic regression, k-nearest neighbor, decision tree, naive bayes, random forest, and extreme gradient boosting (XGBoost) based on machine learning. The k-fold cross-validation was used for model assessment, and the accuracy, precision, recall, F1-score, and the area under the receiver operating characteristic curve (AUC) were adopted to validate the models’ discrimination and clinical practicality. The SHapley Additive exPlanation (SHAP) was used to interpret the best-performing model. RESULTS: The XGBoost model outperformed other models, which had an accuracy of 0·746, precision of 0·765, recall of 0·711, F1-score of 0·736, and AUC of 0·813. The SHAP results indicated that age, disease duration, glycated hemoglobin, insulin resistance index, 24-h urine protein quantification, and urine protein concentration were risk factors for DPN, while the ratio between 2-h postprandial C-peptide and fasting C-peptide(C2/C0), total cholesterol, activated partial thromboplastin time, and creatinine were protective factors. CONCLUSIONS: The machine learning approach helped established a DPN risk prediction model with good performance. The model identified the factors most closely related to DPN. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02232-1. BioMed Central 2023-08-02 /pmc/articles/PMC10394817/ /pubmed/37533059 http://dx.doi.org/10.1186/s12911-023-02232-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Lian, Xiaoyang
Qi, Juanzhi
Yuan, Mengqian
Li, Xiaojie
Wang, Ming
Li, Gang
Yang, Tao
Zhong, Jingchen
Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning
title Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning
title_full Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning
title_fullStr Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning
title_full_unstemmed Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning
title_short Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning
title_sort study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10394817/
https://www.ncbi.nlm.nih.gov/pubmed/37533059
http://dx.doi.org/10.1186/s12911-023-02232-1
work_keys_str_mv AT lianxiaoyang studyonriskfactorsofdiabeticperipheralneuropathyandestablishmentofapredictionmodelbymachinelearning
AT qijuanzhi studyonriskfactorsofdiabeticperipheralneuropathyandestablishmentofapredictionmodelbymachinelearning
AT yuanmengqian studyonriskfactorsofdiabeticperipheralneuropathyandestablishmentofapredictionmodelbymachinelearning
AT lixiaojie studyonriskfactorsofdiabeticperipheralneuropathyandestablishmentofapredictionmodelbymachinelearning
AT wangming studyonriskfactorsofdiabeticperipheralneuropathyandestablishmentofapredictionmodelbymachinelearning
AT ligang studyonriskfactorsofdiabeticperipheralneuropathyandestablishmentofapredictionmodelbymachinelearning
AT yangtao studyonriskfactorsofdiabeticperipheralneuropathyandestablishmentofapredictionmodelbymachinelearning
AT zhongjingchen studyonriskfactorsofdiabeticperipheralneuropathyandestablishmentofapredictionmodelbymachinelearning