Cargando…
Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning
BACKGROUND: Diabetic peripheral neuropathy (DPN) is a common complication of diabetes. Predicting the risk of developing DPN is important for clinical decision-making and designing clinical trials. METHODS: We retrospectively reviewed the data of 1278 patients with diabetes treated in two central ho...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10394817/ https://www.ncbi.nlm.nih.gov/pubmed/37533059 http://dx.doi.org/10.1186/s12911-023-02232-1 |
_version_ | 1785083453603053568 |
---|---|
author | Lian, Xiaoyang Qi, Juanzhi Yuan, Mengqian Li, Xiaojie Wang, Ming Li, Gang Yang, Tao Zhong, Jingchen |
author_facet | Lian, Xiaoyang Qi, Juanzhi Yuan, Mengqian Li, Xiaojie Wang, Ming Li, Gang Yang, Tao Zhong, Jingchen |
author_sort | Lian, Xiaoyang |
collection | PubMed |
description | BACKGROUND: Diabetic peripheral neuropathy (DPN) is a common complication of diabetes. Predicting the risk of developing DPN is important for clinical decision-making and designing clinical trials. METHODS: We retrospectively reviewed the data of 1278 patients with diabetes treated in two central hospitals from 2020 to 2022. The data included medical history, physical examination, and biochemical index test results. After feature selection and data balancing, the cohort was divided into training and internal validation datasets at a 7:3 ratio. Training was made in logistic regression, k-nearest neighbor, decision tree, naive bayes, random forest, and extreme gradient boosting (XGBoost) based on machine learning. The k-fold cross-validation was used for model assessment, and the accuracy, precision, recall, F1-score, and the area under the receiver operating characteristic curve (AUC) were adopted to validate the models’ discrimination and clinical practicality. The SHapley Additive exPlanation (SHAP) was used to interpret the best-performing model. RESULTS: The XGBoost model outperformed other models, which had an accuracy of 0·746, precision of 0·765, recall of 0·711, F1-score of 0·736, and AUC of 0·813. The SHAP results indicated that age, disease duration, glycated hemoglobin, insulin resistance index, 24-h urine protein quantification, and urine protein concentration were risk factors for DPN, while the ratio between 2-h postprandial C-peptide and fasting C-peptide(C2/C0), total cholesterol, activated partial thromboplastin time, and creatinine were protective factors. CONCLUSIONS: The machine learning approach helped established a DPN risk prediction model with good performance. The model identified the factors most closely related to DPN. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02232-1. |
format | Online Article Text |
id | pubmed-10394817 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-103948172023-08-03 Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning Lian, Xiaoyang Qi, Juanzhi Yuan, Mengqian Li, Xiaojie Wang, Ming Li, Gang Yang, Tao Zhong, Jingchen BMC Med Inform Decis Mak Research BACKGROUND: Diabetic peripheral neuropathy (DPN) is a common complication of diabetes. Predicting the risk of developing DPN is important for clinical decision-making and designing clinical trials. METHODS: We retrospectively reviewed the data of 1278 patients with diabetes treated in two central hospitals from 2020 to 2022. The data included medical history, physical examination, and biochemical index test results. After feature selection and data balancing, the cohort was divided into training and internal validation datasets at a 7:3 ratio. Training was made in logistic regression, k-nearest neighbor, decision tree, naive bayes, random forest, and extreme gradient boosting (XGBoost) based on machine learning. The k-fold cross-validation was used for model assessment, and the accuracy, precision, recall, F1-score, and the area under the receiver operating characteristic curve (AUC) were adopted to validate the models’ discrimination and clinical practicality. The SHapley Additive exPlanation (SHAP) was used to interpret the best-performing model. RESULTS: The XGBoost model outperformed other models, which had an accuracy of 0·746, precision of 0·765, recall of 0·711, F1-score of 0·736, and AUC of 0·813. The SHAP results indicated that age, disease duration, glycated hemoglobin, insulin resistance index, 24-h urine protein quantification, and urine protein concentration were risk factors for DPN, while the ratio between 2-h postprandial C-peptide and fasting C-peptide(C2/C0), total cholesterol, activated partial thromboplastin time, and creatinine were protective factors. CONCLUSIONS: The machine learning approach helped established a DPN risk prediction model with good performance. The model identified the factors most closely related to DPN. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12911-023-02232-1. BioMed Central 2023-08-02 /pmc/articles/PMC10394817/ /pubmed/37533059 http://dx.doi.org/10.1186/s12911-023-02232-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Lian, Xiaoyang Qi, Juanzhi Yuan, Mengqian Li, Xiaojie Wang, Ming Li, Gang Yang, Tao Zhong, Jingchen Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning |
title | Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning |
title_full | Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning |
title_fullStr | Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning |
title_full_unstemmed | Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning |
title_short | Study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning |
title_sort | study on risk factors of diabetic peripheral neuropathy and establishment of a prediction model by machine learning |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10394817/ https://www.ncbi.nlm.nih.gov/pubmed/37533059 http://dx.doi.org/10.1186/s12911-023-02232-1 |
work_keys_str_mv | AT lianxiaoyang studyonriskfactorsofdiabeticperipheralneuropathyandestablishmentofapredictionmodelbymachinelearning AT qijuanzhi studyonriskfactorsofdiabeticperipheralneuropathyandestablishmentofapredictionmodelbymachinelearning AT yuanmengqian studyonriskfactorsofdiabeticperipheralneuropathyandestablishmentofapredictionmodelbymachinelearning AT lixiaojie studyonriskfactorsofdiabeticperipheralneuropathyandestablishmentofapredictionmodelbymachinelearning AT wangming studyonriskfactorsofdiabeticperipheralneuropathyandestablishmentofapredictionmodelbymachinelearning AT ligang studyonriskfactorsofdiabeticperipheralneuropathyandestablishmentofapredictionmodelbymachinelearning AT yangtao studyonriskfactorsofdiabeticperipheralneuropathyandestablishmentofapredictionmodelbymachinelearning AT zhongjingchen studyonriskfactorsofdiabeticperipheralneuropathyandestablishmentofapredictionmodelbymachinelearning |