Cargando…

A Machine Learning Approach to Predicting Diabetes Complications

Diabetes mellitus (DM) is a chronic disease that is considered to be life-threatening. It can affect any part of the body over time, resulting in serious complications such as nephropathy, neuropathy, and retinopathy. In this work, several supervised classification algorithms were applied for buildi...

Descripción completa

Detalles Bibliográficos
Autores principales: Jian, Yazan, Pasquier, Michel, Sagahyroon, Assim, Aloul, Fadi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8702133/
https://www.ncbi.nlm.nih.gov/pubmed/34946438
http://dx.doi.org/10.3390/healthcare9121712
_version_ 1784621173392277504
author Jian, Yazan
Pasquier, Michel
Sagahyroon, Assim
Aloul, Fadi
author_facet Jian, Yazan
Pasquier, Michel
Sagahyroon, Assim
Aloul, Fadi
author_sort Jian, Yazan
collection PubMed
description Diabetes mellitus (DM) is a chronic disease that is considered to be life-threatening. It can affect any part of the body over time, resulting in serious complications such as nephropathy, neuropathy, and retinopathy. In this work, several supervised classification algorithms were applied for building different models to predict and classify eight diabetes complications. The complications include metabolic syndrome, dyslipidemia, neuropathy, nephropathy, diabetic foot, hypertension, obesity, and retinopathy. For this study, a dataset collected by the Rashid Center for Diabetes and Research (RCDR) located in Ajman, UAE, was utilized. The dataset consists of 884 records with 79 features. Some essential preprocessing steps were applied to handle the missing values and unbalanced data problems. Furthermore, feature selection was performed to select the top five and ten features for each complication. The final number of records used to train and build the binary classifiers for each complication was as follows: 428—metabolic syndrome, 836—dyslipidemia, 223—neuropathy, 233—nephropathy, 240—diabetic foot, 586—hypertension, 498—obesity, 228—retinopathy. Repeated stratified k-fold cross-validation (with k = 10 and a total of 10 repetitions) was employed for a better estimation of the performance. Accuracy and F1-score were used to evaluate the models’ performance reaching a maximum of 97.8% and 97.7% for accuracy and F1-scores, respectively. Moreover, by comparing the performance achieved using different attributes’ sets, it was found that by using a selected number of features, we can still build adequate classifiers.
format Online
Article
Text
id pubmed-8702133
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-87021332021-12-24 A Machine Learning Approach to Predicting Diabetes Complications Jian, Yazan Pasquier, Michel Sagahyroon, Assim Aloul, Fadi Healthcare (Basel) Article Diabetes mellitus (DM) is a chronic disease that is considered to be life-threatening. It can affect any part of the body over time, resulting in serious complications such as nephropathy, neuropathy, and retinopathy. In this work, several supervised classification algorithms were applied for building different models to predict and classify eight diabetes complications. The complications include metabolic syndrome, dyslipidemia, neuropathy, nephropathy, diabetic foot, hypertension, obesity, and retinopathy. For this study, a dataset collected by the Rashid Center for Diabetes and Research (RCDR) located in Ajman, UAE, was utilized. The dataset consists of 884 records with 79 features. Some essential preprocessing steps were applied to handle the missing values and unbalanced data problems. Furthermore, feature selection was performed to select the top five and ten features for each complication. The final number of records used to train and build the binary classifiers for each complication was as follows: 428—metabolic syndrome, 836—dyslipidemia, 223—neuropathy, 233—nephropathy, 240—diabetic foot, 586—hypertension, 498—obesity, 228—retinopathy. Repeated stratified k-fold cross-validation (with k = 10 and a total of 10 repetitions) was employed for a better estimation of the performance. Accuracy and F1-score were used to evaluate the models’ performance reaching a maximum of 97.8% and 97.7% for accuracy and F1-scores, respectively. Moreover, by comparing the performance achieved using different attributes’ sets, it was found that by using a selected number of features, we can still build adequate classifiers. MDPI 2021-12-09 /pmc/articles/PMC8702133/ /pubmed/34946438 http://dx.doi.org/10.3390/healthcare9121712 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Jian, Yazan
Pasquier, Michel
Sagahyroon, Assim
Aloul, Fadi
A Machine Learning Approach to Predicting Diabetes Complications
title A Machine Learning Approach to Predicting Diabetes Complications
title_full A Machine Learning Approach to Predicting Diabetes Complications
title_fullStr A Machine Learning Approach to Predicting Diabetes Complications
title_full_unstemmed A Machine Learning Approach to Predicting Diabetes Complications
title_short A Machine Learning Approach to Predicting Diabetes Complications
title_sort machine learning approach to predicting diabetes complications
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8702133/
https://www.ncbi.nlm.nih.gov/pubmed/34946438
http://dx.doi.org/10.3390/healthcare9121712
work_keys_str_mv AT jianyazan amachinelearningapproachtopredictingdiabetescomplications
AT pasquiermichel amachinelearningapproachtopredictingdiabetescomplications
AT sagahyroonassim amachinelearningapproachtopredictingdiabetescomplications
AT aloulfadi amachinelearningapproachtopredictingdiabetescomplications
AT jianyazan machinelearningapproachtopredictingdiabetescomplications
AT pasquiermichel machinelearningapproachtopredictingdiabetescomplications
AT sagahyroonassim machinelearningapproachtopredictingdiabetescomplications
AT aloulfadi machinelearningapproachtopredictingdiabetescomplications