Cargando…
A Machine Learning Approach to Predicting Diabetes Complications
Diabetes mellitus (DM) is a chronic disease that is considered to be life-threatening. It can affect any part of the body over time, resulting in serious complications such as nephropathy, neuropathy, and retinopathy. In this work, several supervised classification algorithms were applied for buildi...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8702133/ https://www.ncbi.nlm.nih.gov/pubmed/34946438 http://dx.doi.org/10.3390/healthcare9121712 |
_version_ | 1784621173392277504 |
---|---|
author | Jian, Yazan Pasquier, Michel Sagahyroon, Assim Aloul, Fadi |
author_facet | Jian, Yazan Pasquier, Michel Sagahyroon, Assim Aloul, Fadi |
author_sort | Jian, Yazan |
collection | PubMed |
description | Diabetes mellitus (DM) is a chronic disease that is considered to be life-threatening. It can affect any part of the body over time, resulting in serious complications such as nephropathy, neuropathy, and retinopathy. In this work, several supervised classification algorithms were applied for building different models to predict and classify eight diabetes complications. The complications include metabolic syndrome, dyslipidemia, neuropathy, nephropathy, diabetic foot, hypertension, obesity, and retinopathy. For this study, a dataset collected by the Rashid Center for Diabetes and Research (RCDR) located in Ajman, UAE, was utilized. The dataset consists of 884 records with 79 features. Some essential preprocessing steps were applied to handle the missing values and unbalanced data problems. Furthermore, feature selection was performed to select the top five and ten features for each complication. The final number of records used to train and build the binary classifiers for each complication was as follows: 428—metabolic syndrome, 836—dyslipidemia, 223—neuropathy, 233—nephropathy, 240—diabetic foot, 586—hypertension, 498—obesity, 228—retinopathy. Repeated stratified k-fold cross-validation (with k = 10 and a total of 10 repetitions) was employed for a better estimation of the performance. Accuracy and F1-score were used to evaluate the models’ performance reaching a maximum of 97.8% and 97.7% for accuracy and F1-scores, respectively. Moreover, by comparing the performance achieved using different attributes’ sets, it was found that by using a selected number of features, we can still build adequate classifiers. |
format | Online Article Text |
id | pubmed-8702133 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-87021332021-12-24 A Machine Learning Approach to Predicting Diabetes Complications Jian, Yazan Pasquier, Michel Sagahyroon, Assim Aloul, Fadi Healthcare (Basel) Article Diabetes mellitus (DM) is a chronic disease that is considered to be life-threatening. It can affect any part of the body over time, resulting in serious complications such as nephropathy, neuropathy, and retinopathy. In this work, several supervised classification algorithms were applied for building different models to predict and classify eight diabetes complications. The complications include metabolic syndrome, dyslipidemia, neuropathy, nephropathy, diabetic foot, hypertension, obesity, and retinopathy. For this study, a dataset collected by the Rashid Center for Diabetes and Research (RCDR) located in Ajman, UAE, was utilized. The dataset consists of 884 records with 79 features. Some essential preprocessing steps were applied to handle the missing values and unbalanced data problems. Furthermore, feature selection was performed to select the top five and ten features for each complication. The final number of records used to train and build the binary classifiers for each complication was as follows: 428—metabolic syndrome, 836—dyslipidemia, 223—neuropathy, 233—nephropathy, 240—diabetic foot, 586—hypertension, 498—obesity, 228—retinopathy. Repeated stratified k-fold cross-validation (with k = 10 and a total of 10 repetitions) was employed for a better estimation of the performance. Accuracy and F1-score were used to evaluate the models’ performance reaching a maximum of 97.8% and 97.7% for accuracy and F1-scores, respectively. Moreover, by comparing the performance achieved using different attributes’ sets, it was found that by using a selected number of features, we can still build adequate classifiers. MDPI 2021-12-09 /pmc/articles/PMC8702133/ /pubmed/34946438 http://dx.doi.org/10.3390/healthcare9121712 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Jian, Yazan Pasquier, Michel Sagahyroon, Assim Aloul, Fadi A Machine Learning Approach to Predicting Diabetes Complications |
title | A Machine Learning Approach to Predicting Diabetes Complications |
title_full | A Machine Learning Approach to Predicting Diabetes Complications |
title_fullStr | A Machine Learning Approach to Predicting Diabetes Complications |
title_full_unstemmed | A Machine Learning Approach to Predicting Diabetes Complications |
title_short | A Machine Learning Approach to Predicting Diabetes Complications |
title_sort | machine learning approach to predicting diabetes complications |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8702133/ https://www.ncbi.nlm.nih.gov/pubmed/34946438 http://dx.doi.org/10.3390/healthcare9121712 |
work_keys_str_mv | AT jianyazan amachinelearningapproachtopredictingdiabetescomplications AT pasquiermichel amachinelearningapproachtopredictingdiabetescomplications AT sagahyroonassim amachinelearningapproachtopredictingdiabetescomplications AT aloulfadi amachinelearningapproachtopredictingdiabetescomplications AT jianyazan machinelearningapproachtopredictingdiabetescomplications AT pasquiermichel machinelearningapproachtopredictingdiabetescomplications AT sagahyroonassim machinelearningapproachtopredictingdiabetescomplications AT aloulfadi machinelearningapproachtopredictingdiabetescomplications |