Cargando…

Data-Driven Machine-Learning Methods for Diabetes Risk Prediction

Diabetes mellitus is a chronic condition characterized by a disturbance in the metabolism of carbohydrates, fats and proteins. The most characteristic disorder in all forms of diabetes is hyperglycemia, i.e., elevated blood sugar levels. The modern way of life has significantly increased the inciden...

Descripción completa

Detalles Bibliográficos
Autores principales: Dritsas, Elias, Trigka, Maria
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9318204/
https://www.ncbi.nlm.nih.gov/pubmed/35890983
http://dx.doi.org/10.3390/s22145304
_version_ 1784755234409545728
author Dritsas, Elias
Trigka, Maria
author_facet Dritsas, Elias
Trigka, Maria
author_sort Dritsas, Elias
collection PubMed
description Diabetes mellitus is a chronic condition characterized by a disturbance in the metabolism of carbohydrates, fats and proteins. The most characteristic disorder in all forms of diabetes is hyperglycemia, i.e., elevated blood sugar levels. The modern way of life has significantly increased the incidence of diabetes. Therefore, early diagnosis of the disease is a necessity. Machine Learning (ML) has gained great popularity among healthcare providers and physicians due to its high potential in developing efficient tools for risk prediction, prognosis, treatment and the management of various conditions. In this study, a supervised learning methodology is described that aims to create risk prediction tools with high efficiency for type 2 diabetes occurrence. A features analysis is conducted to evaluate their importance and explore their association with diabetes. These features are the most common symptoms that often develop slowly with diabetes, and they are utilized to train and test several ML models. Various ML models are evaluated in terms of the Precision, Recall, F-Measure, Accuracy and AUC metrics and compared under 10-fold cross-validation and data splitting. Both validation methods highlighted Random Forest and K-NN as the best performing models in comparison to the other models.
format Online
Article
Text
id pubmed-9318204
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-93182042022-07-27 Data-Driven Machine-Learning Methods for Diabetes Risk Prediction Dritsas, Elias Trigka, Maria Sensors (Basel) Article Diabetes mellitus is a chronic condition characterized by a disturbance in the metabolism of carbohydrates, fats and proteins. The most characteristic disorder in all forms of diabetes is hyperglycemia, i.e., elevated blood sugar levels. The modern way of life has significantly increased the incidence of diabetes. Therefore, early diagnosis of the disease is a necessity. Machine Learning (ML) has gained great popularity among healthcare providers and physicians due to its high potential in developing efficient tools for risk prediction, prognosis, treatment and the management of various conditions. In this study, a supervised learning methodology is described that aims to create risk prediction tools with high efficiency for type 2 diabetes occurrence. A features analysis is conducted to evaluate their importance and explore their association with diabetes. These features are the most common symptoms that often develop slowly with diabetes, and they are utilized to train and test several ML models. Various ML models are evaluated in terms of the Precision, Recall, F-Measure, Accuracy and AUC metrics and compared under 10-fold cross-validation and data splitting. Both validation methods highlighted Random Forest and K-NN as the best performing models in comparison to the other models. MDPI 2022-07-15 /pmc/articles/PMC9318204/ /pubmed/35890983 http://dx.doi.org/10.3390/s22145304 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Dritsas, Elias
Trigka, Maria
Data-Driven Machine-Learning Methods for Diabetes Risk Prediction
title Data-Driven Machine-Learning Methods for Diabetes Risk Prediction
title_full Data-Driven Machine-Learning Methods for Diabetes Risk Prediction
title_fullStr Data-Driven Machine-Learning Methods for Diabetes Risk Prediction
title_full_unstemmed Data-Driven Machine-Learning Methods for Diabetes Risk Prediction
title_short Data-Driven Machine-Learning Methods for Diabetes Risk Prediction
title_sort data-driven machine-learning methods for diabetes risk prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9318204/
https://www.ncbi.nlm.nih.gov/pubmed/35890983
http://dx.doi.org/10.3390/s22145304
work_keys_str_mv AT dritsaselias datadrivenmachinelearningmethodsfordiabetesriskprediction
AT trigkamaria datadrivenmachinelearningmethodsfordiabetesriskprediction