Cargando…
Data-Driven Machine-Learning Methods for Diabetes Risk Prediction
Diabetes mellitus is a chronic condition characterized by a disturbance in the metabolism of carbohydrates, fats and proteins. The most characteristic disorder in all forms of diabetes is hyperglycemia, i.e., elevated blood sugar levels. The modern way of life has significantly increased the inciden...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9318204/ https://www.ncbi.nlm.nih.gov/pubmed/35890983 http://dx.doi.org/10.3390/s22145304 |
_version_ | 1784755234409545728 |
---|---|
author | Dritsas, Elias Trigka, Maria |
author_facet | Dritsas, Elias Trigka, Maria |
author_sort | Dritsas, Elias |
collection | PubMed |
description | Diabetes mellitus is a chronic condition characterized by a disturbance in the metabolism of carbohydrates, fats and proteins. The most characteristic disorder in all forms of diabetes is hyperglycemia, i.e., elevated blood sugar levels. The modern way of life has significantly increased the incidence of diabetes. Therefore, early diagnosis of the disease is a necessity. Machine Learning (ML) has gained great popularity among healthcare providers and physicians due to its high potential in developing efficient tools for risk prediction, prognosis, treatment and the management of various conditions. In this study, a supervised learning methodology is described that aims to create risk prediction tools with high efficiency for type 2 diabetes occurrence. A features analysis is conducted to evaluate their importance and explore their association with diabetes. These features are the most common symptoms that often develop slowly with diabetes, and they are utilized to train and test several ML models. Various ML models are evaluated in terms of the Precision, Recall, F-Measure, Accuracy and AUC metrics and compared under 10-fold cross-validation and data splitting. Both validation methods highlighted Random Forest and K-NN as the best performing models in comparison to the other models. |
format | Online Article Text |
id | pubmed-9318204 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-93182042022-07-27 Data-Driven Machine-Learning Methods for Diabetes Risk Prediction Dritsas, Elias Trigka, Maria Sensors (Basel) Article Diabetes mellitus is a chronic condition characterized by a disturbance in the metabolism of carbohydrates, fats and proteins. The most characteristic disorder in all forms of diabetes is hyperglycemia, i.e., elevated blood sugar levels. The modern way of life has significantly increased the incidence of diabetes. Therefore, early diagnosis of the disease is a necessity. Machine Learning (ML) has gained great popularity among healthcare providers and physicians due to its high potential in developing efficient tools for risk prediction, prognosis, treatment and the management of various conditions. In this study, a supervised learning methodology is described that aims to create risk prediction tools with high efficiency for type 2 diabetes occurrence. A features analysis is conducted to evaluate their importance and explore their association with diabetes. These features are the most common symptoms that often develop slowly with diabetes, and they are utilized to train and test several ML models. Various ML models are evaluated in terms of the Precision, Recall, F-Measure, Accuracy and AUC metrics and compared under 10-fold cross-validation and data splitting. Both validation methods highlighted Random Forest and K-NN as the best performing models in comparison to the other models. MDPI 2022-07-15 /pmc/articles/PMC9318204/ /pubmed/35890983 http://dx.doi.org/10.3390/s22145304 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Dritsas, Elias Trigka, Maria Data-Driven Machine-Learning Methods for Diabetes Risk Prediction |
title | Data-Driven Machine-Learning Methods for Diabetes Risk Prediction |
title_full | Data-Driven Machine-Learning Methods for Diabetes Risk Prediction |
title_fullStr | Data-Driven Machine-Learning Methods for Diabetes Risk Prediction |
title_full_unstemmed | Data-Driven Machine-Learning Methods for Diabetes Risk Prediction |
title_short | Data-Driven Machine-Learning Methods for Diabetes Risk Prediction |
title_sort | data-driven machine-learning methods for diabetes risk prediction |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9318204/ https://www.ncbi.nlm.nih.gov/pubmed/35890983 http://dx.doi.org/10.3390/s22145304 |
work_keys_str_mv | AT dritsaselias datadrivenmachinelearningmethodsfordiabetesriskprediction AT trigkamaria datadrivenmachinelearningmethodsfordiabetesriskprediction |