Cargando…

A Classification Algorithm-Based Hybrid Diabetes Prediction Model

Diabetes is considered to be one of the leading causes of death globally. If diabetes is not treated and detected early, it can lead to a variety of complications. The aim of this study was to develop a model that can accurately predict the likelihood of developing diabetes in patients with the grea...

Descripción completa

Detalles Bibliográficos
Autores principales: Edeh, Michael Onyema, Khalaf, Osamah Ibrahim, Tavera, Carlos Andrés, Tayeb, Sofiane, Ghouali, Samir, Abdulsahib, Ghaida Muttashar, Richard-Nnabu, Nneka Ernestina, Louni, AbdRahmane
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9008347/
https://www.ncbi.nlm.nih.gov/pubmed/35433625
http://dx.doi.org/10.3389/fpubh.2022.829519
_version_ 1784687032505729024
author Edeh, Michael Onyema
Khalaf, Osamah Ibrahim
Tavera, Carlos Andrés
Tayeb, Sofiane
Ghouali, Samir
Abdulsahib, Ghaida Muttashar
Richard-Nnabu, Nneka Ernestina
Louni, AbdRahmane
author_facet Edeh, Michael Onyema
Khalaf, Osamah Ibrahim
Tavera, Carlos Andrés
Tayeb, Sofiane
Ghouali, Samir
Abdulsahib, Ghaida Muttashar
Richard-Nnabu, Nneka Ernestina
Louni, AbdRahmane
author_sort Edeh, Michael Onyema
collection PubMed
description Diabetes is considered to be one of the leading causes of death globally. If diabetes is not treated and detected early, it can lead to a variety of complications. The aim of this study was to develop a model that can accurately predict the likelihood of developing diabetes in patients with the greatest amount of precision. Classification algorithms are widely used in the medical field to classify data into different categories based on some criteria that are relatively restrictive to the individual classifier, Therefore, four machine learning classification algorithms, namely supervised learning algorithms (Random forest, SVM and Naïve Bayes, Decision Tree DT) and unsupervised learning algorithm (k-means), have been a technique that was utilized in this investigation to identify diabetes in its early stages. The experiments are per-formed on two databases, one extracted from the Frankfurt Hospital in Germany and the other from the database. PIMA Indian Diabetes (PIDD) provided by the UCI machine learning repository. The results obtained from the database extracted from Frankfurt Hospital, Germany, showed that the random forest algorithm outperformed with the highest accuracy of 97.6%, and the results obtained from the Pima Indian database showed that the SVM algorithm outperformed with the highest accuracy of 83.1% compared to other algorithms. The validity of these results is confirmed by the process of separating the data set into two parts: a training set and a test set, which is described below. The training set is used to develop the model's capabilities. The test set is used to put the model through its paces and determine its correctness.
format Online
Article
Text
id pubmed-9008347
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-90083472022-04-15 A Classification Algorithm-Based Hybrid Diabetes Prediction Model Edeh, Michael Onyema Khalaf, Osamah Ibrahim Tavera, Carlos Andrés Tayeb, Sofiane Ghouali, Samir Abdulsahib, Ghaida Muttashar Richard-Nnabu, Nneka Ernestina Louni, AbdRahmane Front Public Health Public Health Diabetes is considered to be one of the leading causes of death globally. If diabetes is not treated and detected early, it can lead to a variety of complications. The aim of this study was to develop a model that can accurately predict the likelihood of developing diabetes in patients with the greatest amount of precision. Classification algorithms are widely used in the medical field to classify data into different categories based on some criteria that are relatively restrictive to the individual classifier, Therefore, four machine learning classification algorithms, namely supervised learning algorithms (Random forest, SVM and Naïve Bayes, Decision Tree DT) and unsupervised learning algorithm (k-means), have been a technique that was utilized in this investigation to identify diabetes in its early stages. The experiments are per-formed on two databases, one extracted from the Frankfurt Hospital in Germany and the other from the database. PIMA Indian Diabetes (PIDD) provided by the UCI machine learning repository. The results obtained from the database extracted from Frankfurt Hospital, Germany, showed that the random forest algorithm outperformed with the highest accuracy of 97.6%, and the results obtained from the Pima Indian database showed that the SVM algorithm outperformed with the highest accuracy of 83.1% compared to other algorithms. The validity of these results is confirmed by the process of separating the data set into two parts: a training set and a test set, which is described below. The training set is used to develop the model's capabilities. The test set is used to put the model through its paces and determine its correctness. Frontiers Media S.A. 2022-03-31 /pmc/articles/PMC9008347/ /pubmed/35433625 http://dx.doi.org/10.3389/fpubh.2022.829519 Text en Copyright © 2022 Edeh, Khalaf, Tavera, Tayeb, Ghouali, Abdulsahib, Richard-Nnabu and Louni. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Public Health
Edeh, Michael Onyema
Khalaf, Osamah Ibrahim
Tavera, Carlos Andrés
Tayeb, Sofiane
Ghouali, Samir
Abdulsahib, Ghaida Muttashar
Richard-Nnabu, Nneka Ernestina
Louni, AbdRahmane
A Classification Algorithm-Based Hybrid Diabetes Prediction Model
title A Classification Algorithm-Based Hybrid Diabetes Prediction Model
title_full A Classification Algorithm-Based Hybrid Diabetes Prediction Model
title_fullStr A Classification Algorithm-Based Hybrid Diabetes Prediction Model
title_full_unstemmed A Classification Algorithm-Based Hybrid Diabetes Prediction Model
title_short A Classification Algorithm-Based Hybrid Diabetes Prediction Model
title_sort classification algorithm-based hybrid diabetes prediction model
topic Public Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9008347/
https://www.ncbi.nlm.nih.gov/pubmed/35433625
http://dx.doi.org/10.3389/fpubh.2022.829519
work_keys_str_mv AT edehmichaelonyema aclassificationalgorithmbasedhybriddiabetespredictionmodel
AT khalafosamahibrahim aclassificationalgorithmbasedhybriddiabetespredictionmodel
AT taveracarlosandres aclassificationalgorithmbasedhybriddiabetespredictionmodel
AT tayebsofiane aclassificationalgorithmbasedhybriddiabetespredictionmodel
AT ghoualisamir aclassificationalgorithmbasedhybriddiabetespredictionmodel
AT abdulsahibghaidamuttashar aclassificationalgorithmbasedhybriddiabetespredictionmodel
AT richardnnabunnekaernestina aclassificationalgorithmbasedhybriddiabetespredictionmodel
AT louniabdrahmane aclassificationalgorithmbasedhybriddiabetespredictionmodel
AT edehmichaelonyema classificationalgorithmbasedhybriddiabetespredictionmodel
AT khalafosamahibrahim classificationalgorithmbasedhybriddiabetespredictionmodel
AT taveracarlosandres classificationalgorithmbasedhybriddiabetespredictionmodel
AT tayebsofiane classificationalgorithmbasedhybriddiabetespredictionmodel
AT ghoualisamir classificationalgorithmbasedhybriddiabetespredictionmodel
AT abdulsahibghaidamuttashar classificationalgorithmbasedhybriddiabetespredictionmodel
AT richardnnabunnekaernestina classificationalgorithmbasedhybriddiabetespredictionmodel
AT louniabdrahmane classificationalgorithmbasedhybriddiabetespredictionmodel