Cargando…

An ensemble learning approach for diabetes prediction using boosting techniques

Introduction: Diabetes is considered one of the leading healthcare concerns affecting millions worldwide. Taking appropriate action at the earliest stages of the disease depends on early diabetes prediction and identification. To support healthcare providers for better diagnosis and prognosis of dis...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ganie, Shahid Mohammad, Pramanik, Pijush Kanti Dutta, Bashir Malik, Majid, Mallik, Saurav, Qin, Hong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2023
Materias:	Genetics
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10639159/ https://www.ncbi.nlm.nih.gov/pubmed/37953921 http://dx.doi.org/10.3389/fgene.2023.1252159

_version_	1785133737904701440
author	Ganie, Shahid Mohammad Pramanik, Pijush Kanti Dutta Bashir Malik, Majid Mallik, Saurav Qin, Hong
author_facet	Ganie, Shahid Mohammad Pramanik, Pijush Kanti Dutta Bashir Malik, Majid Mallik, Saurav Qin, Hong
author_sort	Ganie, Shahid Mohammad
collection	PubMed
description	Introduction: Diabetes is considered one of the leading healthcare concerns affecting millions worldwide. Taking appropriate action at the earliest stages of the disease depends on early diabetes prediction and identification. To support healthcare providers for better diagnosis and prognosis of diseases, machine learning has been explored in the healthcare industry in recent years. Methods: To predict diabetes, this research has conducted experiments on five boosting algorithms on the Pima diabetes dataset. The dataset was obtained from the University of California, Irvine (UCI) machine learning repository, which contains several important clinical features. Exploratory data analysis was used to identify the characteristics of the dataset. Moreover, upsampling, normalisation, feature selection, and hyperparameter tuning were employed for predictive analytics. Results: The results were analysed using various statistical/machine learning metrics and k-fold cross-validation techniques. Gradient boosting achieved the greatest accuracy rate of 92.85% among all the classifiers. Precision, recall, f1-score, and receiver operating characteristic (ROC) curves were used to further validate the model. Discussion: The suggested model outperformed the current studies in terms of prediction accuracy, demonstrating its applicability to other diseases with similar predicate indications.
format	Online Article Text
id	pubmed-10639159
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-106391592023-11-11 An ensemble learning approach for diabetes prediction using boosting techniques Ganie, Shahid Mohammad Pramanik, Pijush Kanti Dutta Bashir Malik, Majid Mallik, Saurav Qin, Hong Front Genet Genetics Introduction: Diabetes is considered one of the leading healthcare concerns affecting millions worldwide. Taking appropriate action at the earliest stages of the disease depends on early diabetes prediction and identification. To support healthcare providers for better diagnosis and prognosis of diseases, machine learning has been explored in the healthcare industry in recent years. Methods: To predict diabetes, this research has conducted experiments on five boosting algorithms on the Pima diabetes dataset. The dataset was obtained from the University of California, Irvine (UCI) machine learning repository, which contains several important clinical features. Exploratory data analysis was used to identify the characteristics of the dataset. Moreover, upsampling, normalisation, feature selection, and hyperparameter tuning were employed for predictive analytics. Results: The results were analysed using various statistical/machine learning metrics and k-fold cross-validation techniques. Gradient boosting achieved the greatest accuracy rate of 92.85% among all the classifiers. Precision, recall, f1-score, and receiver operating characteristic (ROC) curves were used to further validate the model. Discussion: The suggested model outperformed the current studies in terms of prediction accuracy, demonstrating its applicability to other diseases with similar predicate indications. Frontiers Media S.A. 2023-10-26 /pmc/articles/PMC10639159/ /pubmed/37953921 http://dx.doi.org/10.3389/fgene.2023.1252159 Text en Copyright © 2023 Ganie, Pramanik, Bashir Malik, Mallik and Qin. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Genetics Ganie, Shahid Mohammad Pramanik, Pijush Kanti Dutta Bashir Malik, Majid Mallik, Saurav Qin, Hong An ensemble learning approach for diabetes prediction using boosting techniques
title	An ensemble learning approach for diabetes prediction using boosting techniques
title_full	An ensemble learning approach for diabetes prediction using boosting techniques
title_fullStr	An ensemble learning approach for diabetes prediction using boosting techniques
title_full_unstemmed	An ensemble learning approach for diabetes prediction using boosting techniques
title_short	An ensemble learning approach for diabetes prediction using boosting techniques
title_sort	ensemble learning approach for diabetes prediction using boosting techniques
topic	Genetics
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10639159/ https://www.ncbi.nlm.nih.gov/pubmed/37953921 http://dx.doi.org/10.3389/fgene.2023.1252159
work_keys_str_mv	AT ganieshahidmohammad anensemblelearningapproachfordiabetespredictionusingboostingtechniques AT pramanikpijushkantidutta anensemblelearningapproachfordiabetespredictionusingboostingtechniques AT bashirmalikmajid anensemblelearningapproachfordiabetespredictionusingboostingtechniques AT malliksaurav anensemblelearningapproachfordiabetespredictionusingboostingtechniques AT qinhong anensemblelearningapproachfordiabetespredictionusingboostingtechniques AT ganieshahidmohammad ensemblelearningapproachfordiabetespredictionusingboostingtechniques AT pramanikpijushkantidutta ensemblelearningapproachfordiabetespredictionusingboostingtechniques AT bashirmalikmajid ensemblelearningapproachfordiabetespredictionusingboostingtechniques AT malliksaurav ensemblelearningapproachfordiabetespredictionusingboostingtechniques AT qinhong ensemblelearningapproachfordiabetespredictionusingboostingtechniques

An ensemble learning approach for diabetes prediction using boosting techniques

Ejemplares similares