
Ensemble Learning for Disease Prediction: A Review

Machine learning models are used to create and enhance various disease prediction frameworks. Ensemble learning is a machine learning technique that combines multiple classifiers to improve performance by making more accurate predictions than a single classifier. Although numerous studies have emplo...


Bibliographic Details
Main authors: Mahajan, Palak, Uddin, Shahadat, Hajati, Farshid, Moni, Mohammad Ali
Format: Online Article Text
Language: English
Published: MDPI 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10298658/
https://www.ncbi.nlm.nih.gov/pubmed/37372925
http://dx.doi.org/10.3390/healthcare11121808
_version_ 1785064171486838784
author Mahajan, Palak
Uddin, Shahadat
Hajati, Farshid
Moni, Mohammad Ali
author_facet Mahajan, Palak
Uddin, Shahadat
Hajati, Farshid
Moni, Mohammad Ali
author_sort Mahajan, Palak
collection PubMed
description Machine learning models are used to build and improve disease prediction frameworks. Ensemble learning is a machine learning technique that combines multiple classifiers to make more accurate predictions than a single classifier. Although numerous studies have employed ensemble approaches for disease prediction, commonly used ensemble approaches have not been thoroughly assessed against heavily researched diseases. Consequently, this study aims to identify significant trends in the accuracy of four ensemble techniques (bagging, boosting, stacking, and voting) across five heavily researched diseases (diabetes, skin disease, kidney disease, liver disease, and heart conditions). Using a well-defined search strategy, we first identified 45 articles from the current literature that applied two or more of the four ensemble approaches to any of these five diseases and were published in 2016–2023. Although stacking was used fewer times (23) than bagging (41) and boosting (37), it was the most accurate approach most often (19 out of 23 times). Voting was the second-best ensemble approach in this review. In the reviewed articles on skin disease and diabetes, stacking was always the most accurate approach. Bagging performed best for kidney disease (five out of six times), and boosting for liver disease and diabetes (four out of six times). Overall, the results show that stacking was more accurate in disease prediction than the other three approaches. Our study also demonstrates variability in the perceived performance of different ensemble approaches against frequently used disease datasets.
The findings of this work will help researchers better understand current trends and hotspots in disease prediction models that employ ensemble learning, and choose a more suitable ensemble model for predictive disease analytics. This article also discusses variability in the perceived performance of different ensemble approaches against frequently used disease datasets.
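As a rough illustration of what "combining multiple classifiers" means for the voting approach named in the abstract, the following minimal sketch implements hard (majority) voting in plain Python. It is not taken from the reviewed article: the three rule-based classifiers, their thresholds, and the (glucose, BMI, age) feature layout are hypothetical and have no clinical validity; they only stand in for trained models.

```python
from collections import Counter
from typing import Callable, List, Sequence

# A "classifier" here is any function that maps a feature vector to a class label.
Classifier = Callable[[Sequence[float]], int]


def majority_vote(classifiers: List[Classifier], x: Sequence[float]) -> int:
    """Hard voting: return the label predicted by the most classifiers.

    Ties are broken in favour of the smallest label so the result is deterministic.
    """
    votes = Counter(clf(x) for clf in classifiers)
    top = max(votes.values())
    return min(label for label, count in votes.items() if count == top)


# Hypothetical stand-ins for trained base models (illustrative thresholds only).
# Feature layout assumed for this sketch: (glucose, bmi, age).
def by_glucose(x: Sequence[float]) -> int:
    return int(x[0] >= 126)


def by_bmi(x: Sequence[float]) -> int:
    return int(x[1] >= 30)


def by_age(x: Sequence[float]) -> int:
    return int(x[2] >= 45)


ensemble = [by_glucose, by_bmi, by_age]
patient = (140.0, 27.5, 52.0)

# Two of the three base classifiers predict class 1 for this input.
prediction = majority_vote(ensemble, patient)
```

Bagging, boosting, and stacking differ in how the base classifiers are trained and combined (resampled training sets, sequential reweighting, and a learned meta-classifier, respectively); this sketch covers only the simplest combination rule.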
format Online
Article
Text
id pubmed-10298658
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-10298658 2023-06-28 Ensemble Learning for Disease Prediction: A Review Mahajan, Palak Uddin, Shahadat Hajati, Farshid Moni, Mohammad Ali Healthcare (Basel) Review MDPI 2023-06-20 /pmc/articles/PMC10298658/ /pubmed/37372925 http://dx.doi.org/10.3390/healthcare11121808 Text en © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title Ensemble Learning for Disease Prediction: A Review
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10298658/
https://www.ncbi.nlm.nih.gov/pubmed/37372925
http://dx.doi.org/10.3390/healthcare11121808