Cargando…

Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting

Ensemble learning aims to improve prediction performance by combining several models or forecasts. However, how much and which ensemble learning techniques are useful in deep learning-based pipelines for pancreas computed tomography (CT) image classification is a challenge. Ensemble approaches are t...

Descripción completa

Detalles Bibliográficos
Autores principales: Bakasa, Wilson, Viriri, Serestina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10591225/
https://www.ncbi.nlm.nih.gov/pubmed/37876961
http://dx.doi.org/10.3389/frai.2023.1232640
_version_ 1785124178166284288
author Bakasa, Wilson
Viriri, Serestina
author_facet Bakasa, Wilson
Viriri, Serestina
author_sort Bakasa, Wilson
collection PubMed
description Ensemble learning aims to improve prediction performance by combining several models or forecasts. However, how much and which ensemble learning techniques are useful in deep learning-based pipelines for pancreas computed tomography (CT) image classification is a challenge. Ensemble approaches are the most advanced solution to many machine learning problems. These techniques entail training multiple models and combining their predictions to improve the predictive performance of a single model. This article introduces the idea of Stacked Ensemble Deep Learning (SEDL), a pipeline for classifying pancreas CT medical images. The weak learners are Inception V3, VGG16, and ResNet34, and we employed a stacking ensemble. By combining the first-level predictions, an input train set for XGBoost, the ensemble model at the second level of prediction, is created. Extreme Gradient Boosting (XGBoost), employed as a strong learner, will make the final classification. Our findings showed that SEDL performed better, with a 98.8% ensemble accuracy, after some adjustments to the hyperparameters. The Cancer Imaging Archive (TCIA) public access dataset consists of 80 pancreas CT scans with a resolution of 512 * 512 pixels, from 53 male and 27 female subjects. A sample of two hundred and twenty-two images was used for training and testing data. We concluded that implementing the SEDL technique is an effective way to strengthen the robustness and increase the performance of the pipeline for classifying pancreas CT medical images. Interestingly, grouping like-minded or talented learners does not make a difference.
format Online
Article
Text
id pubmed-10591225
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-105912252023-10-24 Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting Bakasa, Wilson Viriri, Serestina Front Artif Intell Artificial Intelligence Ensemble learning aims to improve prediction performance by combining several models or forecasts. However, how much and which ensemble learning techniques are useful in deep learning-based pipelines for pancreas computed tomography (CT) image classification is a challenge. Ensemble approaches are the most advanced solution to many machine learning problems. These techniques entail training multiple models and combining their predictions to improve the predictive performance of a single model. This article introduces the idea of Stacked Ensemble Deep Learning (SEDL), a pipeline for classifying pancreas CT medical images. The weak learners are Inception V3, VGG16, and ResNet34, and we employed a stacking ensemble. By combining the first-level predictions, an input train set for XGBoost, the ensemble model at the second level of prediction, is created. Extreme Gradient Boosting (XGBoost), employed as a strong learner, will make the final classification. Our findings showed that SEDL performed better, with a 98.8% ensemble accuracy, after some adjustments to the hyperparameters. The Cancer Imaging Archive (TCIA) public access dataset consists of 80 pancreas CT scans with a resolution of 512 * 512 pixels, from 53 male and 27 female subjects. A sample of two hundred and twenty-two images was used for training and testing data. We concluded that implementing the SEDL technique is an effective way to strengthen the robustness and increase the performance of the pipeline for classifying pancreas CT medical images. Interestingly, grouping like-minded or talented learners does not make a difference. Frontiers Media S.A. 2023-10-09 /pmc/articles/PMC10591225/ /pubmed/37876961 http://dx.doi.org/10.3389/frai.2023.1232640 Text en Copyright © 2023 Bakasa and Viriri. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Artificial Intelligence
Bakasa, Wilson
Viriri, Serestina
Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting
title Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting
title_full Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting
title_fullStr Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting
title_full_unstemmed Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting
title_short Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting
title_sort stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10591225/
https://www.ncbi.nlm.nih.gov/pubmed/37876961
http://dx.doi.org/10.3389/frai.2023.1232640
work_keys_str_mv AT bakasawilson stackedensembledeeplearningforpancreascancerclassificationusingextremegradientboosting
AT viririserestina stackedensembledeeplearningforpancreascancerclassificationusingextremegradientboosting