Cargando…
Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting
Ensemble learning aims to improve prediction performance by combining several models or forecasts. However, how much and which ensemble learning techniques are useful in deep learning-based pipelines for pancreas computed tomography (CT) image classification is a challenge. Ensemble approaches are t...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10591225/ https://www.ncbi.nlm.nih.gov/pubmed/37876961 http://dx.doi.org/10.3389/frai.2023.1232640 |
_version_ | 1785124178166284288 |
---|---|
author | Bakasa, Wilson Viriri, Serestina |
author_facet | Bakasa, Wilson Viriri, Serestina |
author_sort | Bakasa, Wilson |
collection | PubMed |
description | Ensemble learning aims to improve prediction performance by combining several models or forecasts. However, how much and which ensemble learning techniques are useful in deep learning-based pipelines for pancreas computed tomography (CT) image classification is a challenge. Ensemble approaches are the most advanced solution to many machine learning problems. These techniques entail training multiple models and combining their predictions to improve the predictive performance of a single model. This article introduces the idea of Stacked Ensemble Deep Learning (SEDL), a pipeline for classifying pancreas CT medical images. The weak learners are Inception V3, VGG16, and ResNet34, and we employed a stacking ensemble. By combining the first-level predictions, an input train set for XGBoost, the ensemble model at the second level of prediction, is created. Extreme Gradient Boosting (XGBoost), employed as a strong learner, will make the final classification. Our findings showed that SEDL performed better, with a 98.8% ensemble accuracy, after some adjustments to the hyperparameters. The Cancer Imaging Archive (TCIA) public access dataset consists of 80 pancreas CT scans with a resolution of 512 * 512 pixels, from 53 male and 27 female subjects. A sample of two hundred and twenty-two images was used for training and testing data. We concluded that implementing the SEDL technique is an effective way to strengthen the robustness and increase the performance of the pipeline for classifying pancreas CT medical images. Interestingly, grouping like-minded or talented learners does not make a difference. |
format | Online Article Text |
id | pubmed-10591225 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-105912252023-10-24 Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting Bakasa, Wilson Viriri, Serestina Front Artif Intell Artificial Intelligence Ensemble learning aims to improve prediction performance by combining several models or forecasts. However, how much and which ensemble learning techniques are useful in deep learning-based pipelines for pancreas computed tomography (CT) image classification is a challenge. Ensemble approaches are the most advanced solution to many machine learning problems. These techniques entail training multiple models and combining their predictions to improve the predictive performance of a single model. This article introduces the idea of Stacked Ensemble Deep Learning (SEDL), a pipeline for classifying pancreas CT medical images. The weak learners are Inception V3, VGG16, and ResNet34, and we employed a stacking ensemble. By combining the first-level predictions, an input train set for XGBoost, the ensemble model at the second level of prediction, is created. Extreme Gradient Boosting (XGBoost), employed as a strong learner, will make the final classification. Our findings showed that SEDL performed better, with a 98.8% ensemble accuracy, after some adjustments to the hyperparameters. The Cancer Imaging Archive (TCIA) public access dataset consists of 80 pancreas CT scans with a resolution of 512 * 512 pixels, from 53 male and 27 female subjects. A sample of two hundred and twenty-two images was used for training and testing data. We concluded that implementing the SEDL technique is an effective way to strengthen the robustness and increase the performance of the pipeline for classifying pancreas CT medical images. Interestingly, grouping like-minded or talented learners does not make a difference. Frontiers Media S.A. 2023-10-09 /pmc/articles/PMC10591225/ /pubmed/37876961 http://dx.doi.org/10.3389/frai.2023.1232640 Text en Copyright © 2023 Bakasa and Viriri. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Artificial Intelligence Bakasa, Wilson Viriri, Serestina Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting |
title | Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting |
title_full | Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting |
title_fullStr | Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting |
title_full_unstemmed | Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting |
title_short | Stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting |
title_sort | stacked ensemble deep learning for pancreas cancer classification using extreme gradient boosting |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10591225/ https://www.ncbi.nlm.nih.gov/pubmed/37876961 http://dx.doi.org/10.3389/frai.2023.1232640 |
work_keys_str_mv | AT bakasawilson stackedensembledeeplearningforpancreascancerclassificationusingextremegradientboosting AT viririserestina stackedensembledeeplearningforpancreascancerclassificationusingextremegradientboosting |