Cargando…

Ensembles of Convolutional Neural Networks and Transformers for Polyp Segmentation

In the realm of computer vision, semantic segmentation is the task of recognizing objects in images at the pixel level. This is done by performing a classification of each pixel. The task is complex and requires sophisticated skills and knowledge about the context to identify objects’ boundaries. Th...

Descripción completa

Detalles Bibliográficos
Autores principales: Nanni, Loris, Fantozzi, Carlo, Loreggia, Andrea, Lumini, Alessandra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10224477/
https://www.ncbi.nlm.nih.gov/pubmed/37430601
http://dx.doi.org/10.3390/s23104688
_version_ 1785050193699274752
author Nanni, Loris
Fantozzi, Carlo
Loreggia, Andrea
Lumini, Alessandra
author_facet Nanni, Loris
Fantozzi, Carlo
Loreggia, Andrea
Lumini, Alessandra
author_sort Nanni, Loris
collection PubMed
description In the realm of computer vision, semantic segmentation is the task of recognizing objects in images at the pixel level. This is done by performing a classification of each pixel. The task is complex and requires sophisticated skills and knowledge about the context to identify objects’ boundaries. The importance of semantic segmentation in many domains is undisputed. In medical diagnostics, it simplifies the early detection of pathologies, thus mitigating the possible consequences. In this work, we provide a review of the literature on deep ensemble learning models for polyp segmentation and develop new ensembles based on convolutional neural networks and transformers. The development of an effective ensemble entails ensuring diversity between its components. To this end, we combined different models (HarDNet-MSEG, Polyp-PVT, and HSNet) trained with different data augmentation techniques, optimization methods, and learning rates, which we experimentally demonstrate to be useful to form a better ensemble. Most importantly, we introduce a new method to obtain the segmentation mask by averaging intermediate masks after the sigmoid layer. In our extensive experimental evaluation, the average performance of the proposed ensembles over five prominent datasets beat any other solution that we know of. Furthermore, the ensembles also performed better than the state-of-the-art on two of the five datasets, when individually considered, without having been specifically trained for them.
format Online
Article
Text
id pubmed-10224477
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102244772023-05-28 Ensembles of Convolutional Neural Networks and Transformers for Polyp Segmentation Nanni, Loris Fantozzi, Carlo Loreggia, Andrea Lumini, Alessandra Sensors (Basel) Article In the realm of computer vision, semantic segmentation is the task of recognizing objects in images at the pixel level. This is done by performing a classification of each pixel. The task is complex and requires sophisticated skills and knowledge about the context to identify objects’ boundaries. The importance of semantic segmentation in many domains is undisputed. In medical diagnostics, it simplifies the early detection of pathologies, thus mitigating the possible consequences. In this work, we provide a review of the literature on deep ensemble learning models for polyp segmentation and develop new ensembles based on convolutional neural networks and transformers. The development of an effective ensemble entails ensuring diversity between its components. To this end, we combined different models (HarDNet-MSEG, Polyp-PVT, and HSNet) trained with different data augmentation techniques, optimization methods, and learning rates, which we experimentally demonstrate to be useful to form a better ensemble. Most importantly, we introduce a new method to obtain the segmentation mask by averaging intermediate masks after the sigmoid layer. In our extensive experimental evaluation, the average performance of the proposed ensembles over five prominent datasets beat any other solution that we know of. Furthermore, the ensembles also performed better than the state-of-the-art on two of the five datasets, when individually considered, without having been specifically trained for them. MDPI 2023-05-12 /pmc/articles/PMC10224477/ /pubmed/37430601 http://dx.doi.org/10.3390/s23104688 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Nanni, Loris
Fantozzi, Carlo
Loreggia, Andrea
Lumini, Alessandra
Ensembles of Convolutional Neural Networks and Transformers for Polyp Segmentation
title Ensembles of Convolutional Neural Networks and Transformers for Polyp Segmentation
title_full Ensembles of Convolutional Neural Networks and Transformers for Polyp Segmentation
title_fullStr Ensembles of Convolutional Neural Networks and Transformers for Polyp Segmentation
title_full_unstemmed Ensembles of Convolutional Neural Networks and Transformers for Polyp Segmentation
title_short Ensembles of Convolutional Neural Networks and Transformers for Polyp Segmentation
title_sort ensembles of convolutional neural networks and transformers for polyp segmentation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10224477/
https://www.ncbi.nlm.nih.gov/pubmed/37430601
http://dx.doi.org/10.3390/s23104688
work_keys_str_mv AT nanniloris ensemblesofconvolutionalneuralnetworksandtransformersforpolypsegmentation
AT fantozzicarlo ensemblesofconvolutionalneuralnetworksandtransformersforpolypsegmentation
AT loreggiaandrea ensemblesofconvolutionalneuralnetworksandtransformersforpolypsegmentation
AT luminialessandra ensemblesofconvolutionalneuralnetworksandtransformersforpolypsegmentation