Cargando…

Comparison of Convolutional Neural Networks and Transformers for the Classification of Images of COVID-19, Pneumonia and Healthy Individuals as Observed with Computed Tomography

In this work, the performance of five deep learning architectures in classifying COVID-19 in a multi-class set-up is evaluated. The classifiers were built on pretrained ResNet-50, ResNet-50r (with kernel size [Formula: see text] in the first convolutional layer), DenseNet-121, MobileNet-v3 and the s...

Descripción completa

Detalles Bibliográficos
Autores principales: Ascencio-Cabral, Azucena, Reyes-Aldasoro, Constantino Carlos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9500990/
https://www.ncbi.nlm.nih.gov/pubmed/36135403
http://dx.doi.org/10.3390/jimaging8090237
_version_ 1784795360043991040
author Ascencio-Cabral, Azucena
Reyes-Aldasoro, Constantino Carlos
author_facet Ascencio-Cabral, Azucena
Reyes-Aldasoro, Constantino Carlos
author_sort Ascencio-Cabral, Azucena
collection PubMed
description In this work, the performance of five deep learning architectures in classifying COVID-19 in a multi-class set-up is evaluated. The classifiers were built on pretrained ResNet-50, ResNet-50r (with kernel size [Formula: see text] in the first convolutional layer), DenseNet-121, MobileNet-v3 and the state-of-the-art CaiT-24-XXS-224 (CaiT) transformer. The cross entropy and weighted cross entropy were minimised with Adam and AdamW. In total, 20 experiments were conducted with 10 repetitions and obtained the following metrics: accuracy (Acc), balanced accuracy (BA), F(1) and F(2) from the general F [Formula: see text] macro score, Matthew’s Correlation Coefficient (MCC), sensitivity (Sens) and specificity (Spec) followed by bootstrapping. The performance of the classifiers was compared by using the Friedman–Nemenyi test. The results show that less complex architectures such as ResNet-50, ResNet-50r and DenseNet-121 were able to achieve better generalization with rankings of 1.53, 1.71 and 3.05 for the Matthew Correlation Coefficient, respectively, while MobileNet-v3 and CaiT obtained rankings of 3.72 and 5.0, respectively.
format Online
Article
Text
id pubmed-9500990
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95009902022-09-24 Comparison of Convolutional Neural Networks and Transformers for the Classification of Images of COVID-19, Pneumonia and Healthy Individuals as Observed with Computed Tomography Ascencio-Cabral, Azucena Reyes-Aldasoro, Constantino Carlos J Imaging Article In this work, the performance of five deep learning architectures in classifying COVID-19 in a multi-class set-up is evaluated. The classifiers were built on pretrained ResNet-50, ResNet-50r (with kernel size [Formula: see text] in the first convolutional layer), DenseNet-121, MobileNet-v3 and the state-of-the-art CaiT-24-XXS-224 (CaiT) transformer. The cross entropy and weighted cross entropy were minimised with Adam and AdamW. In total, 20 experiments were conducted with 10 repetitions and obtained the following metrics: accuracy (Acc), balanced accuracy (BA), F(1) and F(2) from the general F [Formula: see text] macro score, Matthew’s Correlation Coefficient (MCC), sensitivity (Sens) and specificity (Spec) followed by bootstrapping. The performance of the classifiers was compared by using the Friedman–Nemenyi test. The results show that less complex architectures such as ResNet-50, ResNet-50r and DenseNet-121 were able to achieve better generalization with rankings of 1.53, 1.71 and 3.05 for the Matthew Correlation Coefficient, respectively, while MobileNet-v3 and CaiT obtained rankings of 3.72 and 5.0, respectively. MDPI 2022-09-01 /pmc/articles/PMC9500990/ /pubmed/36135403 http://dx.doi.org/10.3390/jimaging8090237 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Ascencio-Cabral, Azucena
Reyes-Aldasoro, Constantino Carlos
Comparison of Convolutional Neural Networks and Transformers for the Classification of Images of COVID-19, Pneumonia and Healthy Individuals as Observed with Computed Tomography
title Comparison of Convolutional Neural Networks and Transformers for the Classification of Images of COVID-19, Pneumonia and Healthy Individuals as Observed with Computed Tomography
title_full Comparison of Convolutional Neural Networks and Transformers for the Classification of Images of COVID-19, Pneumonia and Healthy Individuals as Observed with Computed Tomography
title_fullStr Comparison of Convolutional Neural Networks and Transformers for the Classification of Images of COVID-19, Pneumonia and Healthy Individuals as Observed with Computed Tomography
title_full_unstemmed Comparison of Convolutional Neural Networks and Transformers for the Classification of Images of COVID-19, Pneumonia and Healthy Individuals as Observed with Computed Tomography
title_short Comparison of Convolutional Neural Networks and Transformers for the Classification of Images of COVID-19, Pneumonia and Healthy Individuals as Observed with Computed Tomography
title_sort comparison of convolutional neural networks and transformers for the classification of images of covid-19, pneumonia and healthy individuals as observed with computed tomography
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9500990/
https://www.ncbi.nlm.nih.gov/pubmed/36135403
http://dx.doi.org/10.3390/jimaging8090237
work_keys_str_mv AT ascenciocabralazucena comparisonofconvolutionalneuralnetworksandtransformersfortheclassificationofimagesofcovid19pneumoniaandhealthyindividualsasobservedwithcomputedtomography
AT reyesaldasoroconstantinocarlos comparisonofconvolutionalneuralnetworksandtransformersfortheclassificationofimagesofcovid19pneumoniaandhealthyindividualsasobservedwithcomputedtomography