Cargando…

Facial Expression Recognition Based on Squeeze Vision Transformer

In recent image classification approaches, a vision transformer (ViT) has shown an excellent performance beyond that of a convolutional neural network. A ViT achieves a high classification for natural images because it properly preserves the global image features. Conversely, a ViT still has many li...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Sangwon, Nam, Jaeyeal, Ko, Byoung Chul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9147983/
https://www.ncbi.nlm.nih.gov/pubmed/35632135
http://dx.doi.org/10.3390/s22103729
_version_ 1784716941217234944
author Kim, Sangwon
Nam, Jaeyeal
Ko, Byoung Chul
author_facet Kim, Sangwon
Nam, Jaeyeal
Ko, Byoung Chul
author_sort Kim, Sangwon
collection PubMed
description In recent image classification approaches, a vision transformer (ViT) has shown an excellent performance beyond that of a convolutional neural network. A ViT achieves a high classification for natural images because it properly preserves the global image features. Conversely, a ViT still has many limitations in facial expression recognition (FER), which requires the detection of subtle changes in expression, because it can lose the local features of the image. Therefore, in this paper, we propose Squeeze ViT, a method for reducing the computational complexity by reducing the number of feature dimensions while increasing the FER performance by concurrently combining global and local features. To measure the FER performance of Squeeze ViT, experiments were conducted on lab-controlled FER datasets and a wild FER dataset. Through comparative experiments with previous state-of-the-art approaches, we proved that the proposed method achieves an excellent performance on both types of datasets.
format Online
Article
Text
id pubmed-9147983
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-91479832022-05-29 Facial Expression Recognition Based on Squeeze Vision Transformer Kim, Sangwon Nam, Jaeyeal Ko, Byoung Chul Sensors (Basel) Article In recent image classification approaches, a vision transformer (ViT) has shown an excellent performance beyond that of a convolutional neural network. A ViT achieves a high classification for natural images because it properly preserves the global image features. Conversely, a ViT still has many limitations in facial expression recognition (FER), which requires the detection of subtle changes in expression, because it can lose the local features of the image. Therefore, in this paper, we propose Squeeze ViT, a method for reducing the computational complexity by reducing the number of feature dimensions while increasing the FER performance by concurrently combining global and local features. To measure the FER performance of Squeeze ViT, experiments were conducted on lab-controlled FER datasets and a wild FER dataset. Through comparative experiments with previous state-of-the-art approaches, we proved that the proposed method achieves an excellent performance on both types of datasets. MDPI 2022-05-13 /pmc/articles/PMC9147983/ /pubmed/35632135 http://dx.doi.org/10.3390/s22103729 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kim, Sangwon
Nam, Jaeyeal
Ko, Byoung Chul
Facial Expression Recognition Based on Squeeze Vision Transformer
title Facial Expression Recognition Based on Squeeze Vision Transformer
title_full Facial Expression Recognition Based on Squeeze Vision Transformer
title_fullStr Facial Expression Recognition Based on Squeeze Vision Transformer
title_full_unstemmed Facial Expression Recognition Based on Squeeze Vision Transformer
title_short Facial Expression Recognition Based on Squeeze Vision Transformer
title_sort facial expression recognition based on squeeze vision transformer
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9147983/
https://www.ncbi.nlm.nih.gov/pubmed/35632135
http://dx.doi.org/10.3390/s22103729
work_keys_str_mv AT kimsangwon facialexpressionrecognitionbasedonsqueezevisiontransformer
AT namjaeyeal facialexpressionrecognitionbasedonsqueezevisiontransformer
AT kobyoungchul facialexpressionrecognitionbasedonsqueezevisiontransformer