TFE: A Transformer Architecture for Occlusion Aware Facial Expression Recognition

Facial expression recognition (FER) in uncontrolled environments is challenging due to various unconstrained conditions. Although existing deep learning-based FER approaches have been quite promising in recognizing frontal faces, they still struggle to accurately identify the facial expressions on t...

Bibliographic Details
Main Authors: Gao, Jixun; Zhao, Yuanyuan
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2021
Subjects: Neuroscience
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573424/
https://www.ncbi.nlm.nih.gov/pubmed/34759808
http://dx.doi.org/10.3389/fnbot.2021.763100
author Gao, Jixun
Zhao, Yuanyuan
collection PubMed
description Facial expression recognition (FER) in uncontrolled environments is challenging due to various unconstrained conditions. Although existing deep learning-based FER approaches have been quite promising in recognizing frontal faces, they still struggle to accurately identify the facial expressions on faces that are partly occluded in unconstrained scenarios. To mitigate this issue, we propose a transformer-based FER method (TFE) that is capable of adaptively focusing on the most important and unoccluded facial regions. TFE is based on the multi-head self-attention mechanism, which can flexibly attend to a sequence of image patches to encode the critical cues for FER. Compared with a traditional transformer, the novelty of TFE is two-fold: (i) To effectively select the discriminative facial regions, we integrate all the attention weights in the various transformer layers into an attention map that guides the network to perceive the important facial regions. (ii) Given an occluded facial image as input, we use a decoder to reconstruct the corresponding non-occluded face. Thus, TFE is capable of inferring the occluded regions to better recognize the facial expressions. We evaluate the proposed TFE on two prevalent in-the-wild facial expression datasets (AffectNet and RAF-DB) and their modifications with artificial occlusions. Experimental results show that TFE improves the recognition accuracy on both non-occluded and occluded faces. Compared with other state-of-the-art FER methods, TFE obtains consistent improvements. Visualization results show that TFE is capable of automatically focusing on the discriminative and non-occluded facial regions for robust FER.
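Point (i) of the description, combining attention weights from several transformer layers into one attention map over image patches, can be illustrated with a small sketch. The record does not say how TFE performs this integration, so the code below is an assumption: it uses a rollout-style aggregation (average the heads, mix in the identity for the residual path, multiply the layer matrices). The function names, the 0.5 residual weight, and the convention that token 0 is a class token are all hypothetical choices made for illustration.

```python
from typing import List

Matrix = List[List[float]]


def mean_over_heads(heads: List[Matrix]) -> Matrix:
    """Average the attention matrices of all heads in one layer."""
    n = len(heads[0])
    return [[sum(h[i][j] for h in heads) / len(heads) for j in range(n)]
            for i in range(n)]


def add_residual_and_normalize(attn: Matrix) -> Matrix:
    """Mix in the identity (assumed 0.5 weight) and renormalize each row."""
    n = len(attn)
    out = []
    for i in range(n):
        row = [0.5 * attn[i][j] + (0.5 if i == j else 0.0) for j in range(n)]
        total = sum(row)
        out.append([v / total for v in row])
    return out


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain square matrix product."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def aggregate_attention(layers: List[List[Matrix]]) -> List[float]:
    """Combine per-layer, per-head attention into one score per image patch.

    layers[l][h] is the (tokens x tokens) attention matrix of head h in
    layer l. Token 0 is assumed to be a class token; the rest are patches.
    """
    n = len(layers[0][0])
    rollout = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for heads in layers:
        layer_attn = add_residual_and_normalize(mean_over_heads(heads))
        rollout = matmul(layer_attn, rollout)
    # Keep how strongly the class token draws on each patch.
    patch_scores = rollout[0][1:]
    total = sum(patch_scores)
    if total == 0:
        # Class token ignored every patch: fall back to a uniform map.
        return [1.0 / len(patch_scores)] * len(patch_scores)
    return [s / total for s in patch_scores]


if __name__ == "__main__":
    # Toy input: 2 layers, 1 head each, 1 class token plus 2 patches.
    toy = [[0.2, 0.7, 0.1]] * 3
    print(aggregate_attention([[toy], [toy]]))
```

The returned scores sum to 1 and could be reshaped to the patch grid to give a map of which facial regions receive the most attention, which is the kind of map the description says is used to guide the network.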
format Online
Article
Text
id pubmed-8573424
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-8573424 2021-11-09 TFE: A Transformer Architecture for Occlusion Aware Facial Expression Recognition Gao, Jixun Zhao, Yuanyuan Front Neurorobot Neuroscience Frontiers Media S.A.
2021-10-25 /pmc/articles/PMC8573424/ /pubmed/34759808 http://dx.doi.org/10.3389/fnbot.2021.763100 Text en Copyright © 2021 Gao and Zhao. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title TFE: A Transformer Architecture for Occlusion Aware Facial Expression Recognition
topic Neuroscience