Cargando…

Multi-scale fusion visual attention network for facial micro-expression recognition

INTRODUCTION: Micro-expressions are facial muscle movements that hide genuine emotions. In response to the challenge of micro-expression low-intensity, recent studies have attempted to locate localized areas of facial muscle movement. However, this ignores the feature redundancy caused by the inaccu...

Descripción completa

Detalles Bibliográficos
Autores principales: Pan, Hang, Yang, Hongling, Xie, Lun, Wang, Zhiliang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10412924/
https://www.ncbi.nlm.nih.gov/pubmed/37575295
http://dx.doi.org/10.3389/fnins.2023.1216181
_version_ 1785087023477948416
author Pan, Hang
Yang, Hongling
Xie, Lun
Wang, Zhiliang
author_facet Pan, Hang
Yang, Hongling
Xie, Lun
Wang, Zhiliang
author_sort Pan, Hang
collection PubMed
description INTRODUCTION: Micro-expressions are facial muscle movements that hide genuine emotions. In response to the challenge of micro-expression low-intensity, recent studies have attempted to locate localized areas of facial muscle movement. However, this ignores the feature redundancy caused by the inaccurate locating of the regions of interest. METHODS: This paper proposes a novel multi-scale fusion visual attention network (MFVAN), which learns multi-scale local attention weights to mask regions of redundancy features. Specifically, this model extracts the multi-scale features of the apex frame in the micro-expression video clips by convolutional neural networks. The attention mechanism focuses on the weights of local region features in the multi-scale feature maps. Then, we mask operate redundancy regions in multi-scale features and fuse local features with high attention weights for micro-expression recognition. The self-supervision and transfer learning reduce the influence of individual identity attributes and increase the robustness of multi-scale feature maps. Finally, the multi-scale classification loss, mask loss, and removing individual identity attributes loss joint to optimize the model. RESULTS: The proposed MFVAN method is evaluated on SMIC, CASME II, SAMM, and 3DB-Combined datasets that achieve state-of-the-art performance. The experimental results show that focusing on local at the multi-scale contributes to micro-expression recognition. DISCUSSION: This paper proposed MFVAN model is the first to combine image generation with visual attention mechanisms to solve the combination challenge problem of individual identity attribute interference and low-intensity facial muscle movements. Meanwhile, the MFVAN model reveal the impact of individual attributes on the localization of local ROIs. The experimental results show that a multi-scale fusion visual attention network contributes to micro-expression recognition.
format Online
Article
Text
id pubmed-10412924
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-104129242023-08-11 Multi-scale fusion visual attention network for facial micro-expression recognition Pan, Hang Yang, Hongling Xie, Lun Wang, Zhiliang Front Neurosci Neuroscience INTRODUCTION: Micro-expressions are facial muscle movements that hide genuine emotions. In response to the challenge of micro-expression low-intensity, recent studies have attempted to locate localized areas of facial muscle movement. However, this ignores the feature redundancy caused by the inaccurate locating of the regions of interest. METHODS: This paper proposes a novel multi-scale fusion visual attention network (MFVAN), which learns multi-scale local attention weights to mask regions of redundancy features. Specifically, this model extracts the multi-scale features of the apex frame in the micro-expression video clips by convolutional neural networks. The attention mechanism focuses on the weights of local region features in the multi-scale feature maps. Then, we mask operate redundancy regions in multi-scale features and fuse local features with high attention weights for micro-expression recognition. The self-supervision and transfer learning reduce the influence of individual identity attributes and increase the robustness of multi-scale feature maps. Finally, the multi-scale classification loss, mask loss, and removing individual identity attributes loss joint to optimize the model. RESULTS: The proposed MFVAN method is evaluated on SMIC, CASME II, SAMM, and 3DB-Combined datasets that achieve state-of-the-art performance. The experimental results show that focusing on local at the multi-scale contributes to micro-expression recognition. DISCUSSION: This paper proposed MFVAN model is the first to combine image generation with visual attention mechanisms to solve the combination challenge problem of individual identity attribute interference and low-intensity facial muscle movements. Meanwhile, the MFVAN model reveal the impact of individual attributes on the localization of local ROIs. The experimental results show that a multi-scale fusion visual attention network contributes to micro-expression recognition. Frontiers Media S.A. 2023-07-27 /pmc/articles/PMC10412924/ /pubmed/37575295 http://dx.doi.org/10.3389/fnins.2023.1216181 Text en Copyright © 2023 Pan, Yang, Xie and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Pan, Hang
Yang, Hongling
Xie, Lun
Wang, Zhiliang
Multi-scale fusion visual attention network for facial micro-expression recognition
title Multi-scale fusion visual attention network for facial micro-expression recognition
title_full Multi-scale fusion visual attention network for facial micro-expression recognition
title_fullStr Multi-scale fusion visual attention network for facial micro-expression recognition
title_full_unstemmed Multi-scale fusion visual attention network for facial micro-expression recognition
title_short Multi-scale fusion visual attention network for facial micro-expression recognition
title_sort multi-scale fusion visual attention network for facial micro-expression recognition
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10412924/
https://www.ncbi.nlm.nih.gov/pubmed/37575295
http://dx.doi.org/10.3389/fnins.2023.1216181
work_keys_str_mv AT panhang multiscalefusionvisualattentionnetworkforfacialmicroexpressionrecognition
AT yanghongling multiscalefusionvisualattentionnetworkforfacialmicroexpressionrecognition
AT xielun multiscalefusionvisualattentionnetworkforfacialmicroexpressionrecognition
AT wangzhiliang multiscalefusionvisualattentionnetworkforfacialmicroexpressionrecognition