Cargando…

MIFAD-Net: Multi-Layer Interactive Feature Fusion Network With Angular Distance Loss for Face Emotion Recognition

Understanding human emotions and psychology is a critical step toward realizing artificial intelligence, and correct recognition of facial expressions is essential for judging emotions. However, the differences caused by changes in facial expression are very subtle, and different expression features...

Descripción completa

Detalles Bibliográficos
Autores principales: Cai, Weiwei, Gao, Ming, Liu, Runmin, Mao, Jie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8569934/
https://www.ncbi.nlm.nih.gov/pubmed/34744943
http://dx.doi.org/10.3389/fpsyg.2021.762795
_version_ 1784594741546975232
author Cai, Weiwei
Gao, Ming
Liu, Runmin
Mao, Jie
author_facet Cai, Weiwei
Gao, Ming
Liu, Runmin
Mao, Jie
author_sort Cai, Weiwei
collection PubMed
description Understanding human emotions and psychology is a critical step toward realizing artificial intelligence, and correct recognition of facial expressions is essential for judging emotions. However, the differences caused by changes in facial expression are very subtle, and different expression features are less distinguishable, making it difficult for computers to recognize human facial emotions accurately. Therefore, this paper proposes a novel multi-layer interactive feature fusion network model with angular distance loss. To begin, a multi-layer and multi-scale module is designed to extract global and local features of facial emotions in order to capture part of the feature relationships between different scales, thereby improving the model's ability to discriminate subtle features of facial emotions. Second, a hierarchical interactive feature fusion module is designed to address the issue of loss of useful feature information caused by layer-by-layer convolution and pooling of convolutional neural networks. In addition, the attention mechanism is also used between convolutional layers at different levels. Improve the neural network's discriminative ability by increasing the saliency of information about different features on the layers and suppressing irrelevant information. Finally, we use the angular distance loss function to improve the proposed model's inter-class feature separation and intra-class feature clustering capabilities, addressing the issues of large intra-class differences and high inter-class similarity in facial emotion recognition. We conducted comparison and ablation experiments on the FER2013 dataset. The results illustrate that the performance of the proposed MIFAD-Net is 1.02–4.53% better than the compared methods, and it has strong competitiveness.
format Online
Article
Text
id pubmed-8569934
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-85699342021-11-06 MIFAD-Net: Multi-Layer Interactive Feature Fusion Network With Angular Distance Loss for Face Emotion Recognition Cai, Weiwei Gao, Ming Liu, Runmin Mao, Jie Front Psychol Psychology Understanding human emotions and psychology is a critical step toward realizing artificial intelligence, and correct recognition of facial expressions is essential for judging emotions. However, the differences caused by changes in facial expression are very subtle, and different expression features are less distinguishable, making it difficult for computers to recognize human facial emotions accurately. Therefore, this paper proposes a novel multi-layer interactive feature fusion network model with angular distance loss. To begin, a multi-layer and multi-scale module is designed to extract global and local features of facial emotions in order to capture part of the feature relationships between different scales, thereby improving the model's ability to discriminate subtle features of facial emotions. Second, a hierarchical interactive feature fusion module is designed to address the issue of loss of useful feature information caused by layer-by-layer convolution and pooling of convolutional neural networks. In addition, the attention mechanism is also used between convolutional layers at different levels. Improve the neural network's discriminative ability by increasing the saliency of information about different features on the layers and suppressing irrelevant information. Finally, we use the angular distance loss function to improve the proposed model's inter-class feature separation and intra-class feature clustering capabilities, addressing the issues of large intra-class differences and high inter-class similarity in facial emotion recognition. We conducted comparison and ablation experiments on the FER2013 dataset. The results illustrate that the performance of the proposed MIFAD-Net is 1.02–4.53% better than the compared methods, and it has strong competitiveness. Frontiers Media S.A. 2021-10-22 /pmc/articles/PMC8569934/ /pubmed/34744943 http://dx.doi.org/10.3389/fpsyg.2021.762795 Text en Copyright © 2021 Cai, Gao, Liu and Mao. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychology
Cai, Weiwei
Gao, Ming
Liu, Runmin
Mao, Jie
MIFAD-Net: Multi-Layer Interactive Feature Fusion Network With Angular Distance Loss for Face Emotion Recognition
title MIFAD-Net: Multi-Layer Interactive Feature Fusion Network With Angular Distance Loss for Face Emotion Recognition
title_full MIFAD-Net: Multi-Layer Interactive Feature Fusion Network With Angular Distance Loss for Face Emotion Recognition
title_fullStr MIFAD-Net: Multi-Layer Interactive Feature Fusion Network With Angular Distance Loss for Face Emotion Recognition
title_full_unstemmed MIFAD-Net: Multi-Layer Interactive Feature Fusion Network With Angular Distance Loss for Face Emotion Recognition
title_short MIFAD-Net: Multi-Layer Interactive Feature Fusion Network With Angular Distance Loss for Face Emotion Recognition
title_sort mifad-net: multi-layer interactive feature fusion network with angular distance loss for face emotion recognition
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8569934/
https://www.ncbi.nlm.nih.gov/pubmed/34744943
http://dx.doi.org/10.3389/fpsyg.2021.762795
work_keys_str_mv AT caiweiwei mifadnetmultilayerinteractivefeaturefusionnetworkwithangulardistancelossforfaceemotionrecognition
AT gaoming mifadnetmultilayerinteractivefeaturefusionnetworkwithangulardistancelossforfaceemotionrecognition
AT liurunmin mifadnetmultilayerinteractivefeaturefusionnetworkwithangulardistancelossforfaceemotionrecognition
AT maojie mifadnetmultilayerinteractivefeaturefusionnetworkwithangulardistancelossforfaceemotionrecognition