Cargando…

eXnet: An Efficient Approach for Emotion Recognition in the Wild

Facial expression recognition has been well studied for its great importance in the areas of human–computer interaction and social sciences. With the evolution of deep learning, there have been significant advances in this area that also surpass human-level accuracy. Although these methods have achi...

Descripción completa

Detalles Bibliográficos
Autores principales: Riaz, Muhammad Naveed, Shen, Yao, Sohail, Muhammad, Guo, Minyi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7071079/
https://www.ncbi.nlm.nih.gov/pubmed/32079319
http://dx.doi.org/10.3390/s20041087
_version_ 1783506119668269056
author Riaz, Muhammad Naveed
Shen, Yao
Sohail, Muhammad
Guo, Minyi
author_facet Riaz, Muhammad Naveed
Shen, Yao
Sohail, Muhammad
Guo, Minyi
author_sort Riaz, Muhammad Naveed
collection PubMed
description Facial expression recognition has been well studied for its great importance in the areas of human–computer interaction and social sciences. With the evolution of deep learning, there have been significant advances in this area that also surpass human-level accuracy. Although these methods have achieved good accuracy, they are still suffering from two constraints (high computational power and memory), which are incredibly critical for small hardware-constrained devices. To alleviate this issue, we propose a new Convolutional Neural Network (CNN) architecture eXnet (Expression Net) based on parallel feature extraction which surpasses current methods in accuracy and contains a much smaller number of parameters (eXnet: 4.57 million, VGG19: 14.72 million), making it more efficient and lightweight for real-time systems. Several modern data augmentation techniques are applied for generalization of eXnet; these techniques improve the accuracy of the network by overcoming the problem of overfitting while containing the same size. We provide an extensive evaluation of our network against key methods on Facial Expression Recognition 2013 (FER-2013), Extended Cohn-Kanade Dataset (CK+), and Real-world Affective Faces Database (RAF-DB) benchmark datasets. We also perform ablation evaluation to show the importance of different components of our architecture. To evaluate the efficiency of eXnet on embedded systems, we deploy it on Raspberry Pi 4B. All these evaluations show the superiority of eXnet for emotion recognition in the wild in terms of accuracy, the number of parameters, and size on disk.
format Online
Article
Text
id pubmed-7071079
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-70710792020-03-19 eXnet: An Efficient Approach for Emotion Recognition in the Wild Riaz, Muhammad Naveed Shen, Yao Sohail, Muhammad Guo, Minyi Sensors (Basel) Article Facial expression recognition has been well studied for its great importance in the areas of human–computer interaction and social sciences. With the evolution of deep learning, there have been significant advances in this area that also surpass human-level accuracy. Although these methods have achieved good accuracy, they are still suffering from two constraints (high computational power and memory), which are incredibly critical for small hardware-constrained devices. To alleviate this issue, we propose a new Convolutional Neural Network (CNN) architecture eXnet (Expression Net) based on parallel feature extraction which surpasses current methods in accuracy and contains a much smaller number of parameters (eXnet: 4.57 million, VGG19: 14.72 million), making it more efficient and lightweight for real-time systems. Several modern data augmentation techniques are applied for generalization of eXnet; these techniques improve the accuracy of the network by overcoming the problem of overfitting while containing the same size. We provide an extensive evaluation of our network against key methods on Facial Expression Recognition 2013 (FER-2013), Extended Cohn-Kanade Dataset (CK+), and Real-world Affective Faces Database (RAF-DB) benchmark datasets. We also perform ablation evaluation to show the importance of different components of our architecture. To evaluate the efficiency of eXnet on embedded systems, we deploy it on Raspberry Pi 4B. All these evaluations show the superiority of eXnet for emotion recognition in the wild in terms of accuracy, the number of parameters, and size on disk. MDPI 2020-02-17 /pmc/articles/PMC7071079/ /pubmed/32079319 http://dx.doi.org/10.3390/s20041087 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Riaz, Muhammad Naveed
Shen, Yao
Sohail, Muhammad
Guo, Minyi
eXnet: An Efficient Approach for Emotion Recognition in the Wild
title eXnet: An Efficient Approach for Emotion Recognition in the Wild
title_full eXnet: An Efficient Approach for Emotion Recognition in the Wild
title_fullStr eXnet: An Efficient Approach for Emotion Recognition in the Wild
title_full_unstemmed eXnet: An Efficient Approach for Emotion Recognition in the Wild
title_short eXnet: An Efficient Approach for Emotion Recognition in the Wild
title_sort exnet: an efficient approach for emotion recognition in the wild
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7071079/
https://www.ncbi.nlm.nih.gov/pubmed/32079319
http://dx.doi.org/10.3390/s20041087
work_keys_str_mv AT riazmuhammadnaveed exnetanefficientapproachforemotionrecognitioninthewild
AT shenyao exnetanefficientapproachforemotionrecognitioninthewild
AT sohailmuhammad exnetanefficientapproachforemotionrecognitioninthewild
AT guominyi exnetanefficientapproachforemotionrecognitioninthewild