Cargando…

eXnet: An Efficient Approach for Emotion Recognition in the Wild

Facial expression recognition has been well studied for its great importance in the areas of human–computer interaction and social sciences. With the evolution of deep learning, there have been significant advances in this area that also surpass human-level accuracy. Although these methods have achi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Riaz, Muhammad Naveed, Shen, Yao, Sohail, Muhammad, Guo, Minyi
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7071079/ https://www.ncbi.nlm.nih.gov/pubmed/32079319 http://dx.doi.org/10.3390/s20041087

_version_	1783506119668269056
author	Riaz, Muhammad Naveed Shen, Yao Sohail, Muhammad Guo, Minyi
author_facet	Riaz, Muhammad Naveed Shen, Yao Sohail, Muhammad Guo, Minyi
author_sort	Riaz, Muhammad Naveed
collection	PubMed
description	Facial expression recognition has been well studied for its great importance in the areas of human–computer interaction and social sciences. With the evolution of deep learning, there have been significant advances in this area that also surpass human-level accuracy. Although these methods have achieved good accuracy, they are still suffering from two constraints (high computational power and memory), which are incredibly critical for small hardware-constrained devices. To alleviate this issue, we propose a new Convolutional Neural Network (CNN) architecture eXnet (Expression Net) based on parallel feature extraction which surpasses current methods in accuracy and contains a much smaller number of parameters (eXnet: 4.57 million, VGG19: 14.72 million), making it more efficient and lightweight for real-time systems. Several modern data augmentation techniques are applied for generalization of eXnet; these techniques improve the accuracy of the network by overcoming the problem of overfitting while containing the same size. We provide an extensive evaluation of our network against key methods on Facial Expression Recognition 2013 (FER-2013), Extended Cohn-Kanade Dataset (CK+), and Real-world Affective Faces Database (RAF-DB) benchmark datasets. We also perform ablation evaluation to show the importance of different components of our architecture. To evaluate the efficiency of eXnet on embedded systems, we deploy it on Raspberry Pi 4B. All these evaluations show the superiority of eXnet for emotion recognition in the wild in terms of accuracy, the number of parameters, and size on disk.
format	Online Article Text
id	pubmed-7071079
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-70710792020-03-19 eXnet: An Efficient Approach for Emotion Recognition in the Wild Riaz, Muhammad Naveed Shen, Yao Sohail, Muhammad Guo, Minyi Sensors (Basel) Article Facial expression recognition has been well studied for its great importance in the areas of human–computer interaction and social sciences. With the evolution of deep learning, there have been significant advances in this area that also surpass human-level accuracy. Although these methods have achieved good accuracy, they are still suffering from two constraints (high computational power and memory), which are incredibly critical for small hardware-constrained devices. To alleviate this issue, we propose a new Convolutional Neural Network (CNN) architecture eXnet (Expression Net) based on parallel feature extraction which surpasses current methods in accuracy and contains a much smaller number of parameters (eXnet: 4.57 million, VGG19: 14.72 million), making it more efficient and lightweight for real-time systems. Several modern data augmentation techniques are applied for generalization of eXnet; these techniques improve the accuracy of the network by overcoming the problem of overfitting while containing the same size. We provide an extensive evaluation of our network against key methods on Facial Expression Recognition 2013 (FER-2013), Extended Cohn-Kanade Dataset (CK+), and Real-world Affective Faces Database (RAF-DB) benchmark datasets. We also perform ablation evaluation to show the importance of different components of our architecture. To evaluate the efficiency of eXnet on embedded systems, we deploy it on Raspberry Pi 4B. All these evaluations show the superiority of eXnet for emotion recognition in the wild in terms of accuracy, the number of parameters, and size on disk. MDPI 2020-02-17 /pmc/articles/PMC7071079/ /pubmed/32079319 http://dx.doi.org/10.3390/s20041087 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Riaz, Muhammad Naveed Shen, Yao Sohail, Muhammad Guo, Minyi eXnet: An Efficient Approach for Emotion Recognition in the Wild
title	eXnet: An Efficient Approach for Emotion Recognition in the Wild
title_full	eXnet: An Efficient Approach for Emotion Recognition in the Wild
title_fullStr	eXnet: An Efficient Approach for Emotion Recognition in the Wild
title_full_unstemmed	eXnet: An Efficient Approach for Emotion Recognition in the Wild
title_short	eXnet: An Efficient Approach for Emotion Recognition in the Wild
title_sort	exnet: an efficient approach for emotion recognition in the wild
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7071079/ https://www.ncbi.nlm.nih.gov/pubmed/32079319 http://dx.doi.org/10.3390/s20041087
work_keys_str_mv	AT riazmuhammadnaveed exnetanefficientapproachforemotionrecognitioninthewild AT shenyao exnetanefficientapproachforemotionrecognitioninthewild AT sohailmuhammad exnetanefficientapproachforemotionrecognitioninthewild AT guominyi exnetanefficientapproachforemotionrecognitioninthewild

eXnet: An Efficient Approach for Emotion Recognition in the Wild

Ejemplares similares