Cargando…
eXnet: An Efficient Approach for Emotion Recognition in the Wild
Facial expression recognition has been well studied for its great importance in the areas of human–computer interaction and social sciences. With the evolution of deep learning, there have been significant advances in this area that also surpass human-level accuracy. Although these methods have achi...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7071079/ https://www.ncbi.nlm.nih.gov/pubmed/32079319 http://dx.doi.org/10.3390/s20041087 |
_version_ | 1783506119668269056 |
---|---|
author | Riaz, Muhammad Naveed Shen, Yao Sohail, Muhammad Guo, Minyi |
author_facet | Riaz, Muhammad Naveed Shen, Yao Sohail, Muhammad Guo, Minyi |
author_sort | Riaz, Muhammad Naveed |
collection | PubMed |
description | Facial expression recognition has been well studied for its great importance in the areas of human–computer interaction and social sciences. With the evolution of deep learning, there have been significant advances in this area that also surpass human-level accuracy. Although these methods have achieved good accuracy, they are still suffering from two constraints (high computational power and memory), which are incredibly critical for small hardware-constrained devices. To alleviate this issue, we propose a new Convolutional Neural Network (CNN) architecture eXnet (Expression Net) based on parallel feature extraction which surpasses current methods in accuracy and contains a much smaller number of parameters (eXnet: 4.57 million, VGG19: 14.72 million), making it more efficient and lightweight for real-time systems. Several modern data augmentation techniques are applied for generalization of eXnet; these techniques improve the accuracy of the network by overcoming the problem of overfitting while containing the same size. We provide an extensive evaluation of our network against key methods on Facial Expression Recognition 2013 (FER-2013), Extended Cohn-Kanade Dataset (CK+), and Real-world Affective Faces Database (RAF-DB) benchmark datasets. We also perform ablation evaluation to show the importance of different components of our architecture. To evaluate the efficiency of eXnet on embedded systems, we deploy it on Raspberry Pi 4B. All these evaluations show the superiority of eXnet for emotion recognition in the wild in terms of accuracy, the number of parameters, and size on disk. |
format | Online Article Text |
id | pubmed-7071079 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-70710792020-03-19 eXnet: An Efficient Approach for Emotion Recognition in the Wild Riaz, Muhammad Naveed Shen, Yao Sohail, Muhammad Guo, Minyi Sensors (Basel) Article Facial expression recognition has been well studied for its great importance in the areas of human–computer interaction and social sciences. With the evolution of deep learning, there have been significant advances in this area that also surpass human-level accuracy. Although these methods have achieved good accuracy, they are still suffering from two constraints (high computational power and memory), which are incredibly critical for small hardware-constrained devices. To alleviate this issue, we propose a new Convolutional Neural Network (CNN) architecture eXnet (Expression Net) based on parallel feature extraction which surpasses current methods in accuracy and contains a much smaller number of parameters (eXnet: 4.57 million, VGG19: 14.72 million), making it more efficient and lightweight for real-time systems. Several modern data augmentation techniques are applied for generalization of eXnet; these techniques improve the accuracy of the network by overcoming the problem of overfitting while containing the same size. We provide an extensive evaluation of our network against key methods on Facial Expression Recognition 2013 (FER-2013), Extended Cohn-Kanade Dataset (CK+), and Real-world Affective Faces Database (RAF-DB) benchmark datasets. We also perform ablation evaluation to show the importance of different components of our architecture. To evaluate the efficiency of eXnet on embedded systems, we deploy it on Raspberry Pi 4B. All these evaluations show the superiority of eXnet for emotion recognition in the wild in terms of accuracy, the number of parameters, and size on disk. MDPI 2020-02-17 /pmc/articles/PMC7071079/ /pubmed/32079319 http://dx.doi.org/10.3390/s20041087 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Riaz, Muhammad Naveed Shen, Yao Sohail, Muhammad Guo, Minyi eXnet: An Efficient Approach for Emotion Recognition in the Wild |
title | eXnet: An Efficient Approach for Emotion Recognition in the Wild |
title_full | eXnet: An Efficient Approach for Emotion Recognition in the Wild |
title_fullStr | eXnet: An Efficient Approach for Emotion Recognition in the Wild |
title_full_unstemmed | eXnet: An Efficient Approach for Emotion Recognition in the Wild |
title_short | eXnet: An Efficient Approach for Emotion Recognition in the Wild |
title_sort | exnet: an efficient approach for emotion recognition in the wild |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7071079/ https://www.ncbi.nlm.nih.gov/pubmed/32079319 http://dx.doi.org/10.3390/s20041087 |
work_keys_str_mv | AT riazmuhammadnaveed exnetanefficientapproachforemotionrecognitioninthewild AT shenyao exnetanefficientapproachforemotionrecognitioninthewild AT sohailmuhammad exnetanefficientapproachforemotionrecognitioninthewild AT guominyi exnetanefficientapproachforemotionrecognitioninthewild |