Facial expression recognition based on active region of interest using deep learning and parallelism

Automatic facial expression tracking has become an emerging topic over the last few decades. It is a challenging problem that impacts many fields such as virtual reality, security surveillance, driver safety, homeland security, human-computer interaction, and medical applications. A remarka...

Bibliographic Details
Main authors: Hossain, Mohammad Alamgir; Assiri, Basem
Format: Online Article Text
Language: English
Published: PeerJ Inc. 2022
Subjects: Human-Computer Interaction
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044208/
https://www.ncbi.nlm.nih.gov/pubmed/35494822
http://dx.doi.org/10.7717/peerj-cs.894
_version_ 1784695054655291392
author Hossain, Mohammad Alamgir
Assiri, Basem
collection PubMed
description Automatic facial expression tracking has become an emerging topic over the last few decades. It is a challenging problem that impacts many fields such as virtual reality, security surveillance, driver safety, homeland security, human-computer interaction, and medical applications. Considerable cost savings can be achieved by considering only certain areas of the face. These areas are termed Active Regions of Interest (AROIs). This work proposes a facial expression recognition framework that investigates five types of facial expressions, namely neutral, happiness, fear, surprise, and disgust. Firstly, a pose estimation method is incorporated, along with an approach that rotates the face to achieve a normalized pose. Secondly, the whole face image is segmented into four classes and eight regions. Thirdly, only four AROIs are identified from the segmented regions: the nose-tip, right eye, left eye, and lips. Fourthly, an info-image-data-mask database is maintained for classification and is used to store records of images. This database combines all the images obtained after applying a ten-fold cross-validation technique with the Convolutional Neural Network. Correlations of variances and standard deviations are computed based on the identified images. To minimize the processing time required for both training and testing, a parallelism technique is introduced in which each AROI is classified individually and all classifications run in parallel. Fifthly, a decision-tree-level synthesis-based framework is proposed to coordinate the results of the parallel classification, which helps to improve the recognition accuracy. Finally, experiments on both independent and synthesis databases are used to calculate the performance of the proposed technique. By incorporating the proposed synthesis method, we obtain 94.499%, 95.439%, and 98.26% accuracy with the CK+ image sets and 92.463%, 93.318%, and 94.423% with the JAFFE image sets. The overall recognition accuracy is 95.27%. We gain 2.8% higher accuracy by introducing a decision-level synthesis method. Moreover, with the incorporation of parallelism, processing is three times faster. This accuracy demonstrates the robustness of the proposed scheme.
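The parallel per-region classification and decision-level synthesis described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: `classify_region` is a hypothetical placeholder for a trained per-region CNN, the pixel values are invented sample data, and a simple majority vote stands in for the decision-tree-level synthesis, whose details the record does not give. Only the four AROI names and the five expression labels come from the abstract.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# The four AROIs and five expressions named in the abstract.
AROIS = ("nose_tip", "right_eye", "left_eye", "lips")
EXPRESSIONS = ("neutral", "happiness", "fear", "surprise", "disgust")


def classify_region(region_name, pixels):
    """Hypothetical stand-in for a per-region CNN (assumption, not the paper's model).

    Maps the mean pixel intensity (0-255) to one of the five labels and
    returns (label, confidence). A real system would run a trained model here.
    """
    mean = sum(pixels) / len(pixels)
    index = min(int(mean / 256 * len(EXPRESSIONS)), len(EXPRESSIONS) - 1)
    return EXPRESSIONS[index], 0.5 + abs(mean - 128) / 256


def classify_in_parallel(regions):
    """Classify every AROI concurrently; returns {region: (label, confidence)}."""
    with ThreadPoolExecutor(max_workers=len(regions)) as pool:
        futures = {name: pool.submit(classify_region, name, px)
                   for name, px in regions.items()}
        return {name: fut.result() for name, fut in futures.items()}


def fuse_decisions(results):
    """Majority vote over region labels; ties go to the higher summed confidence.

    This vote is a simplified substitute for the paper's synthesis step.
    """
    votes = Counter()
    confidence = Counter()
    for label, conf in results.values():
        votes[label] += 1
        confidence[label] += conf
    return max(votes, key=lambda lab: (votes[lab], confidence[lab]))


# Invented sample intensities for each region (illustrative only).
regions = {
    "nose_tip": [60, 70, 80],
    "right_eye": [90, 100, 95],
    "left_eye": [55, 65, 75],
    "lips": [200, 210, 220],
}
per_region = classify_in_parallel(regions)
expression = fuse_decisions(per_region)
print(per_region)
print(expression)
```

A thread pool keeps the sketch short and portable. For CPU-bound models in CPython, a process pool or GPU batching would be the more likely way to obtain a real speed-up such as the one the abstract reports.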
format Online
Article
Text
id pubmed-9044208
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-9044208 2022-04-28 Facial expression recognition based on active region of interest using deep learning and parallelism Hossain, Mohammad Alamgir Assiri, Basem PeerJ Comput Sci Human-Computer Interaction PeerJ Inc. 2022-03-02 /pmc/articles/PMC9044208/ /pubmed/35494822 http://dx.doi.org/10.7717/peerj-cs.894 Text en ©2022 Hossain and Assiri https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
title Facial expression recognition based on active region of interest using deep learning and parallelism
topic Human-Computer Interaction
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044208/
https://www.ncbi.nlm.nih.gov/pubmed/35494822
http://dx.doi.org/10.7717/peerj-cs.894