Multilabel convolution neural network for facial expression recognition and ordinal intensity estimation
Facial Expression Recognition (FER) has gained considerable attention in affective computing due to its wide range of applications. Diverse approaches and methods have been considered for robust FER, but only a few works have considered the intensity of emotion embedded in the expression. E...
Main Authors: | Ekundayo, Olufisayo; Viriri, Serestina |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | PeerJ Inc. 2021 |
Subjects: | Algorithms and Analysis of Algorithms |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8641570/ https://www.ncbi.nlm.nih.gov/pubmed/34909462 http://dx.doi.org/10.7717/peerj-cs.736 |
_version_ | 1784609520473866240 |
---|---|
author | Ekundayo, Olufisayo Viriri, Serestina |
author_sort | Ekundayo, Olufisayo |
collection | PubMed |
description | Facial Expression Recognition (FER) has gained considerable attention in affective computing due to its wide range of applications. Diverse approaches and methods have been considered for robust FER, but only a few works have considered the intensity of emotion embedded in the expression. The available studies on expression intensity estimation assign a nominal or regression value, or classify emotion into a range of intervals. Most of them present only the emotion intensity estimation, while others propose methods that predict emotion and its intensity in different channels. These multiclass approaches and extensions do not conform to the heuristic manner in which humans recognise emotion and estimate its intensity. This work presents a Multilabel Convolution Neural Network (ML-CNN)-based model that simultaneously recognises emotion and provides ordinal metrics as the intensity estimation of the emotion. The proposed ML-CNN is enhanced with the aggregation of Binary Cross-Entropy (BCE) loss and Island Loss (IL) functions to minimise intraclass variation and enlarge interclass differences. The ML-CNN model is also pre-trained with the Visual Geometry Group network (VGG-16) to control overfitting. In the experiments conducted on the Binghamton University 3D Facial Expression (BU-3DFE) and Extended Cohn-Kanade (CK+) datasets, we evaluate ML-CNN’s performance based on accuracy and loss. We also carried out a comparative study of our model with some popularly used multilabel algorithms using standard multilabel metrics. The ML-CNN model simultaneously predicts emotion and intensity estimation using ordinal metrics. The model also shows appreciable and superior performance over four standard multilabel algorithms: Classifier Chain (CC), distinct Random k-Labelsets (RAKEL), Multilabel K Nearest Neighbour (MLKNN) and Multilabel ARAM (MLARAM). |
format | Online Article Text |
id | pubmed-8641570 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-8641570 2021-12-13 Multilabel convolution neural network for facial expression recognition and ordinal intensity estimation Ekundayo, Olufisayo Viriri, Serestina PeerJ Comput Sci Algorithms and Analysis of Algorithms PeerJ Inc. 2021-11-29 /pmc/articles/PMC8641570/ /pubmed/34909462 http://dx.doi.org/10.7717/peerj-cs.736 Text en © 2021 Ekundayo and Viriri https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
title | Multilabel convolution neural network for facial expression recognition and ordinal intensity estimation |
topic | Algorithms and Analysis of Algorithms |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8641570/ https://www.ncbi.nlm.nih.gov/pubmed/34909462 http://dx.doi.org/10.7717/peerj-cs.736 |
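
The description above mentions aggregating Binary Cross-Entropy (BCE) loss with Island Loss (IL). The following is a minimal NumPy sketch of how such a combined objective could be computed, not the authors' implementation. It assumes the commonly published island loss form (a centre-loss term plus a penalty on the pairwise cosine similarity of class centres) and assumes each sample's centre is selected by its emotion class. The function names and the weights `lam` and `lambda1` are hypothetical and chosen only for illustration.

```python
import numpy as np


def bce_loss(probs, targets, eps=1e-7):
    """Mean binary cross-entropy over all labels of all samples."""
    probs = np.clip(probs, eps, 1.0 - eps)  # avoid log(0)
    per_label = targets * np.log(probs) + (1.0 - targets) * np.log(1.0 - probs)
    return float(-np.mean(per_label))


def island_loss(features, class_ids, centers, lambda1=10.0):
    """Centre-loss term plus a pairwise centre-separation penalty.

    features:  (n_samples, dim) deep features
    class_ids: (n_samples,) integer emotion class per sample (assumption)
    centers:   (n_classes, dim) one centre per emotion class
    """
    # Intraclass term: pull each feature towards its own class centre.
    diffs = features - centers[class_ids]
    center_term = 0.5 * np.sum(diffs ** 2)

    # Interclass term: sum of (cosine similarity + 1) over ordered pairs
    # of distinct centres; it is zero when centres point in opposite directions.
    unit = centers / np.linalg.norm(centers, axis=1, keepdims=True)
    cosine = unit @ unit.T
    off_diagonal = ~np.eye(len(centers), dtype=bool)
    separation_term = np.sum(cosine[off_diagonal] + 1.0)

    return float(center_term + lambda1 * separation_term)


def combined_loss(probs, targets, features, class_ids, centers,
                  lam=0.01, lambda1=10.0):
    """BCE on the multilabel outputs plus a weighted island loss (sketch)."""
    return bce_loss(probs, targets) + lam * island_loss(
        features, class_ids, centers, lambda1
    )
```

In this sketch `probs` and `targets` would hold one column per label (emotion classes and intensity levels together), while the island term acts only on the feature vectors. How the paper weights the two terms and assigns centres in the multilabel setting is not stated in this record, so those choices are assumptions here.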