Cargando…

Two-Stream Attention Network for Pain Recognition from Video Sequences

Several approaches have been proposed for the analysis of pain-related facial expressions. These approaches range from common classification architectures based on a set of carefully designed handcrafted features, to deep neural networks characterised by an autonomous extraction of relevant facial d...

Descripción completa

Detalles Bibliográficos
Autores principales: Thiam, Patrick, Kestler, Hans A., Schwenker, Friedhelm
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7038688/
https://www.ncbi.nlm.nih.gov/pubmed/32033240
http://dx.doi.org/10.3390/s20030839
_version_ 1783500692900544512
author Thiam, Patrick
Kestler, Hans A.
Schwenker, Friedhelm
author_facet Thiam, Patrick
Kestler, Hans A.
Schwenker, Friedhelm
author_sort Thiam, Patrick
collection PubMed
description Several approaches have been proposed for the analysis of pain-related facial expressions. These approaches range from common classification architectures based on a set of carefully designed handcrafted features, to deep neural networks characterised by an autonomous extraction of relevant facial descriptors and simultaneous optimisation of a classification architecture. In the current work, an end-to-end approach based on attention networks for the analysis and recognition of pain-related facial expressions is proposed. The method combines both spatial and temporal aspects of facial expressions through a weighted aggregation of attention-based neural networks’ outputs, based on sequences of Motion History Images (MHIs) and Optical Flow Images (OFIs). Each input stream is fed into a specific attention network consisting of a Convolutional Neural Network (CNN) coupled to a Bidirectional Long Short-Term Memory (BiLSTM) Recurrent Neural Network (RNN). An attention mechanism generates a single weighted representation of each input stream (MHI sequence and OFI sequence), which is subsequently used to perform specific classification tasks. Simultaneously, a weighted aggregation of the classification scores specific to each input stream is performed to generate a final classification output. The assessment conducted on both the BioVid Heat Pain Database (Part A) and SenseEmotion Database points at the relevance of the proposed approach, as its classification performance is on par with state-of-the-art classification approaches proposed in the literature.
format Online
Article
Text
id pubmed-7038688
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-70386882020-03-09 Two-Stream Attention Network for Pain Recognition from Video Sequences Thiam, Patrick Kestler, Hans A. Schwenker, Friedhelm Sensors (Basel) Article Several approaches have been proposed for the analysis of pain-related facial expressions. These approaches range from common classification architectures based on a set of carefully designed handcrafted features, to deep neural networks characterised by an autonomous extraction of relevant facial descriptors and simultaneous optimisation of a classification architecture. In the current work, an end-to-end approach based on attention networks for the analysis and recognition of pain-related facial expressions is proposed. The method combines both spatial and temporal aspects of facial expressions through a weighted aggregation of attention-based neural networks’ outputs, based on sequences of Motion History Images (MHIs) and Optical Flow Images (OFIs). Each input stream is fed into a specific attention network consisting of a Convolutional Neural Network (CNN) coupled to a Bidirectional Long Short-Term Memory (BiLSTM) Recurrent Neural Network (RNN). An attention mechanism generates a single weighted representation of each input stream (MHI sequence and OFI sequence), which is subsequently used to perform specific classification tasks. Simultaneously, a weighted aggregation of the classification scores specific to each input stream is performed to generate a final classification output. The assessment conducted on both the BioVid Heat Pain Database (Part A) and SenseEmotion Database points at the relevance of the proposed approach, as its classification performance is on par with state-of-the-art classification approaches proposed in the literature. MDPI 2020-02-04 /pmc/articles/PMC7038688/ /pubmed/32033240 http://dx.doi.org/10.3390/s20030839 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Thiam, Patrick
Kestler, Hans A.
Schwenker, Friedhelm
Two-Stream Attention Network for Pain Recognition from Video Sequences
title Two-Stream Attention Network for Pain Recognition from Video Sequences
title_full Two-Stream Attention Network for Pain Recognition from Video Sequences
title_fullStr Two-Stream Attention Network for Pain Recognition from Video Sequences
title_full_unstemmed Two-Stream Attention Network for Pain Recognition from Video Sequences
title_short Two-Stream Attention Network for Pain Recognition from Video Sequences
title_sort two-stream attention network for pain recognition from video sequences
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7038688/
https://www.ncbi.nlm.nih.gov/pubmed/32033240
http://dx.doi.org/10.3390/s20030839
work_keys_str_mv AT thiampatrick twostreamattentionnetworkforpainrecognitionfromvideosequences
AT kestlerhansa twostreamattentionnetworkforpainrecognitionfromvideosequences
AT schwenkerfriedhelm twostreamattentionnetworkforpainrecognitionfromvideosequences