Cargando…

Novel Method for Three-Dimensional Facial Expression Recognition Using Self-Normalizing Neural Networks and Mobile Devices

Introduction To date, most ways to perform facial expression recognition rely on two-dimensional images, advanced approaches with three-dimensional data exist. These however demand stationary apparatuses and thus lack portability and possibilities to scale deployment. As human emotions, intent and e...

Descripción completa

Detalles Bibliográficos
Autores principales: Hartmann, Tim Johannes, Hartmann, Julien Ben Joachim, Friebe-Hoffmann, Ulrike, Lato, Christiane, Janni, Wolfgang, Lato, Krisztian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Georg Thieme Verlag KG 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9470291/
https://www.ncbi.nlm.nih.gov/pubmed/36110895
http://dx.doi.org/10.1055/a-1866-2943
_version_ 1784788808366030848
author Hartmann, Tim Johannes
Hartmann, Julien Ben Joachim
Friebe-Hoffmann, Ulrike
Lato, Christiane
Janni, Wolfgang
Lato, Krisztian
author_facet Hartmann, Tim Johannes
Hartmann, Julien Ben Joachim
Friebe-Hoffmann, Ulrike
Lato, Christiane
Janni, Wolfgang
Lato, Krisztian
author_sort Hartmann, Tim Johannes
collection PubMed
description Introduction To date, most ways to perform facial expression recognition rely on two-dimensional images, advanced approaches with three-dimensional data exist. These however demand stationary apparatuses and thus lack portability and possibilities to scale deployment. As human emotions, intent and even diseases may condense in distinct facial expressions or changes therein, the need for a portable yet capable solution is signified. Due to the superior informative value of three-dimensional data on facial morphology and because certain syndromes find expression in specific facial dysmorphisms, a solution should allow portable acquisition of true three-dimensional facial scans in real time. In this study we present a novel solution for the three-dimensional acquisition of facial geometry data and the recognition of facial expressions from it. The new technology presented here only requires the use of a smartphone or tablet with an integrated TrueDepth camera and enables real-time acquisition of the geometry and its categorization into distinct facial expressions. Material and Methods Our approach consisted of two parts: First, training data was acquired by asking a collective of 226 medical students to adopt defined facial expressions while their current facial morphology was captured by our specially developed app running on iPads, placed in front of the students. In total, the list of the facial expressions to be shown by the participants consisted of “disappointed”, “stressed”, “happy”, “sad” and “surprised”. Second, the data were used to train a self-normalizing neural network. A set of all factors describing the current facial expression at a time is referred to as “snapshot”. Results In total, over half a million snapshots were recorded in the study. Ultimately, the network achieved an overall accuracy of 80.54% after 400 epochs of training. In test, an overall accuracy of 81.15% was determined. Recall values differed by the category of a snapshot and ranged from 74.79% for “stressed” to 87.61% for “happy”. Precision showed similar results, whereas “sad” achieved the lowest value at 77.48% and “surprised” the highest at 86.87%. Conclusions With the present work it can be demonstrated that respectable results can be achieved even when using data sets with some challenges. Through various measures, already incorporated into an optimized version of our app, it is to be expected that the training results can be significantly improved and made more precise in the future. Currently a follow-up study with the new version of our app that encompasses the suggested alterations and adaptions, is being conducted. We aim to build a large and open database of facial scans not only for facial expression recognition but to perform disease recognition and to monitor diseases’ treatment progresses.
format Online
Article
Text
id pubmed-9470291
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Georg Thieme Verlag KG
record_format MEDLINE/PubMed
spelling pubmed-94702912022-09-14 Novel Method for Three-Dimensional Facial Expression Recognition Using Self-Normalizing Neural Networks and Mobile Devices Hartmann, Tim Johannes Hartmann, Julien Ben Joachim Friebe-Hoffmann, Ulrike Lato, Christiane Janni, Wolfgang Lato, Krisztian Geburtshilfe Frauenheilkd Introduction To date, most ways to perform facial expression recognition rely on two-dimensional images, advanced approaches with three-dimensional data exist. These however demand stationary apparatuses and thus lack portability and possibilities to scale deployment. As human emotions, intent and even diseases may condense in distinct facial expressions or changes therein, the need for a portable yet capable solution is signified. Due to the superior informative value of three-dimensional data on facial morphology and because certain syndromes find expression in specific facial dysmorphisms, a solution should allow portable acquisition of true three-dimensional facial scans in real time. In this study we present a novel solution for the three-dimensional acquisition of facial geometry data and the recognition of facial expressions from it. The new technology presented here only requires the use of a smartphone or tablet with an integrated TrueDepth camera and enables real-time acquisition of the geometry and its categorization into distinct facial expressions. Material and Methods Our approach consisted of two parts: First, training data was acquired by asking a collective of 226 medical students to adopt defined facial expressions while their current facial morphology was captured by our specially developed app running on iPads, placed in front of the students. In total, the list of the facial expressions to be shown by the participants consisted of “disappointed”, “stressed”, “happy”, “sad” and “surprised”. Second, the data were used to train a self-normalizing neural network. A set of all factors describing the current facial expression at a time is referred to as “snapshot”. Results In total, over half a million snapshots were recorded in the study. Ultimately, the network achieved an overall accuracy of 80.54% after 400 epochs of training. In test, an overall accuracy of 81.15% was determined. Recall values differed by the category of a snapshot and ranged from 74.79% for “stressed” to 87.61% for “happy”. Precision showed similar results, whereas “sad” achieved the lowest value at 77.48% and “surprised” the highest at 86.87%. Conclusions With the present work it can be demonstrated that respectable results can be achieved even when using data sets with some challenges. Through various measures, already incorporated into an optimized version of our app, it is to be expected that the training results can be significantly improved and made more precise in the future. Currently a follow-up study with the new version of our app that encompasses the suggested alterations and adaptions, is being conducted. We aim to build a large and open database of facial scans not only for facial expression recognition but to perform disease recognition and to monitor diseases’ treatment progresses. Georg Thieme Verlag KG 2022-07-21 /pmc/articles/PMC9470291/ /pubmed/36110895 http://dx.doi.org/10.1055/a-1866-2943 Text en The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial-License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commercial purposes, or adapted, remixed, transformed or built upon. (https://creativecommons.org/licenses/by-nc-nd/4.0/). https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License, which permits unrestricted reproduction and distribution, for non-commercial purposes only; and use and reproduction, but not distribution, of adapted material for non-commercial purposes only, provided the original work is properly cited.
spellingShingle Hartmann, Tim Johannes
Hartmann, Julien Ben Joachim
Friebe-Hoffmann, Ulrike
Lato, Christiane
Janni, Wolfgang
Lato, Krisztian
Novel Method for Three-Dimensional Facial Expression Recognition Using Self-Normalizing Neural Networks and Mobile Devices
title Novel Method for Three-Dimensional Facial Expression Recognition Using Self-Normalizing Neural Networks and Mobile Devices
title_full Novel Method for Three-Dimensional Facial Expression Recognition Using Self-Normalizing Neural Networks and Mobile Devices
title_fullStr Novel Method for Three-Dimensional Facial Expression Recognition Using Self-Normalizing Neural Networks and Mobile Devices
title_full_unstemmed Novel Method for Three-Dimensional Facial Expression Recognition Using Self-Normalizing Neural Networks and Mobile Devices
title_short Novel Method for Three-Dimensional Facial Expression Recognition Using Self-Normalizing Neural Networks and Mobile Devices
title_sort novel method for three-dimensional facial expression recognition using self-normalizing neural networks and mobile devices
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9470291/
https://www.ncbi.nlm.nih.gov/pubmed/36110895
http://dx.doi.org/10.1055/a-1866-2943
work_keys_str_mv AT hartmanntimjohannes novelmethodforthreedimensionalfacialexpressionrecognitionusingselfnormalizingneuralnetworksandmobiledevices
AT hartmannjulienbenjoachim novelmethodforthreedimensionalfacialexpressionrecognitionusingselfnormalizingneuralnetworksandmobiledevices
AT friebehoffmannulrike novelmethodforthreedimensionalfacialexpressionrecognitionusingselfnormalizingneuralnetworksandmobiledevices
AT latochristiane novelmethodforthreedimensionalfacialexpressionrecognitionusingselfnormalizingneuralnetworksandmobiledevices
AT janniwolfgang novelmethodforthreedimensionalfacialexpressionrecognitionusingselfnormalizingneuralnetworksandmobiledevices
AT latokrisztian novelmethodforthreedimensionalfacialexpressionrecognitionusingselfnormalizingneuralnetworksandmobiledevices