Cargando…

Teacher–student training and triplet loss to reduce the effect of drastic face occlusion: Application to emotion recognition, gender identification and age estimation

We study a series of recognition tasks in two realistic scenarios requiring the analysis of faces under strong occlusion. On the one hand, we aim to recognize facial expressions of people wearing virtual reality headsets. On the other hand, we aim to estimate the age and identify the gender of peopl...

Descripción completa

Detalles Bibliográficos
Autores principales: Georgescu, Mariana-Iuliana, Duţǎ, Georgian-Emilian, Ionescu, Radu Tudor
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8693600/
https://www.ncbi.nlm.nih.gov/pubmed/34955610
http://dx.doi.org/10.1007/s00138-021-01270-x
_version_ 1784619175472267264
author Georgescu, Mariana-Iuliana
Duţǎ, Georgian-Emilian
Ionescu, Radu Tudor
author_facet Georgescu, Mariana-Iuliana
Duţǎ, Georgian-Emilian
Ionescu, Radu Tudor
author_sort Georgescu, Mariana-Iuliana
collection PubMed
description We study a series of recognition tasks in two realistic scenarios requiring the analysis of faces under strong occlusion. On the one hand, we aim to recognize facial expressions of people wearing virtual reality headsets. On the other hand, we aim to estimate the age and identify the gender of people wearing surgical masks. For all these tasks, the common ground is that half of the face is occluded. In this challenging setting, we show that convolutional neural networks trained on fully visible faces exhibit very low performance levels. While fine-tuning the deep learning models on occluded faces is extremely useful, we show that additional performance gains can be obtained by distilling knowledge from models trained on fully visible faces. To this end, we study two knowledge distillation methods, one based on teacher–student training and one based on triplet loss. Our main contribution consists in a novel approach for knowledge distillation based on triplet loss, which generalizes across models and tasks. Furthermore, we consider combining distilled models learned through conventional teacher–student training or through our novel teacher–student training based on triplet loss. We provide empirical evidence showing that, in most cases, both individual and combined knowledge distillation methods bring statistically significant performance improvements. We conduct experiments with three different neural models (VGG-f, VGG-face and ResNet-50) on various tasks (facial expression recognition, gender recognition, age estimation), showing consistent improvements regardless of the model or task.
format Online
Article
Text
id pubmed-8693600
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-86936002021-12-22 Teacher–student training and triplet loss to reduce the effect of drastic face occlusion: Application to emotion recognition, gender identification and age estimation Georgescu, Mariana-Iuliana Duţǎ, Georgian-Emilian Ionescu, Radu Tudor Mach Vis Appl Special Issue Paper We study a series of recognition tasks in two realistic scenarios requiring the analysis of faces under strong occlusion. On the one hand, we aim to recognize facial expressions of people wearing virtual reality headsets. On the other hand, we aim to estimate the age and identify the gender of people wearing surgical masks. For all these tasks, the common ground is that half of the face is occluded. In this challenging setting, we show that convolutional neural networks trained on fully visible faces exhibit very low performance levels. While fine-tuning the deep learning models on occluded faces is extremely useful, we show that additional performance gains can be obtained by distilling knowledge from models trained on fully visible faces. To this end, we study two knowledge distillation methods, one based on teacher–student training and one based on triplet loss. Our main contribution consists in a novel approach for knowledge distillation based on triplet loss, which generalizes across models and tasks. Furthermore, we consider combining distilled models learned through conventional teacher–student training or through our novel teacher–student training based on triplet loss. We provide empirical evidence showing that, in most cases, both individual and combined knowledge distillation methods bring statistically significant performance improvements. We conduct experiments with three different neural models (VGG-f, VGG-face and ResNet-50) on various tasks (facial expression recognition, gender recognition, age estimation), showing consistent improvements regardless of the model or task. Springer Berlin Heidelberg 2021-12-22 2022 /pmc/articles/PMC8693600/ /pubmed/34955610 http://dx.doi.org/10.1007/s00138-021-01270-x Text en © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Special Issue Paper
Georgescu, Mariana-Iuliana
Duţǎ, Georgian-Emilian
Ionescu, Radu Tudor
Teacher–student training and triplet loss to reduce the effect of drastic face occlusion: Application to emotion recognition, gender identification and age estimation
title Teacher–student training and triplet loss to reduce the effect of drastic face occlusion: Application to emotion recognition, gender identification and age estimation
title_full Teacher–student training and triplet loss to reduce the effect of drastic face occlusion: Application to emotion recognition, gender identification and age estimation
title_fullStr Teacher–student training and triplet loss to reduce the effect of drastic face occlusion: Application to emotion recognition, gender identification and age estimation
title_full_unstemmed Teacher–student training and triplet loss to reduce the effect of drastic face occlusion: Application to emotion recognition, gender identification and age estimation
title_short Teacher–student training and triplet loss to reduce the effect of drastic face occlusion: Application to emotion recognition, gender identification and age estimation
title_sort teacher–student training and triplet loss to reduce the effect of drastic face occlusion: application to emotion recognition, gender identification and age estimation
topic Special Issue Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8693600/
https://www.ncbi.nlm.nih.gov/pubmed/34955610
http://dx.doi.org/10.1007/s00138-021-01270-x
work_keys_str_mv AT georgescumarianaiuliana teacherstudenttrainingandtripletlosstoreducetheeffectofdrasticfaceocclusionapplicationtoemotionrecognitiongenderidentificationandageestimation
AT dutageorgianemilian teacherstudenttrainingandtripletlosstoreducetheeffectofdrasticfaceocclusionapplicationtoemotionrecognitiongenderidentificationandageestimation
AT ionescuradutudor teacherstudenttrainingandtripletlosstoreducetheeffectofdrasticfaceocclusionapplicationtoemotionrecognitiongenderidentificationandageestimation