Cargando…
Teacher–student training and triplet loss to reduce the effect of drastic face occlusion: Application to emotion recognition, gender identification and age estimation
We study a series of recognition tasks in two realistic scenarios requiring the analysis of faces under strong occlusion. On the one hand, we aim to recognize facial expressions of people wearing virtual reality headsets. On the other hand, we aim to estimate the age and identify the gender of peopl...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8693600/ https://www.ncbi.nlm.nih.gov/pubmed/34955610 http://dx.doi.org/10.1007/s00138-021-01270-x |
_version_ | 1784619175472267264 |
---|---|
author | Georgescu, Mariana-Iuliana Duţǎ, Georgian-Emilian Ionescu, Radu Tudor |
author_facet | Georgescu, Mariana-Iuliana Duţǎ, Georgian-Emilian Ionescu, Radu Tudor |
author_sort | Georgescu, Mariana-Iuliana |
collection | PubMed |
description | We study a series of recognition tasks in two realistic scenarios requiring the analysis of faces under strong occlusion. On the one hand, we aim to recognize facial expressions of people wearing virtual reality headsets. On the other hand, we aim to estimate the age and identify the gender of people wearing surgical masks. For all these tasks, the common ground is that half of the face is occluded. In this challenging setting, we show that convolutional neural networks trained on fully visible faces exhibit very low performance levels. While fine-tuning the deep learning models on occluded faces is extremely useful, we show that additional performance gains can be obtained by distilling knowledge from models trained on fully visible faces. To this end, we study two knowledge distillation methods, one based on teacher–student training and one based on triplet loss. Our main contribution consists in a novel approach for knowledge distillation based on triplet loss, which generalizes across models and tasks. Furthermore, we consider combining distilled models learned through conventional teacher–student training or through our novel teacher–student training based on triplet loss. We provide empirical evidence showing that, in most cases, both individual and combined knowledge distillation methods bring statistically significant performance improvements. We conduct experiments with three different neural models (VGG-f, VGG-face and ResNet-50) on various tasks (facial expression recognition, gender recognition, age estimation), showing consistent improvements regardless of the model or task. |
format | Online Article Text |
id | pubmed-8693600 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-86936002021-12-22 Teacher–student training and triplet loss to reduce the effect of drastic face occlusion: Application to emotion recognition, gender identification and age estimation Georgescu, Mariana-Iuliana Duţǎ, Georgian-Emilian Ionescu, Radu Tudor Mach Vis Appl Special Issue Paper We study a series of recognition tasks in two realistic scenarios requiring the analysis of faces under strong occlusion. On the one hand, we aim to recognize facial expressions of people wearing virtual reality headsets. On the other hand, we aim to estimate the age and identify the gender of people wearing surgical masks. For all these tasks, the common ground is that half of the face is occluded. In this challenging setting, we show that convolutional neural networks trained on fully visible faces exhibit very low performance levels. While fine-tuning the deep learning models on occluded faces is extremely useful, we show that additional performance gains can be obtained by distilling knowledge from models trained on fully visible faces. To this end, we study two knowledge distillation methods, one based on teacher–student training and one based on triplet loss. Our main contribution consists in a novel approach for knowledge distillation based on triplet loss, which generalizes across models and tasks. Furthermore, we consider combining distilled models learned through conventional teacher–student training or through our novel teacher–student training based on triplet loss. We provide empirical evidence showing that, in most cases, both individual and combined knowledge distillation methods bring statistically significant performance improvements. We conduct experiments with three different neural models (VGG-f, VGG-face and ResNet-50) on various tasks (facial expression recognition, gender recognition, age estimation), showing consistent improvements regardless of the model or task. Springer Berlin Heidelberg 2021-12-22 2022 /pmc/articles/PMC8693600/ /pubmed/34955610 http://dx.doi.org/10.1007/s00138-021-01270-x Text en © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Special Issue Paper Georgescu, Mariana-Iuliana Duţǎ, Georgian-Emilian Ionescu, Radu Tudor Teacher–student training and triplet loss to reduce the effect of drastic face occlusion: Application to emotion recognition, gender identification and age estimation |
title | Teacher–student training and triplet loss to reduce the effect of drastic face occlusion: Application to emotion recognition, gender identification and age estimation |
title_full | Teacher–student training and triplet loss to reduce the effect of drastic face occlusion: Application to emotion recognition, gender identification and age estimation |
title_fullStr | Teacher–student training and triplet loss to reduce the effect of drastic face occlusion: Application to emotion recognition, gender identification and age estimation |
title_full_unstemmed | Teacher–student training and triplet loss to reduce the effect of drastic face occlusion: Application to emotion recognition, gender identification and age estimation |
title_short | Teacher–student training and triplet loss to reduce the effect of drastic face occlusion: Application to emotion recognition, gender identification and age estimation |
title_sort | teacher–student training and triplet loss to reduce the effect of drastic face occlusion: application to emotion recognition, gender identification and age estimation |
topic | Special Issue Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8693600/ https://www.ncbi.nlm.nih.gov/pubmed/34955610 http://dx.doi.org/10.1007/s00138-021-01270-x |
work_keys_str_mv | AT georgescumarianaiuliana teacherstudenttrainingandtripletlosstoreducetheeffectofdrasticfaceocclusionapplicationtoemotionrecognitiongenderidentificationandageestimation AT dutageorgianemilian teacherstudenttrainingandtripletlosstoreducetheeffectofdrasticfaceocclusionapplicationtoemotionrecognitiongenderidentificationandageestimation AT ionescuradutudor teacherstudenttrainingandtripletlosstoreducetheeffectofdrasticfaceocclusionapplicationtoemotionrecognitiongenderidentificationandageestimation |