Cargando…

SFPD: Simultaneous Face and Person Detection in Real-Time for Human–Robot Interaction

Face and person detection are important tasks in computer vision, as they represent the first component in many recognition systems, such as face recognition, facial expression analysis, body pose estimation, face attribute detection, or human action recognition. Thereby, their detection rate and ru...

Descripción completa

Detalles Bibliográficos
Autores principales: Fiedler, Marc-André, Werner, Philipp, Khalifa, Aly, Al-Hamadi, Ayoub
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434547/
https://www.ncbi.nlm.nih.gov/pubmed/34502809
http://dx.doi.org/10.3390/s21175918
_version_ 1783751625780756480
author Fiedler, Marc-André
Werner, Philipp
Khalifa, Aly
Al-Hamadi, Ayoub
author_facet Fiedler, Marc-André
Werner, Philipp
Khalifa, Aly
Al-Hamadi, Ayoub
author_sort Fiedler, Marc-André
collection PubMed
description Face and person detection are important tasks in computer vision, as they represent the first component in many recognition systems, such as face recognition, facial expression analysis, body pose estimation, face attribute detection, or human action recognition. Thereby, their detection rate and runtime are crucial for the performance of the overall system. In this paper, we combine both face and person detection in one framework with the goal of reaching a detection performance that is competitive to the state of the art of lightweight object-specific networks while maintaining real-time processing speed for both detection tasks together. In order to combine face and person detection in one network, we applied multi-task learning. The difficulty lies in the fact that no datasets are available that contain both face as well as person annotations. Since we did not have the resources to manually annotate the datasets, as it is very time-consuming and automatic generation of ground truths results in annotations of poor quality, we solve this issue algorithmically by applying a special training procedure and network architecture without the need of creating new labels. Our newly developed method called Simultaneous Face and Person Detection (SFPD) is able to detect persons and faces with 40 frames per second. Because of this good trade-off between detection performance and inference time, SFPD represents a useful and valuable real-time framework especially for a multitude of real-world applications such as, e.g., human–robot interaction.
format Online
Article
Text
id pubmed-8434547
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-84345472021-09-12 SFPD: Simultaneous Face and Person Detection in Real-Time for Human–Robot Interaction Fiedler, Marc-André Werner, Philipp Khalifa, Aly Al-Hamadi, Ayoub Sensors (Basel) Article Face and person detection are important tasks in computer vision, as they represent the first component in many recognition systems, such as face recognition, facial expression analysis, body pose estimation, face attribute detection, or human action recognition. Thereby, their detection rate and runtime are crucial for the performance of the overall system. In this paper, we combine both face and person detection in one framework with the goal of reaching a detection performance that is competitive to the state of the art of lightweight object-specific networks while maintaining real-time processing speed for both detection tasks together. In order to combine face and person detection in one network, we applied multi-task learning. The difficulty lies in the fact that no datasets are available that contain both face as well as person annotations. Since we did not have the resources to manually annotate the datasets, as it is very time-consuming and automatic generation of ground truths results in annotations of poor quality, we solve this issue algorithmically by applying a special training procedure and network architecture without the need of creating new labels. Our newly developed method called Simultaneous Face and Person Detection (SFPD) is able to detect persons and faces with 40 frames per second. Because of this good trade-off between detection performance and inference time, SFPD represents a useful and valuable real-time framework especially for a multitude of real-world applications such as, e.g., human–robot interaction. MDPI 2021-09-02 /pmc/articles/PMC8434547/ /pubmed/34502809 http://dx.doi.org/10.3390/s21175918 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Fiedler, Marc-André
Werner, Philipp
Khalifa, Aly
Al-Hamadi, Ayoub
SFPD: Simultaneous Face and Person Detection in Real-Time for Human–Robot Interaction
title SFPD: Simultaneous Face and Person Detection in Real-Time for Human–Robot Interaction
title_full SFPD: Simultaneous Face and Person Detection in Real-Time for Human–Robot Interaction
title_fullStr SFPD: Simultaneous Face and Person Detection in Real-Time for Human–Robot Interaction
title_full_unstemmed SFPD: Simultaneous Face and Person Detection in Real-Time for Human–Robot Interaction
title_short SFPD: Simultaneous Face and Person Detection in Real-Time for Human–Robot Interaction
title_sort sfpd: simultaneous face and person detection in real-time for human–robot interaction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434547/
https://www.ncbi.nlm.nih.gov/pubmed/34502809
http://dx.doi.org/10.3390/s21175918
work_keys_str_mv AT fiedlermarcandre sfpdsimultaneousfaceandpersondetectioninrealtimeforhumanrobotinteraction
AT wernerphilipp sfpdsimultaneousfaceandpersondetectioninrealtimeforhumanrobotinteraction
AT khalifaaly sfpdsimultaneousfaceandpersondetectioninrealtimeforhumanrobotinteraction
AT alhamadiayoub sfpdsimultaneousfaceandpersondetectioninrealtimeforhumanrobotinteraction