Cargando…

Detecting Facial Region and Landmarks at Once via Deep Network †

For accurate and fast detection of facial landmarks, we propose a new facial landmark detection method. Previous facial landmark detection models generally perform a face detection step before landmark detection. This greatly affects landmark detection performance depending on which face detection m...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Taehyung, Mok, Jiwon, Lee, Euichul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8401714/
https://www.ncbi.nlm.nih.gov/pubmed/34450804
http://dx.doi.org/10.3390/s21165360
_version_ 1783745616492363776
author Kim, Taehyung
Mok, Jiwon
Lee, Euichul
author_facet Kim, Taehyung
Mok, Jiwon
Lee, Euichul
author_sort Kim, Taehyung
collection PubMed
description For accurate and fast detection of facial landmarks, we propose a new facial landmark detection method. Previous facial landmark detection models generally perform a face detection step before landmark detection. This greatly affects landmark detection performance depending on which face detection model is used. Therefore, we propose a model that can simultaneously detect a face region and a landmark without performing the face detection step before landmark detection. The proposed single-shot detection model is based on the framework of YOLOv3, a one-stage object detection method, and the loss function and structure are altered to learn faces and landmarks at the same time. In addition, EfficientNet-B0 was utilized as the backbone network to increase processing speed and accuracy. The learned database used 300W-LP with 64 facial landmarks. The average normalized error of the proposed model was 2.32 pixels. The processing time per frame was about 15 milliseconds, and the average precision of face detection was about 99%. As a result of the evaluation, it was confirmed that the single-shot detection model has better performance and speed than the previous methods. In addition, as a result of using the COFW database, which has 29 landmarks instead of 64 to verify the proposed method, the average normalization error was 2.56 pixels, which was also confirmed to show promising performance.
format Online
Article
Text
id pubmed-8401714
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-84017142021-08-29 Detecting Facial Region and Landmarks at Once via Deep Network † Kim, Taehyung Mok, Jiwon Lee, Euichul Sensors (Basel) Communication For accurate and fast detection of facial landmarks, we propose a new facial landmark detection method. Previous facial landmark detection models generally perform a face detection step before landmark detection. This greatly affects landmark detection performance depending on which face detection model is used. Therefore, we propose a model that can simultaneously detect a face region and a landmark without performing the face detection step before landmark detection. The proposed single-shot detection model is based on the framework of YOLOv3, a one-stage object detection method, and the loss function and structure are altered to learn faces and landmarks at the same time. In addition, EfficientNet-B0 was utilized as the backbone network to increase processing speed and accuracy. The learned database used 300W-LP with 64 facial landmarks. The average normalized error of the proposed model was 2.32 pixels. The processing time per frame was about 15 milliseconds, and the average precision of face detection was about 99%. As a result of the evaluation, it was confirmed that the single-shot detection model has better performance and speed than the previous methods. In addition, as a result of using the COFW database, which has 29 landmarks instead of 64 to verify the proposed method, the average normalization error was 2.56 pixels, which was also confirmed to show promising performance. MDPI 2021-08-09 /pmc/articles/PMC8401714/ /pubmed/34450804 http://dx.doi.org/10.3390/s21165360 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Communication
Kim, Taehyung
Mok, Jiwon
Lee, Euichul
Detecting Facial Region and Landmarks at Once via Deep Network †
title Detecting Facial Region and Landmarks at Once via Deep Network †
title_full Detecting Facial Region and Landmarks at Once via Deep Network †
title_fullStr Detecting Facial Region and Landmarks at Once via Deep Network †
title_full_unstemmed Detecting Facial Region and Landmarks at Once via Deep Network †
title_short Detecting Facial Region and Landmarks at Once via Deep Network †
title_sort detecting facial region and landmarks at once via deep network †
topic Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8401714/
https://www.ncbi.nlm.nih.gov/pubmed/34450804
http://dx.doi.org/10.3390/s21165360
work_keys_str_mv AT kimtaehyung detectingfacialregionandlandmarksatonceviadeepnetwork
AT mokjiwon detectingfacialregionandlandmarksatonceviadeepnetwork
AT leeeuichul detectingfacialregionandlandmarksatonceviadeepnetwork