Cargando…
Detecting Facial Region and Landmarks at Once via Deep Network †
For accurate and fast detection of facial landmarks, we propose a new facial landmark detection method. Previous facial landmark detection models generally perform a face detection step before landmark detection. This greatly affects landmark detection performance depending on which face detection m...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8401714/ https://www.ncbi.nlm.nih.gov/pubmed/34450804 http://dx.doi.org/10.3390/s21165360 |
_version_ | 1783745616492363776 |
---|---|
author | Kim, Taehyung Mok, Jiwon Lee, Euichul |
author_facet | Kim, Taehyung Mok, Jiwon Lee, Euichul |
author_sort | Kim, Taehyung |
collection | PubMed |
description | For accurate and fast detection of facial landmarks, we propose a new facial landmark detection method. Previous facial landmark detection models generally perform a face detection step before landmark detection. This greatly affects landmark detection performance depending on which face detection model is used. Therefore, we propose a model that can simultaneously detect a face region and a landmark without performing the face detection step before landmark detection. The proposed single-shot detection model is based on the framework of YOLOv3, a one-stage object detection method, and the loss function and structure are altered to learn faces and landmarks at the same time. In addition, EfficientNet-B0 was utilized as the backbone network to increase processing speed and accuracy. The learned database used 300W-LP with 64 facial landmarks. The average normalized error of the proposed model was 2.32 pixels. The processing time per frame was about 15 milliseconds, and the average precision of face detection was about 99%. As a result of the evaluation, it was confirmed that the single-shot detection model has better performance and speed than the previous methods. In addition, as a result of using the COFW database, which has 29 landmarks instead of 64 to verify the proposed method, the average normalization error was 2.56 pixels, which was also confirmed to show promising performance. |
format | Online Article Text |
id | pubmed-8401714 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-84017142021-08-29 Detecting Facial Region and Landmarks at Once via Deep Network † Kim, Taehyung Mok, Jiwon Lee, Euichul Sensors (Basel) Communication For accurate and fast detection of facial landmarks, we propose a new facial landmark detection method. Previous facial landmark detection models generally perform a face detection step before landmark detection. This greatly affects landmark detection performance depending on which face detection model is used. Therefore, we propose a model that can simultaneously detect a face region and a landmark without performing the face detection step before landmark detection. The proposed single-shot detection model is based on the framework of YOLOv3, a one-stage object detection method, and the loss function and structure are altered to learn faces and landmarks at the same time. In addition, EfficientNet-B0 was utilized as the backbone network to increase processing speed and accuracy. The learned database used 300W-LP with 64 facial landmarks. The average normalized error of the proposed model was 2.32 pixels. The processing time per frame was about 15 milliseconds, and the average precision of face detection was about 99%. As a result of the evaluation, it was confirmed that the single-shot detection model has better performance and speed than the previous methods. In addition, as a result of using the COFW database, which has 29 landmarks instead of 64 to verify the proposed method, the average normalization error was 2.56 pixels, which was also confirmed to show promising performance. MDPI 2021-08-09 /pmc/articles/PMC8401714/ /pubmed/34450804 http://dx.doi.org/10.3390/s21165360 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Communication Kim, Taehyung Mok, Jiwon Lee, Euichul Detecting Facial Region and Landmarks at Once via Deep Network † |
title | Detecting Facial Region and Landmarks at Once via Deep Network † |
title_full | Detecting Facial Region and Landmarks at Once via Deep Network † |
title_fullStr | Detecting Facial Region and Landmarks at Once via Deep Network † |
title_full_unstemmed | Detecting Facial Region and Landmarks at Once via Deep Network † |
title_short | Detecting Facial Region and Landmarks at Once via Deep Network † |
title_sort | detecting facial region and landmarks at once via deep network † |
topic | Communication |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8401714/ https://www.ncbi.nlm.nih.gov/pubmed/34450804 http://dx.doi.org/10.3390/s21165360 |
work_keys_str_mv | AT kimtaehyung detectingfacialregionandlandmarksatonceviadeepnetwork AT mokjiwon detectingfacialregionandlandmarksatonceviadeepnetwork AT leeeuichul detectingfacialregionandlandmarksatonceviadeepnetwork |