Cargando…

Detecting Facial Region and Landmarks at Once via Deep Network †

For accurate and fast detection of facial landmarks, we propose a new facial landmark detection method. Previous facial landmark detection models generally perform a face detection step before landmark detection. This greatly affects landmark detection performance depending on which face detection m...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kim, Taehyung, Mok, Jiwon, Lee, Euichul
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Communication
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8401714/ https://www.ncbi.nlm.nih.gov/pubmed/34450804 http://dx.doi.org/10.3390/s21165360

_version_	1783745616492363776
author	Kim, Taehyung Mok, Jiwon Lee, Euichul
author_facet	Kim, Taehyung Mok, Jiwon Lee, Euichul
author_sort	Kim, Taehyung
collection	PubMed
description	For accurate and fast detection of facial landmarks, we propose a new facial landmark detection method. Previous facial landmark detection models generally perform a face detection step before landmark detection. This greatly affects landmark detection performance depending on which face detection model is used. Therefore, we propose a model that can simultaneously detect a face region and a landmark without performing the face detection step before landmark detection. The proposed single-shot detection model is based on the framework of YOLOv3, a one-stage object detection method, and the loss function and structure are altered to learn faces and landmarks at the same time. In addition, EfficientNet-B0 was utilized as the backbone network to increase processing speed and accuracy. The learned database used 300W-LP with 64 facial landmarks. The average normalized error of the proposed model was 2.32 pixels. The processing time per frame was about 15 milliseconds, and the average precision of face detection was about 99%. As a result of the evaluation, it was confirmed that the single-shot detection model has better performance and speed than the previous methods. In addition, as a result of using the COFW database, which has 29 landmarks instead of 64 to verify the proposed method, the average normalization error was 2.56 pixels, which was also confirmed to show promising performance.
format	Online Article Text
id	pubmed-8401714
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-84017142021-08-29 Detecting Facial Region and Landmarks at Once via Deep Network † Kim, Taehyung Mok, Jiwon Lee, Euichul Sensors (Basel) Communication For accurate and fast detection of facial landmarks, we propose a new facial landmark detection method. Previous facial landmark detection models generally perform a face detection step before landmark detection. This greatly affects landmark detection performance depending on which face detection model is used. Therefore, we propose a model that can simultaneously detect a face region and a landmark without performing the face detection step before landmark detection. The proposed single-shot detection model is based on the framework of YOLOv3, a one-stage object detection method, and the loss function and structure are altered to learn faces and landmarks at the same time. In addition, EfficientNet-B0 was utilized as the backbone network to increase processing speed and accuracy. The learned database used 300W-LP with 64 facial landmarks. The average normalized error of the proposed model was 2.32 pixels. The processing time per frame was about 15 milliseconds, and the average precision of face detection was about 99%. As a result of the evaluation, it was confirmed that the single-shot detection model has better performance and speed than the previous methods. In addition, as a result of using the COFW database, which has 29 landmarks instead of 64 to verify the proposed method, the average normalization error was 2.56 pixels, which was also confirmed to show promising performance. MDPI 2021-08-09 /pmc/articles/PMC8401714/ /pubmed/34450804 http://dx.doi.org/10.3390/s21165360 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Communication Kim, Taehyung Mok, Jiwon Lee, Euichul Detecting Facial Region and Landmarks at Once via Deep Network †
title	Detecting Facial Region and Landmarks at Once via Deep Network †
title_full	Detecting Facial Region and Landmarks at Once via Deep Network †
title_fullStr	Detecting Facial Region and Landmarks at Once via Deep Network †
title_full_unstemmed	Detecting Facial Region and Landmarks at Once via Deep Network †
title_short	Detecting Facial Region and Landmarks at Once via Deep Network †
title_sort	detecting facial region and landmarks at once via deep network †
topic	Communication
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8401714/ https://www.ncbi.nlm.nih.gov/pubmed/34450804 http://dx.doi.org/10.3390/s21165360
work_keys_str_mv	AT kimtaehyung detectingfacialregionandlandmarksatonceviadeepnetwork AT mokjiwon detectingfacialregionandlandmarksatonceviadeepnetwork AT leeeuichul detectingfacialregionandlandmarksatonceviadeepnetwork

Detecting Facial Region and Landmarks at Once via Deep Network †

Ejemplares similares