Cargando…

Combining CNNs and Markov-like Models for Facial Landmark Detection with Spatial Consistency Estimates

The accurate localization of facial landmarks is essential for several tasks, including face recognition, head pose estimation, facial region extraction, and emotion detection. Although the number of required landmarks is task-specific, models are typically trained on all available landmarks in the...

Descripción completa

Detalles Bibliográficos
Autores principales:	Gdoura, Ahmed, Degünther, Markus, Lorenz, Birgit, Effland, Alexander
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10218977/ https://www.ncbi.nlm.nih.gov/pubmed/37233323 http://dx.doi.org/10.3390/jimaging9050104

_version_	1785048901023170560
author	Gdoura, Ahmed Degünther, Markus Lorenz, Birgit Effland, Alexander
author_facet	Gdoura, Ahmed Degünther, Markus Lorenz, Birgit Effland, Alexander
author_sort	Gdoura, Ahmed
collection	PubMed
description	The accurate localization of facial landmarks is essential for several tasks, including face recognition, head pose estimation, facial region extraction, and emotion detection. Although the number of required landmarks is task-specific, models are typically trained on all available landmarks in the datasets, limiting efficiency. Furthermore, model performance is strongly influenced by scale-dependent local appearance information around landmarks and the global shape information generated by them. To account for this, we propose a lightweight hybrid model for facial landmark detection designed specifically for pupil region extraction. Our design combines a convolutional neural network (CNN) with a Markov random field (MRF)-like process trained on only 17 carefully selected landmarks. The advantage of our model is the ability to run different image scales on the same convolutional layers, resulting in a significant reduction in model size. In addition, we employ an approximation of the MRF that is run on a subset of landmarks to validate the spatial consistency of the generated shape. This validation process is performed against a learned conditional distribution, expressing the location of one landmark relative to its neighbor. Experimental results on popular facial landmark localization datasets such as 300 w, WFLW, and HELEN demonstrate the accuracy of our proposed model. Furthermore, our model achieves state-of-the-art performance on a well-defined robustness metric. In conclusion, the results demonstrate the ability of our lightweight model to filter out spatially inconsistent predictions, even with significantly fewer training landmarks.
format	Online Article Text
id	pubmed-10218977
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-102189772023-05-27 Combining CNNs and Markov-like Models for Facial Landmark Detection with Spatial Consistency Estimates Gdoura, Ahmed Degünther, Markus Lorenz, Birgit Effland, Alexander J Imaging Article The accurate localization of facial landmarks is essential for several tasks, including face recognition, head pose estimation, facial region extraction, and emotion detection. Although the number of required landmarks is task-specific, models are typically trained on all available landmarks in the datasets, limiting efficiency. Furthermore, model performance is strongly influenced by scale-dependent local appearance information around landmarks and the global shape information generated by them. To account for this, we propose a lightweight hybrid model for facial landmark detection designed specifically for pupil region extraction. Our design combines a convolutional neural network (CNN) with a Markov random field (MRF)-like process trained on only 17 carefully selected landmarks. The advantage of our model is the ability to run different image scales on the same convolutional layers, resulting in a significant reduction in model size. In addition, we employ an approximation of the MRF that is run on a subset of landmarks to validate the spatial consistency of the generated shape. This validation process is performed against a learned conditional distribution, expressing the location of one landmark relative to its neighbor. Experimental results on popular facial landmark localization datasets such as 300 w, WFLW, and HELEN demonstrate the accuracy of our proposed model. Furthermore, our model achieves state-of-the-art performance on a well-defined robustness metric. In conclusion, the results demonstrate the ability of our lightweight model to filter out spatially inconsistent predictions, even with significantly fewer training landmarks. MDPI 2023-05-22 /pmc/articles/PMC10218977/ /pubmed/37233323 http://dx.doi.org/10.3390/jimaging9050104 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Gdoura, Ahmed Degünther, Markus Lorenz, Birgit Effland, Alexander Combining CNNs and Markov-like Models for Facial Landmark Detection with Spatial Consistency Estimates
title	Combining CNNs and Markov-like Models for Facial Landmark Detection with Spatial Consistency Estimates
title_full	Combining CNNs and Markov-like Models for Facial Landmark Detection with Spatial Consistency Estimates
title_fullStr	Combining CNNs and Markov-like Models for Facial Landmark Detection with Spatial Consistency Estimates
title_full_unstemmed	Combining CNNs and Markov-like Models for Facial Landmark Detection with Spatial Consistency Estimates
title_short	Combining CNNs and Markov-like Models for Facial Landmark Detection with Spatial Consistency Estimates
title_sort	combining cnns and markov-like models for facial landmark detection with spatial consistency estimates
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10218977/ https://www.ncbi.nlm.nih.gov/pubmed/37233323 http://dx.doi.org/10.3390/jimaging9050104
work_keys_str_mv	AT gdouraahmed combiningcnnsandmarkovlikemodelsforfaciallandmarkdetectionwithspatialconsistencyestimates AT degunthermarkus combiningcnnsandmarkovlikemodelsforfaciallandmarkdetectionwithspatialconsistencyestimates AT lorenzbirgit combiningcnnsandmarkovlikemodelsforfaciallandmarkdetectionwithspatialconsistencyestimates AT efflandalexander combiningcnnsandmarkovlikemodelsforfaciallandmarkdetectionwithspatialconsistencyestimates

Combining CNNs and Markov-like Models for Facial Landmark Detection with Spatial Consistency Estimates

Ejemplares similares