Cargando…

Single Camera Face Position-Invariant Driver’s Gaze Zone Classifier Based on Frame-Sequence Recognition Using 3D Convolutional Neural Networks

Estimating the driver’s gaze in a natural real-world setting can be problematic for different challenging scenario conditions. For example, faces will undergo facial occlusions, illumination, or various face positions while driving. In this effort, we aim to reduce misclassifications in driving situ...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lollett, Catherine, Kamezaki, Mitsuhiro, Sugano, Shigeki
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9370862/ https://www.ncbi.nlm.nih.gov/pubmed/35957412 http://dx.doi.org/10.3390/s22155857

_version_	1784766944798310400
author	Lollett, Catherine Kamezaki, Mitsuhiro Sugano, Shigeki
author_facet	Lollett, Catherine Kamezaki, Mitsuhiro Sugano, Shigeki
author_sort	Lollett, Catherine
collection	PubMed
description	Estimating the driver’s gaze in a natural real-world setting can be problematic for different challenging scenario conditions. For example, faces will undergo facial occlusions, illumination, or various face positions while driving. In this effort, we aim to reduce misclassifications in driving situations when the driver has different face distances regarding the camera. Three-dimensional Convolutional Neural Networks (CNN) models can make a spatio-temporal driver’s representation that extracts features encoded in multiple adjacent frames that can describe motions. This characteristic may help ease the deficiencies of a per-frame recognition system due to the lack of context information. For example, the front, navigator, right window, left window, back mirror, and speed meter are part of the known common areas to be checked by drivers. Based on this, we implement and evaluate a model that is able to detect the head direction toward these regions having various distances from the camera. In our evaluation, the 2D CNN model had a mean average recall of 74.96% across the three models, whereas the 3D CNN model had a mean average recall of 87.02%. This result show that our proposed 3D CNN-based approach outperforms a 2D CNN per-frame recognition approach in driving situations when the driver’s face has different distances from the camera.
format	Online Article Text
id	pubmed-9370862
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-93708622022-08-12 Single Camera Face Position-Invariant Driver’s Gaze Zone Classifier Based on Frame-Sequence Recognition Using 3D Convolutional Neural Networks Lollett, Catherine Kamezaki, Mitsuhiro Sugano, Shigeki Sensors (Basel) Article Estimating the driver’s gaze in a natural real-world setting can be problematic for different challenging scenario conditions. For example, faces will undergo facial occlusions, illumination, or various face positions while driving. In this effort, we aim to reduce misclassifications in driving situations when the driver has different face distances regarding the camera. Three-dimensional Convolutional Neural Networks (CNN) models can make a spatio-temporal driver’s representation that extracts features encoded in multiple adjacent frames that can describe motions. This characteristic may help ease the deficiencies of a per-frame recognition system due to the lack of context information. For example, the front, navigator, right window, left window, back mirror, and speed meter are part of the known common areas to be checked by drivers. Based on this, we implement and evaluate a model that is able to detect the head direction toward these regions having various distances from the camera. In our evaluation, the 2D CNN model had a mean average recall of 74.96% across the three models, whereas the 3D CNN model had a mean average recall of 87.02%. This result show that our proposed 3D CNN-based approach outperforms a 2D CNN per-frame recognition approach in driving situations when the driver’s face has different distances from the camera. MDPI 2022-08-05 /pmc/articles/PMC9370862/ /pubmed/35957412 http://dx.doi.org/10.3390/s22155857 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Lollett, Catherine Kamezaki, Mitsuhiro Sugano, Shigeki Single Camera Face Position-Invariant Driver’s Gaze Zone Classifier Based on Frame-Sequence Recognition Using 3D Convolutional Neural Networks
title	Single Camera Face Position-Invariant Driver’s Gaze Zone Classifier Based on Frame-Sequence Recognition Using 3D Convolutional Neural Networks
title_full	Single Camera Face Position-Invariant Driver’s Gaze Zone Classifier Based on Frame-Sequence Recognition Using 3D Convolutional Neural Networks
title_fullStr	Single Camera Face Position-Invariant Driver’s Gaze Zone Classifier Based on Frame-Sequence Recognition Using 3D Convolutional Neural Networks
title_full_unstemmed	Single Camera Face Position-Invariant Driver’s Gaze Zone Classifier Based on Frame-Sequence Recognition Using 3D Convolutional Neural Networks
title_short	Single Camera Face Position-Invariant Driver’s Gaze Zone Classifier Based on Frame-Sequence Recognition Using 3D Convolutional Neural Networks
title_sort	single camera face position-invariant driver’s gaze zone classifier based on frame-sequence recognition using 3d convolutional neural networks
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9370862/ https://www.ncbi.nlm.nih.gov/pubmed/35957412 http://dx.doi.org/10.3390/s22155857
work_keys_str_mv	AT lollettcatherine singlecamerafacepositioninvariantdriversgazezoneclassifierbasedonframesequencerecognitionusing3dconvolutionalneuralnetworks AT kamezakimitsuhiro singlecamerafacepositioninvariantdriversgazezoneclassifierbasedonframesequencerecognitionusing3dconvolutionalneuralnetworks AT suganoshigeki singlecamerafacepositioninvariantdriversgazezoneclassifierbasedonframesequencerecognitionusing3dconvolutionalneuralnetworks

Single Camera Face Position-Invariant Driver’s Gaze Zone Classifier Based on Frame-Sequence Recognition Using 3D Convolutional Neural Networks

Ejemplares similares