
Real-Time Multi-Scale Face Detector on Embedded Devices

Face detection is the basic step in video face analysis and has been studied for many years. However, achieving real-time performance on computation-resource-limited embedded devices still remains an open challenge. To address this problem, in this paper we propose a face detector, EagleEye, which s...


Bibliographic Details
Main Authors:	Zhao, Xu; Liang, Xiaoqing; Zhao, Chaoyang; Tang, Ming; Wang, Jinqiao
Format:	Online Article Text
Language:	English
Published:	MDPI 2019
Subjects:	Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6539187/ https://www.ncbi.nlm.nih.gov/pubmed/31075955 http://dx.doi.org/10.3390/s19092158

_version_	1783422327019536384
author	Zhao, Xu Liang, Xiaoqing Zhao, Chaoyang Tang, Ming Wang, Jinqiao
author_facet	Zhao, Xu Liang, Xiaoqing Zhao, Chaoyang Tang, Ming Wang, Jinqiao
author_sort	Zhao, Xu
collection	PubMed
description	Face detection is the basic step in video face analysis and has been studied for many years. However, achieving real-time performance on embedded devices with limited computational resources remains an open challenge. To address this problem, we propose a face detector, EagleEye, which achieves a good trade-off between accuracy and speed on popular embedded devices with low computation power (e.g., the Raspberry Pi 3b+). EagleEye is designed to require few floating-point operations (FLOPs) while retaining enough capacity, and its accuracy is further improved without adding many FLOPs. Specifically, we design five strategies for building efficient face detectors with a good balance of accuracy and running speed. The first two strategies help to build a detector with low computational complexity and enough capacity: we use convolution factorization to replace traditional convolutions with sparser depth-wise convolutions, which saves computation, and we use successive downsampling convolutions at the beginning of the face detection network. The latter three strategies significantly improve the accuracy of the lightweight detector without adding much computation cost: we design an efficient context module that uses context information to benefit face detection, we adopt an information-preserving activation function to increase the network capacity, and we use focal loss to further improve accuracy by better handling the class imbalance problem. Experiments show that EagleEye outperforms other face detectors with the same order of computation cost in both runtime efficiency and accuracy.
format	Online Article Text
id	pubmed-6539187
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-6539187 2019-06-04 Real-Time Multi-Scale Face Detector on Embedded Devices Zhao, Xu Liang, Xiaoqing Zhao, Chaoyang Tang, Ming Wang, Jinqiao Sensors (Basel) Article MDPI 2019-05-09 /pmc/articles/PMC6539187/ /pubmed/31075955 http://dx.doi.org/10.3390/s19092158 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Zhao, Xu Liang, Xiaoqing Zhao, Chaoyang Tang, Ming Wang, Jinqiao Real-Time Multi-Scale Face Detector on Embedded Devices
title	Real-Time Multi-Scale Face Detector on Embedded Devices
title_full	Real-Time Multi-Scale Face Detector on Embedded Devices
title_fullStr	Real-Time Multi-Scale Face Detector on Embedded Devices
title_full_unstemmed	Real-Time Multi-Scale Face Detector on Embedded Devices
title_short	Real-Time Multi-Scale Face Detector on Embedded Devices
title_sort	real-time multi-scale face detector on embedded devices
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6539187/ https://www.ncbi.nlm.nih.gov/pubmed/31075955 http://dx.doi.org/10.3390/s19092158
work_keys_str_mv	AT zhaoxu realtimemultiscalefacedetectoronembeddeddevices AT liangxiaoqing realtimemultiscalefacedetectoronembeddeddevices AT zhaochaoyang realtimemultiscalefacedetectoronembeddeddevices AT tangming realtimemultiscalefacedetectoronembeddeddevices AT wangjinqiao realtimemultiscalefacedetectoronembeddeddevices
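
Illustrative note on two techniques named in the abstract (not taken from the article itself): the sketch below compares the multiply-accumulate count of a standard convolution with that of a depth-wise convolution followed by a 1 x 1 point-wise convolution, and evaluates the standard binary focal loss for a single prediction. The function names, the 80 x 80 x 64 layer size, and the defaults gamma = 2.0 and alpha = 0.25 are assumptions chosen for illustration; the record does not state the values EagleEye uses.

```python
import math


def conv_macs(h, w, c_in, c_out, k):
    """Multiply-accumulates of a standard k x k convolution producing an h x w map."""
    return h * w * c_in * c_out * k * k


def separable_conv_macs(h, w, c_in, c_out, k):
    """Multiply-accumulates of a depth-wise k x k convolution plus a 1 x 1 point-wise one."""
    depthwise = h * w * c_in * k * k   # one k x k filter per input channel
    pointwise = h * w * c_in * c_out   # 1 x 1 convolution mixes the channels
    return depthwise + pointwise


def focal_loss(p, y, gamma=2.0, alpha=0.25):
    """Binary focal loss for one predicted probability p in (0, 1) and label y in {0, 1}."""
    p_t = p if y == 1 else 1.0 - p              # probability assigned to the true class
    alpha_t = alpha if y == 1 else 1.0 - alpha  # class-balancing weight
    # The (1 - p_t) ** gamma factor down-weights examples that are already easy.
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)


if __name__ == "__main__":
    # Hypothetical layer: 80 x 80 output map, 64 -> 64 channels, 3 x 3 kernel.
    std = conv_macs(80, 80, 64, 64, 3)
    sep = separable_conv_macs(80, 80, 64, 64, 3)
    print(f"standard: {std}  factorized: {sep}  ratio: {sep / std:.3f}")
    # A confident correct prediction contributes far less loss than a hard one.
    print(focal_loss(0.9, 1), focal_loss(0.1, 1))
```

For these assumed sizes the factorized layer needs a fraction 1/c_out + 1/k^2 of the standard layer's operations, which is the kind of saving the abstract attributes to convolution factorization.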
