Cargando…
Real-Time Multi-Scale Face Detector on Embedded Devices
Face detection is the basic step in video face analysis and has been studied for many years. However, achieving real-time performance on computation-resource-limited embedded devices still remains an open challenge. To address this problem, in this paper we propose a face detector, EagleEye, which s...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6539187/ https://www.ncbi.nlm.nih.gov/pubmed/31075955 http://dx.doi.org/10.3390/s19092158 |
_version_ | 1783422327019536384 |
---|---|
author | Zhao, Xu Liang, Xiaoqing Zhao, Chaoyang Tang, Ming Wang, Jinqiao |
author_facet | Zhao, Xu Liang, Xiaoqing Zhao, Chaoyang Tang, Ming Wang, Jinqiao |
author_sort | Zhao, Xu |
collection | PubMed |
description | Face detection is the basic step in video face analysis and has been studied for many years. However, achieving real-time performance on computation-resource-limited embedded devices still remains an open challenge. To address this problem, in this paper we propose a face detector, EagleEye, which shows a good trade-off between high accuracy and fast speed on the popular embedded device with low computation power (e.g., the Raspberry Pi 3b+). The EagleEye is designed to have low floating-point operations per second (FLOPS) as well as enough capacity, and its accuracy is further improved without adding too much FLOPS. Specifically, we design five strategies for building efficient face detectors with a good balance of accuracy and running speed. The first two strategies help to build a detector with low computation complexity and enough capacity. We use convolution factorization to change traditional convolutions into more sparse depth-wise convolutions to save computation costs and we use successive downsampling convolutions at the beginning of the face detection network. The latter three strategies significantly improve the accuracy of the light-weight detector without adding too much computation costs. We design an efficient context module to utilize context information to benefit the face detection. We also adopt information preserving activation function to increase the network capacity. Finally, we use focal loss to further improve the accuracy by handling the class imbalance problem better. Experiments show that the EagleEye outperforms the other face detectors with the same order of computation costs, on both runtime efficiency and accuracy. |
format | Online Article Text |
id | pubmed-6539187 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-65391872019-06-04 Real-Time Multi-Scale Face Detector on Embedded Devices Zhao, Xu Liang, Xiaoqing Zhao, Chaoyang Tang, Ming Wang, Jinqiao Sensors (Basel) Article Face detection is the basic step in video face analysis and has been studied for many years. However, achieving real-time performance on computation-resource-limited embedded devices still remains an open challenge. To address this problem, in this paper we propose a face detector, EagleEye, which shows a good trade-off between high accuracy and fast speed on the popular embedded device with low computation power (e.g., the Raspberry Pi 3b+). The EagleEye is designed to have low floating-point operations per second (FLOPS) as well as enough capacity, and its accuracy is further improved without adding too much FLOPS. Specifically, we design five strategies for building efficient face detectors with a good balance of accuracy and running speed. The first two strategies help to build a detector with low computation complexity and enough capacity. We use convolution factorization to change traditional convolutions into more sparse depth-wise convolutions to save computation costs and we use successive downsampling convolutions at the beginning of the face detection network. The latter three strategies significantly improve the accuracy of the light-weight detector without adding too much computation costs. We design an efficient context module to utilize context information to benefit the face detection. We also adopt information preserving activation function to increase the network capacity. Finally, we use focal loss to further improve the accuracy by handling the class imbalance problem better. Experiments show that the EagleEye outperforms the other face detectors with the same order of computation costs, on both runtime efficiency and accuracy. MDPI 2019-05-09 /pmc/articles/PMC6539187/ /pubmed/31075955 http://dx.doi.org/10.3390/s19092158 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Zhao, Xu Liang, Xiaoqing Zhao, Chaoyang Tang, Ming Wang, Jinqiao Real-Time Multi-Scale Face Detector on Embedded Devices |
title | Real-Time Multi-Scale Face Detector on Embedded Devices |
title_full | Real-Time Multi-Scale Face Detector on Embedded Devices |
title_fullStr | Real-Time Multi-Scale Face Detector on Embedded Devices |
title_full_unstemmed | Real-Time Multi-Scale Face Detector on Embedded Devices |
title_short | Real-Time Multi-Scale Face Detector on Embedded Devices |
title_sort | real-time multi-scale face detector on embedded devices |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6539187/ https://www.ncbi.nlm.nih.gov/pubmed/31075955 http://dx.doi.org/10.3390/s19092158 |
work_keys_str_mv | AT zhaoxu realtimemultiscalefacedetectoronembeddeddevices AT liangxiaoqing realtimemultiscalefacedetectoronembeddeddevices AT zhaochaoyang realtimemultiscalefacedetectoronembeddeddevices AT tangming realtimemultiscalefacedetectoronembeddeddevices AT wangjinqiao realtimemultiscalefacedetectoronembeddeddevices |