Cargando…
High-Profile VRU Detection on Resource-Constrained Hardware Using YOLOv3/v4 on BDD100K
Vulnerable Road User (VRU) detection is a major application of object detection with the aim of helping reduce accidents in advanced driver-assistance systems and enabling the development of autonomous vehicles. Due to intrinsic complexity present in computer vision and to limitations in processing...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321163/ https://www.ncbi.nlm.nih.gov/pubmed/34460539 http://dx.doi.org/10.3390/jimaging6120142 |
_version_ | 1783730785667252224 |
---|---|
author | Ortiz Castelló, Vicent Salvador Igual, Ismael del Tejo Catalá, Omar Perez-Cortes, Juan-Carlos |
author_facet | Ortiz Castelló, Vicent Salvador Igual, Ismael del Tejo Catalá, Omar Perez-Cortes, Juan-Carlos |
author_sort | Ortiz Castelló, Vicent |
collection | PubMed |
description | Vulnerable Road User (VRU) detection is a major application of object detection with the aim of helping reduce accidents in advanced driver-assistance systems and enabling the development of autonomous vehicles. Due to intrinsic complexity present in computer vision and to limitations in processing capacity and bandwidth, this task has not been completely solved nowadays. For these reasons, the well established YOLOv3 net and the new YOLOv4 one are assessed by training them on a huge, recent on-road image dataset (BDD100K), both for VRU and full on-road classes, with a great improvement in terms of detection quality when compared to their MS-COCO-trained generic correspondent models from the authors but with negligible costs in forward pass time. Additionally, some models were retrained when replacing the original Leaky ReLU convolutional activation functions from original YOLO implementation with two cutting-edge activation functions: the self-regularized non-monotonic function (MISH) and its self-gated counterpart (SWISH), with significant improvements with respect to the original activation function detection performance. Additionally, some trials were carried out including recent data augmentation techniques (mosaic and cutmix) and some grid size configurations, with cumulative improvements over the previous results, comprising different performance-throughput trade-offs. |
format | Online Article Text |
id | pubmed-8321163 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-83211632021-08-26 High-Profile VRU Detection on Resource-Constrained Hardware Using YOLOv3/v4 on BDD100K Ortiz Castelló, Vicent Salvador Igual, Ismael del Tejo Catalá, Omar Perez-Cortes, Juan-Carlos J Imaging Article Vulnerable Road User (VRU) detection is a major application of object detection with the aim of helping reduce accidents in advanced driver-assistance systems and enabling the development of autonomous vehicles. Due to intrinsic complexity present in computer vision and to limitations in processing capacity and bandwidth, this task has not been completely solved nowadays. For these reasons, the well established YOLOv3 net and the new YOLOv4 one are assessed by training them on a huge, recent on-road image dataset (BDD100K), both for VRU and full on-road classes, with a great improvement in terms of detection quality when compared to their MS-COCO-trained generic correspondent models from the authors but with negligible costs in forward pass time. Additionally, some models were retrained when replacing the original Leaky ReLU convolutional activation functions from original YOLO implementation with two cutting-edge activation functions: the self-regularized non-monotonic function (MISH) and its self-gated counterpart (SWISH), with significant improvements with respect to the original activation function detection performance. Additionally, some trials were carried out including recent data augmentation techniques (mosaic and cutmix) and some grid size configurations, with cumulative improvements over the previous results, comprising different performance-throughput trade-offs. MDPI 2020-12-19 /pmc/articles/PMC8321163/ /pubmed/34460539 http://dx.doi.org/10.3390/jimaging6120142 Text en © 2020 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ). |
spellingShingle | Article Ortiz Castelló, Vicent Salvador Igual, Ismael del Tejo Catalá, Omar Perez-Cortes, Juan-Carlos High-Profile VRU Detection on Resource-Constrained Hardware Using YOLOv3/v4 on BDD100K |
title | High-Profile VRU Detection on Resource-Constrained Hardware Using YOLOv3/v4 on BDD100K |
title_full | High-Profile VRU Detection on Resource-Constrained Hardware Using YOLOv3/v4 on BDD100K |
title_fullStr | High-Profile VRU Detection on Resource-Constrained Hardware Using YOLOv3/v4 on BDD100K |
title_full_unstemmed | High-Profile VRU Detection on Resource-Constrained Hardware Using YOLOv3/v4 on BDD100K |
title_short | High-Profile VRU Detection on Resource-Constrained Hardware Using YOLOv3/v4 on BDD100K |
title_sort | high-profile vru detection on resource-constrained hardware using yolov3/v4 on bdd100k |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321163/ https://www.ncbi.nlm.nih.gov/pubmed/34460539 http://dx.doi.org/10.3390/jimaging6120142 |
work_keys_str_mv | AT ortizcastellovicent highprofilevrudetectiononresourceconstrainedhardwareusingyolov3v4onbdd100k AT salvadorigualismael highprofilevrudetectiononresourceconstrainedhardwareusingyolov3v4onbdd100k AT deltejocatalaomar highprofilevrudetectiononresourceconstrainedhardwareusingyolov3v4onbdd100k AT perezcortesjuancarlos highprofilevrudetectiononresourceconstrainedhardwareusingyolov3v4onbdd100k |