Cargando…

High-Profile VRU Detection on Resource-Constrained Hardware Using YOLOv3/v4 on BDD100K

Vulnerable Road User (VRU) detection is a major application of object detection with the aim of helping reduce accidents in advanced driver-assistance systems and enabling the development of autonomous vehicles. Due to intrinsic complexity present in computer vision and to limitations in processing...

Descripción completa

Detalles Bibliográficos
Autores principales: Ortiz Castelló, Vicent, Salvador Igual, Ismael, del Tejo Catalá, Omar, Perez-Cortes, Juan-Carlos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321163/
https://www.ncbi.nlm.nih.gov/pubmed/34460539
http://dx.doi.org/10.3390/jimaging6120142
_version_ 1783730785667252224
author Ortiz Castelló, Vicent
Salvador Igual, Ismael
del Tejo Catalá, Omar
Perez-Cortes, Juan-Carlos
author_facet Ortiz Castelló, Vicent
Salvador Igual, Ismael
del Tejo Catalá, Omar
Perez-Cortes, Juan-Carlos
author_sort Ortiz Castelló, Vicent
collection PubMed
description Vulnerable Road User (VRU) detection is a major application of object detection with the aim of helping reduce accidents in advanced driver-assistance systems and enabling the development of autonomous vehicles. Due to intrinsic complexity present in computer vision and to limitations in processing capacity and bandwidth, this task has not been completely solved nowadays. For these reasons, the well established YOLOv3 net and the new YOLOv4 one are assessed by training them on a huge, recent on-road image dataset (BDD100K), both for VRU and full on-road classes, with a great improvement in terms of detection quality when compared to their MS-COCO-trained generic correspondent models from the authors but with negligible costs in forward pass time. Additionally, some models were retrained when replacing the original Leaky ReLU convolutional activation functions from original YOLO implementation with two cutting-edge activation functions: the self-regularized non-monotonic function (MISH) and its self-gated counterpart (SWISH), with significant improvements with respect to the original activation function detection performance. Additionally, some trials were carried out including recent data augmentation techniques (mosaic and cutmix) and some grid size configurations, with cumulative improvements over the previous results, comprising different performance-throughput trade-offs.
format Online
Article
Text
id pubmed-8321163
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83211632021-08-26 High-Profile VRU Detection on Resource-Constrained Hardware Using YOLOv3/v4 on BDD100K Ortiz Castelló, Vicent Salvador Igual, Ismael del Tejo Catalá, Omar Perez-Cortes, Juan-Carlos J Imaging Article Vulnerable Road User (VRU) detection is a major application of object detection with the aim of helping reduce accidents in advanced driver-assistance systems and enabling the development of autonomous vehicles. Due to intrinsic complexity present in computer vision and to limitations in processing capacity and bandwidth, this task has not been completely solved nowadays. For these reasons, the well established YOLOv3 net and the new YOLOv4 one are assessed by training them on a huge, recent on-road image dataset (BDD100K), both for VRU and full on-road classes, with a great improvement in terms of detection quality when compared to their MS-COCO-trained generic correspondent models from the authors but with negligible costs in forward pass time. Additionally, some models were retrained when replacing the original Leaky ReLU convolutional activation functions from original YOLO implementation with two cutting-edge activation functions: the self-regularized non-monotonic function (MISH) and its self-gated counterpart (SWISH), with significant improvements with respect to the original activation function detection performance. Additionally, some trials were carried out including recent data augmentation techniques (mosaic and cutmix) and some grid size configurations, with cumulative improvements over the previous results, comprising different performance-throughput trade-offs. MDPI 2020-12-19 /pmc/articles/PMC8321163/ /pubmed/34460539 http://dx.doi.org/10.3390/jimaging6120142 Text en © 2020 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ).
spellingShingle Article
Ortiz Castelló, Vicent
Salvador Igual, Ismael
del Tejo Catalá, Omar
Perez-Cortes, Juan-Carlos
High-Profile VRU Detection on Resource-Constrained Hardware Using YOLOv3/v4 on BDD100K
title High-Profile VRU Detection on Resource-Constrained Hardware Using YOLOv3/v4 on BDD100K
title_full High-Profile VRU Detection on Resource-Constrained Hardware Using YOLOv3/v4 on BDD100K
title_fullStr High-Profile VRU Detection on Resource-Constrained Hardware Using YOLOv3/v4 on BDD100K
title_full_unstemmed High-Profile VRU Detection on Resource-Constrained Hardware Using YOLOv3/v4 on BDD100K
title_short High-Profile VRU Detection on Resource-Constrained Hardware Using YOLOv3/v4 on BDD100K
title_sort high-profile vru detection on resource-constrained hardware using yolov3/v4 on bdd100k
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321163/
https://www.ncbi.nlm.nih.gov/pubmed/34460539
http://dx.doi.org/10.3390/jimaging6120142
work_keys_str_mv AT ortizcastellovicent highprofilevrudetectiononresourceconstrainedhardwareusingyolov3v4onbdd100k
AT salvadorigualismael highprofilevrudetectiononresourceconstrainedhardwareusingyolov3v4onbdd100k
AT deltejocatalaomar highprofilevrudetectiononresourceconstrainedhardwareusingyolov3v4onbdd100k
AT perezcortesjuancarlos highprofilevrudetectiononresourceconstrainedhardwareusingyolov3v4onbdd100k