Cargando…

Adopting the YOLOv4 Architecture for Low-Latency Multispectral Pedestrian Detection in Autonomous Driving

Detecting pedestrians in autonomous driving is a safety-critical task, and the decision to avoid a a person has to be made with minimal latency. Multispectral approaches that combine RGB and thermal images are researched extensively, as they make it possible to gain robustness under varying illumina...

Descripción completa

Detalles Bibliográficos
Autores principales: Roszyk, Kamil, Nowicki, Michał R., Skrzypczyński, Piotr
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8837921/
https://www.ncbi.nlm.nih.gov/pubmed/35161827
http://dx.doi.org/10.3390/s22031082
_version_ 1784649997962182656
author Roszyk, Kamil
Nowicki, Michał R.
Skrzypczyński, Piotr
author_facet Roszyk, Kamil
Nowicki, Michał R.
Skrzypczyński, Piotr
author_sort Roszyk, Kamil
collection PubMed
description Detecting pedestrians in autonomous driving is a safety-critical task, and the decision to avoid a a person has to be made with minimal latency. Multispectral approaches that combine RGB and thermal images are researched extensively, as they make it possible to gain robustness under varying illumination and weather conditions. State-of-the-art solutions employing deep neural networks offer high accuracy of pedestrian detection. However, the literature is short of works that evaluate multispectral pedestrian detection with respect to its feasibility in obstacle avoidance scenarios, taking into account the motion of the vehicle. Therefore, we investigated the real-time neural network detector architecture You Only Look Once, the latest version (YOLOv4), and demonstrate that this detector can be adapted to multispectral pedestrian detection. It can achieve accuracy on par with the state-of-the-art while being highly computationally efficient, thereby supporting low-latency decision making. The results achieved on the KAIST dataset were evaluated from the perspective of automotive applications, where low latency and a low number of false negatives are critical parameters. The middle fusion approach to YOLOv4 in its Tiny variant achieved the best accuracy to computational efficiency trade-off among the evaluated architectures.
format Online
Article
Text
id pubmed-8837921
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-88379212022-02-13 Adopting the YOLOv4 Architecture for Low-Latency Multispectral Pedestrian Detection in Autonomous Driving Roszyk, Kamil Nowicki, Michał R. Skrzypczyński, Piotr Sensors (Basel) Article Detecting pedestrians in autonomous driving is a safety-critical task, and the decision to avoid a a person has to be made with minimal latency. Multispectral approaches that combine RGB and thermal images are researched extensively, as they make it possible to gain robustness under varying illumination and weather conditions. State-of-the-art solutions employing deep neural networks offer high accuracy of pedestrian detection. However, the literature is short of works that evaluate multispectral pedestrian detection with respect to its feasibility in obstacle avoidance scenarios, taking into account the motion of the vehicle. Therefore, we investigated the real-time neural network detector architecture You Only Look Once, the latest version (YOLOv4), and demonstrate that this detector can be adapted to multispectral pedestrian detection. It can achieve accuracy on par with the state-of-the-art while being highly computationally efficient, thereby supporting low-latency decision making. The results achieved on the KAIST dataset were evaluated from the perspective of automotive applications, where low latency and a low number of false negatives are critical parameters. The middle fusion approach to YOLOv4 in its Tiny variant achieved the best accuracy to computational efficiency trade-off among the evaluated architectures. MDPI 2022-01-30 /pmc/articles/PMC8837921/ /pubmed/35161827 http://dx.doi.org/10.3390/s22031082 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Roszyk, Kamil
Nowicki, Michał R.
Skrzypczyński, Piotr
Adopting the YOLOv4 Architecture for Low-Latency Multispectral Pedestrian Detection in Autonomous Driving
title Adopting the YOLOv4 Architecture for Low-Latency Multispectral Pedestrian Detection in Autonomous Driving
title_full Adopting the YOLOv4 Architecture for Low-Latency Multispectral Pedestrian Detection in Autonomous Driving
title_fullStr Adopting the YOLOv4 Architecture for Low-Latency Multispectral Pedestrian Detection in Autonomous Driving
title_full_unstemmed Adopting the YOLOv4 Architecture for Low-Latency Multispectral Pedestrian Detection in Autonomous Driving
title_short Adopting the YOLOv4 Architecture for Low-Latency Multispectral Pedestrian Detection in Autonomous Driving
title_sort adopting the yolov4 architecture for low-latency multispectral pedestrian detection in autonomous driving
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8837921/
https://www.ncbi.nlm.nih.gov/pubmed/35161827
http://dx.doi.org/10.3390/s22031082
work_keys_str_mv AT roszykkamil adoptingtheyolov4architectureforlowlatencymultispectralpedestriandetectioninautonomousdriving
AT nowickimichałr adoptingtheyolov4architectureforlowlatencymultispectralpedestriandetectioninautonomousdriving
AT skrzypczynskipiotr adoptingtheyolov4architectureforlowlatencymultispectralpedestriandetectioninautonomousdriving