Cargando…

Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility

For smart mobility, autonomous vehicles, and advanced driver-assistance systems (ADASs), perception of the environment is an important task in scene analysis and understanding. Better perception of the environment allows for enhanced decision making, which, in turn, enables very high-precision actio...

Descripción completa

Detalles Bibliográficos
Autores principales: Mauri, Antoine, Khemmar, Redouane, Decoux, Benoit, Haddad, Madjid, Boutteau, Rémi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8404932/
https://www.ncbi.nlm.nih.gov/pubmed/34460781
http://dx.doi.org/10.3390/jimaging7080145
_version_ 1783746236688367616
author Mauri, Antoine
Khemmar, Redouane
Decoux, Benoit
Haddad, Madjid
Boutteau, Rémi
author_facet Mauri, Antoine
Khemmar, Redouane
Decoux, Benoit
Haddad, Madjid
Boutteau, Rémi
author_sort Mauri, Antoine
collection PubMed
description For smart mobility, autonomous vehicles, and advanced driver-assistance systems (ADASs), perception of the environment is an important task in scene analysis and understanding. Better perception of the environment allows for enhanced decision making, which, in turn, enables very high-precision actions. To this end, we introduce in this work a new real-time deep learning approach for 3D multi-object detection for smart mobility not only on roads, but also on railways. To obtain the 3D bounding boxes of the objects, we modified a proven real-time 2D detector, YOLOv3, to predict 3D object localization, object dimensions, and object orientation. Our method has been evaluated on KITTI’s road dataset as well as on our own hybrid virtual road/rail dataset acquired from the video game Grand Theft Auto (GTA) V. The evaluation of our method on these two datasets shows good accuracy, but more importantly that it can be used in real-time conditions, in road and rail traffic environments. Through our experimental results, we also show the importance of the accuracy of prediction of the regions of interest (RoIs) used in the estimation of 3D bounding box parameters.
format Online
Article
Text
id pubmed-8404932
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-84049322021-10-28 Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility Mauri, Antoine Khemmar, Redouane Decoux, Benoit Haddad, Madjid Boutteau, Rémi J Imaging Article For smart mobility, autonomous vehicles, and advanced driver-assistance systems (ADASs), perception of the environment is an important task in scene analysis and understanding. Better perception of the environment allows for enhanced decision making, which, in turn, enables very high-precision actions. To this end, we introduce in this work a new real-time deep learning approach for 3D multi-object detection for smart mobility not only on roads, but also on railways. To obtain the 3D bounding boxes of the objects, we modified a proven real-time 2D detector, YOLOv3, to predict 3D object localization, object dimensions, and object orientation. Our method has been evaluated on KITTI’s road dataset as well as on our own hybrid virtual road/rail dataset acquired from the video game Grand Theft Auto (GTA) V. The evaluation of our method on these two datasets shows good accuracy, but more importantly that it can be used in real-time conditions, in road and rail traffic environments. Through our experimental results, we also show the importance of the accuracy of prediction of the regions of interest (RoIs) used in the estimation of 3D bounding box parameters. MDPI 2021-08-12 /pmc/articles/PMC8404932/ /pubmed/34460781 http://dx.doi.org/10.3390/jimaging7080145 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Mauri, Antoine
Khemmar, Redouane
Decoux, Benoit
Haddad, Madjid
Boutteau, Rémi
Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility
title Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility
title_full Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility
title_fullStr Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility
title_full_unstemmed Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility
title_short Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility
title_sort real-time 3d multi-object detection and localization based on deep learning for road and railway smart mobility
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8404932/
https://www.ncbi.nlm.nih.gov/pubmed/34460781
http://dx.doi.org/10.3390/jimaging7080145
work_keys_str_mv AT mauriantoine realtime3dmultiobjectdetectionandlocalizationbasedondeeplearningforroadandrailwaysmartmobility
AT khemmarredouane realtime3dmultiobjectdetectionandlocalizationbasedondeeplearningforroadandrailwaysmartmobility
AT decouxbenoit realtime3dmultiobjectdetectionandlocalizationbasedondeeplearningforroadandrailwaysmartmobility
AT haddadmadjid realtime3dmultiobjectdetectionandlocalizationbasedondeeplearningforroadandrailwaysmartmobility
AT boutteauremi realtime3dmultiobjectdetectionandlocalizationbasedondeeplearningforroadandrailwaysmartmobility