Cargando…
Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility
For smart mobility, autonomous vehicles, and advanced driver-assistance systems (ADASs), perception of the environment is an important task in scene analysis and understanding. Better perception of the environment allows for enhanced decision making, which, in turn, enables very high-precision actio...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8404932/ https://www.ncbi.nlm.nih.gov/pubmed/34460781 http://dx.doi.org/10.3390/jimaging7080145 |
_version_ | 1783746236688367616 |
---|---|
author | Mauri, Antoine Khemmar, Redouane Decoux, Benoit Haddad, Madjid Boutteau, Rémi |
author_facet | Mauri, Antoine Khemmar, Redouane Decoux, Benoit Haddad, Madjid Boutteau, Rémi |
author_sort | Mauri, Antoine |
collection | PubMed |
description | For smart mobility, autonomous vehicles, and advanced driver-assistance systems (ADASs), perception of the environment is an important task in scene analysis and understanding. Better perception of the environment allows for enhanced decision making, which, in turn, enables very high-precision actions. To this end, we introduce in this work a new real-time deep learning approach for 3D multi-object detection for smart mobility not only on roads, but also on railways. To obtain the 3D bounding boxes of the objects, we modified a proven real-time 2D detector, YOLOv3, to predict 3D object localization, object dimensions, and object orientation. Our method has been evaluated on KITTI’s road dataset as well as on our own hybrid virtual road/rail dataset acquired from the video game Grand Theft Auto (GTA) V. The evaluation of our method on these two datasets shows good accuracy, but more importantly that it can be used in real-time conditions, in road and rail traffic environments. Through our experimental results, we also show the importance of the accuracy of prediction of the regions of interest (RoIs) used in the estimation of 3D bounding box parameters. |
format | Online Article Text |
id | pubmed-8404932 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-84049322021-10-28 Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility Mauri, Antoine Khemmar, Redouane Decoux, Benoit Haddad, Madjid Boutteau, Rémi J Imaging Article For smart mobility, autonomous vehicles, and advanced driver-assistance systems (ADASs), perception of the environment is an important task in scene analysis and understanding. Better perception of the environment allows for enhanced decision making, which, in turn, enables very high-precision actions. To this end, we introduce in this work a new real-time deep learning approach for 3D multi-object detection for smart mobility not only on roads, but also on railways. To obtain the 3D bounding boxes of the objects, we modified a proven real-time 2D detector, YOLOv3, to predict 3D object localization, object dimensions, and object orientation. Our method has been evaluated on KITTI’s road dataset as well as on our own hybrid virtual road/rail dataset acquired from the video game Grand Theft Auto (GTA) V. The evaluation of our method on these two datasets shows good accuracy, but more importantly that it can be used in real-time conditions, in road and rail traffic environments. Through our experimental results, we also show the importance of the accuracy of prediction of the regions of interest (RoIs) used in the estimation of 3D bounding box parameters. MDPI 2021-08-12 /pmc/articles/PMC8404932/ /pubmed/34460781 http://dx.doi.org/10.3390/jimaging7080145 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Mauri, Antoine Khemmar, Redouane Decoux, Benoit Haddad, Madjid Boutteau, Rémi Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility |
title | Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility |
title_full | Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility |
title_fullStr | Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility |
title_full_unstemmed | Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility |
title_short | Real-Time 3D Multi-Object Detection and Localization Based on Deep Learning for Road and Railway Smart Mobility |
title_sort | real-time 3d multi-object detection and localization based on deep learning for road and railway smart mobility |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8404932/ https://www.ncbi.nlm.nih.gov/pubmed/34460781 http://dx.doi.org/10.3390/jimaging7080145 |
work_keys_str_mv | AT mauriantoine realtime3dmultiobjectdetectionandlocalizationbasedondeeplearningforroadandrailwaysmartmobility AT khemmarredouane realtime3dmultiobjectdetectionandlocalizationbasedondeeplearningforroadandrailwaysmartmobility AT decouxbenoit realtime3dmultiobjectdetectionandlocalizationbasedondeeplearningforroadandrailwaysmartmobility AT haddadmadjid realtime3dmultiobjectdetectionandlocalizationbasedondeeplearningforroadandrailwaysmartmobility AT boutteauremi realtime3dmultiobjectdetectionandlocalizationbasedondeeplearningforroadandrailwaysmartmobility |