Cargando…

Real-Time 3D Object Detection on Crowded Pedestrians

In the field of autonomous driving, object detection under point clouds is indispensable for environmental perception. In order to achieve the goal of reducing blind spots in perception, many autonomous driving schemes have added low-cost blind-filling LiDAR on the side of the vehicle. Unlike point...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Bin, Li, Qing, Liang, Yanju
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10650522/
https://www.ncbi.nlm.nih.gov/pubmed/37960425
http://dx.doi.org/10.3390/s23218725
_version_ 1785135799568695296
author Lu, Bin
Li, Qing
Liang, Yanju
author_facet Lu, Bin
Li, Qing
Liang, Yanju
author_sort Lu, Bin
collection PubMed
description In the field of autonomous driving, object detection under point clouds is indispensable for environmental perception. In order to achieve the goal of reducing blind spots in perception, many autonomous driving schemes have added low-cost blind-filling LiDAR on the side of the vehicle. Unlike point cloud target detection based on high-performance LiDAR, the blind-filling LiDARs have low vertical angular resolution and are mounted on the side of the vehicle, resulting in easily mixed point clouds of pedestrian targets in close proximity to each other. These characteristics are harmful for target detection. Currently, many research works focus on target detection under high-density LiDAR. These methods cannot effectively deal with the high sparsity of the point clouds, and the recall and detection accuracy of crowded pedestrian targets tend to be low. To overcome these problems, we propose a real-time detection model for crowded pedestrian targets, namely RTCP. To improve computational efficiency, we utilize an attention-based point sampling method to reduce the redundancy of the point clouds, then we obtain new feature tensors by the quantization of the point cloud space and neighborhood fusion in polar coordinates. In order to make it easier for the model to focus on the center position of the target, we propose an object alignment attention module (OAA) for position alignment, and we utilize an additional branch of the targets’ location occupied heatmap to guide the training of the OAA module. These methods improve the model’s robustness against the occlusion of crowded pedestrian targets. Finally, we evaluate the detector on KITTI, JRDB, and our own blind-filling LiDAR dataset, and our algorithm achieved the best trade-off of detection accuracy against runtime efficiency.
format Online
Article
Text
id pubmed-10650522
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106505222023-10-26 Real-Time 3D Object Detection on Crowded Pedestrians Lu, Bin Li, Qing Liang, Yanju Sensors (Basel) Article In the field of autonomous driving, object detection under point clouds is indispensable for environmental perception. In order to achieve the goal of reducing blind spots in perception, many autonomous driving schemes have added low-cost blind-filling LiDAR on the side of the vehicle. Unlike point cloud target detection based on high-performance LiDAR, the blind-filling LiDARs have low vertical angular resolution and are mounted on the side of the vehicle, resulting in easily mixed point clouds of pedestrian targets in close proximity to each other. These characteristics are harmful for target detection. Currently, many research works focus on target detection under high-density LiDAR. These methods cannot effectively deal with the high sparsity of the point clouds, and the recall and detection accuracy of crowded pedestrian targets tend to be low. To overcome these problems, we propose a real-time detection model for crowded pedestrian targets, namely RTCP. To improve computational efficiency, we utilize an attention-based point sampling method to reduce the redundancy of the point clouds, then we obtain new feature tensors by the quantization of the point cloud space and neighborhood fusion in polar coordinates. In order to make it easier for the model to focus on the center position of the target, we propose an object alignment attention module (OAA) for position alignment, and we utilize an additional branch of the targets’ location occupied heatmap to guide the training of the OAA module. These methods improve the model’s robustness against the occlusion of crowded pedestrian targets. Finally, we evaluate the detector on KITTI, JRDB, and our own blind-filling LiDAR dataset, and our algorithm achieved the best trade-off of detection accuracy against runtime efficiency. MDPI 2023-10-26 /pmc/articles/PMC10650522/ /pubmed/37960425 http://dx.doi.org/10.3390/s23218725 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lu, Bin
Li, Qing
Liang, Yanju
Real-Time 3D Object Detection on Crowded Pedestrians
title Real-Time 3D Object Detection on Crowded Pedestrians
title_full Real-Time 3D Object Detection on Crowded Pedestrians
title_fullStr Real-Time 3D Object Detection on Crowded Pedestrians
title_full_unstemmed Real-Time 3D Object Detection on Crowded Pedestrians
title_short Real-Time 3D Object Detection on Crowded Pedestrians
title_sort real-time 3d object detection on crowded pedestrians
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10650522/
https://www.ncbi.nlm.nih.gov/pubmed/37960425
http://dx.doi.org/10.3390/s23218725
work_keys_str_mv AT lubin realtime3dobjectdetectiononcrowdedpedestrians
AT liqing realtime3dobjectdetectiononcrowdedpedestrians
AT liangyanju realtime3dobjectdetectiononcrowdedpedestrians