Cargando…

PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud

3D object detection in LiDAR point clouds has been extensively used in autonomous driving, intelligent robotics, and augmented reality. Although the one-stage 3D detector has satisfactory training and inference speed, there are still some performance problems due to insufficient utilization of bird’...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Fangyu, Jin, Weizheng, Fan, Cien, Zou, Lian, Chen, Qingsheng, Li, Xiaopeng, Jiang, Hao, Liu, Yifeng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7796238/
https://www.ncbi.nlm.nih.gov/pubmed/33379254
http://dx.doi.org/10.3390/s21010136
_version_ 1783634634968399872
author Li, Fangyu
Jin, Weizheng
Fan, Cien
Zou, Lian
Chen, Qingsheng
Li, Xiaopeng
Jiang, Hao
Liu, Yifeng
author_facet Li, Fangyu
Jin, Weizheng
Fan, Cien
Zou, Lian
Chen, Qingsheng
Li, Xiaopeng
Jiang, Hao
Liu, Yifeng
author_sort Li, Fangyu
collection PubMed
description 3D object detection in LiDAR point clouds has been extensively used in autonomous driving, intelligent robotics, and augmented reality. Although the one-stage 3D detector has satisfactory training and inference speed, there are still some performance problems due to insufficient utilization of bird’s eye view (BEV) information. In this paper, a new backbone network is proposed to complete the cross-layer fusion of multi-scale BEV feature maps, which makes full use of various information for detection. Specifically, our proposed backbone network can be divided into a coarse branch and a fine branch. In the coarse branch, we use the pyramidal feature hierarchy (PFH) to generate multi-scale BEV feature maps, which retain the advantages of different levels and serves as the input of the fine branch. In the fine branch, our proposed pyramid splitting and aggregation (PSA) module deeply integrates different levels of multi-scale feature maps, thereby improving the expressive ability of the final features. Extensive experiments on the challenging KITTI-3D benchmark show that our method has better performance in both 3D and BEV object detection compared with some previous state-of-the-art methods. Experimental results with average precision (AP) prove the effectiveness of our network.
format Online
Article
Text
id pubmed-7796238
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-77962382021-01-10 PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud Li, Fangyu Jin, Weizheng Fan, Cien Zou, Lian Chen, Qingsheng Li, Xiaopeng Jiang, Hao Liu, Yifeng Sensors (Basel) Article 3D object detection in LiDAR point clouds has been extensively used in autonomous driving, intelligent robotics, and augmented reality. Although the one-stage 3D detector has satisfactory training and inference speed, there are still some performance problems due to insufficient utilization of bird’s eye view (BEV) information. In this paper, a new backbone network is proposed to complete the cross-layer fusion of multi-scale BEV feature maps, which makes full use of various information for detection. Specifically, our proposed backbone network can be divided into a coarse branch and a fine branch. In the coarse branch, we use the pyramidal feature hierarchy (PFH) to generate multi-scale BEV feature maps, which retain the advantages of different levels and serves as the input of the fine branch. In the fine branch, our proposed pyramid splitting and aggregation (PSA) module deeply integrates different levels of multi-scale feature maps, thereby improving the expressive ability of the final features. Extensive experiments on the challenging KITTI-3D benchmark show that our method has better performance in both 3D and BEV object detection compared with some previous state-of-the-art methods. Experimental results with average precision (AP) prove the effectiveness of our network. MDPI 2020-12-28 /pmc/articles/PMC7796238/ /pubmed/33379254 http://dx.doi.org/10.3390/s21010136 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, Fangyu
Jin, Weizheng
Fan, Cien
Zou, Lian
Chen, Qingsheng
Li, Xiaopeng
Jiang, Hao
Liu, Yifeng
PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud
title PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud
title_full PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud
title_fullStr PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud
title_full_unstemmed PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud
title_short PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud
title_sort psanet: pyramid splitting and aggregation network for 3d object detection in point cloud
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7796238/
https://www.ncbi.nlm.nih.gov/pubmed/33379254
http://dx.doi.org/10.3390/s21010136
work_keys_str_mv AT lifangyu psanetpyramidsplittingandaggregationnetworkfor3dobjectdetectioninpointcloud
AT jinweizheng psanetpyramidsplittingandaggregationnetworkfor3dobjectdetectioninpointcloud
AT fancien psanetpyramidsplittingandaggregationnetworkfor3dobjectdetectioninpointcloud
AT zoulian psanetpyramidsplittingandaggregationnetworkfor3dobjectdetectioninpointcloud
AT chenqingsheng psanetpyramidsplittingandaggregationnetworkfor3dobjectdetectioninpointcloud
AT lixiaopeng psanetpyramidsplittingandaggregationnetworkfor3dobjectdetectioninpointcloud
AT jianghao psanetpyramidsplittingandaggregationnetworkfor3dobjectdetectioninpointcloud
AT liuyifeng psanetpyramidsplittingandaggregationnetworkfor3dobjectdetectioninpointcloud