Cargando…
PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud
3D object detection in LiDAR point clouds has been extensively used in autonomous driving, intelligent robotics, and augmented reality. Although the one-stage 3D detector has satisfactory training and inference speed, there are still some performance problems due to insufficient utilization of bird’...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7796238/ https://www.ncbi.nlm.nih.gov/pubmed/33379254 http://dx.doi.org/10.3390/s21010136 |
_version_ | 1783634634968399872 |
---|---|
author | Li, Fangyu Jin, Weizheng Fan, Cien Zou, Lian Chen, Qingsheng Li, Xiaopeng Jiang, Hao Liu, Yifeng |
author_facet | Li, Fangyu Jin, Weizheng Fan, Cien Zou, Lian Chen, Qingsheng Li, Xiaopeng Jiang, Hao Liu, Yifeng |
author_sort | Li, Fangyu |
collection | PubMed |
description | 3D object detection in LiDAR point clouds has been extensively used in autonomous driving, intelligent robotics, and augmented reality. Although the one-stage 3D detector has satisfactory training and inference speed, there are still some performance problems due to insufficient utilization of bird’s eye view (BEV) information. In this paper, a new backbone network is proposed to complete the cross-layer fusion of multi-scale BEV feature maps, which makes full use of various information for detection. Specifically, our proposed backbone network can be divided into a coarse branch and a fine branch. In the coarse branch, we use the pyramidal feature hierarchy (PFH) to generate multi-scale BEV feature maps, which retain the advantages of different levels and serves as the input of the fine branch. In the fine branch, our proposed pyramid splitting and aggregation (PSA) module deeply integrates different levels of multi-scale feature maps, thereby improving the expressive ability of the final features. Extensive experiments on the challenging KITTI-3D benchmark show that our method has better performance in both 3D and BEV object detection compared with some previous state-of-the-art methods. Experimental results with average precision (AP) prove the effectiveness of our network. |
format | Online Article Text |
id | pubmed-7796238 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-77962382021-01-10 PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud Li, Fangyu Jin, Weizheng Fan, Cien Zou, Lian Chen, Qingsheng Li, Xiaopeng Jiang, Hao Liu, Yifeng Sensors (Basel) Article 3D object detection in LiDAR point clouds has been extensively used in autonomous driving, intelligent robotics, and augmented reality. Although the one-stage 3D detector has satisfactory training and inference speed, there are still some performance problems due to insufficient utilization of bird’s eye view (BEV) information. In this paper, a new backbone network is proposed to complete the cross-layer fusion of multi-scale BEV feature maps, which makes full use of various information for detection. Specifically, our proposed backbone network can be divided into a coarse branch and a fine branch. In the coarse branch, we use the pyramidal feature hierarchy (PFH) to generate multi-scale BEV feature maps, which retain the advantages of different levels and serves as the input of the fine branch. In the fine branch, our proposed pyramid splitting and aggregation (PSA) module deeply integrates different levels of multi-scale feature maps, thereby improving the expressive ability of the final features. Extensive experiments on the challenging KITTI-3D benchmark show that our method has better performance in both 3D and BEV object detection compared with some previous state-of-the-art methods. Experimental results with average precision (AP) prove the effectiveness of our network. MDPI 2020-12-28 /pmc/articles/PMC7796238/ /pubmed/33379254 http://dx.doi.org/10.3390/s21010136 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Li, Fangyu Jin, Weizheng Fan, Cien Zou, Lian Chen, Qingsheng Li, Xiaopeng Jiang, Hao Liu, Yifeng PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud |
title | PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud |
title_full | PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud |
title_fullStr | PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud |
title_full_unstemmed | PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud |
title_short | PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud |
title_sort | psanet: pyramid splitting and aggregation network for 3d object detection in point cloud |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7796238/ https://www.ncbi.nlm.nih.gov/pubmed/33379254 http://dx.doi.org/10.3390/s21010136 |
work_keys_str_mv | AT lifangyu psanetpyramidsplittingandaggregationnetworkfor3dobjectdetectioninpointcloud AT jinweizheng psanetpyramidsplittingandaggregationnetworkfor3dobjectdetectioninpointcloud AT fancien psanetpyramidsplittingandaggregationnetworkfor3dobjectdetectioninpointcloud AT zoulian psanetpyramidsplittingandaggregationnetworkfor3dobjectdetectioninpointcloud AT chenqingsheng psanetpyramidsplittingandaggregationnetworkfor3dobjectdetectioninpointcloud AT lixiaopeng psanetpyramidsplittingandaggregationnetworkfor3dobjectdetectioninpointcloud AT jianghao psanetpyramidsplittingandaggregationnetworkfor3dobjectdetectioninpointcloud AT liuyifeng psanetpyramidsplittingandaggregationnetworkfor3dobjectdetectioninpointcloud |