Cargando…
Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection
Continuous frames of point-cloud-based object detection is a new research direction. Currently, most research studies fuse multi-frame point clouds using concatenation-based methods. The method aligns different frames by using information on GPS, IMU, etc. However, this fusion method can only align...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570913/ https://www.ncbi.nlm.nih.gov/pubmed/36236572 http://dx.doi.org/10.3390/s22197473 |
_version_ | 1784810229641248768 |
---|---|
author | Zhai, Zhenyu Wang, Qiantong Pan, Zongxu Gao, Zhentong Hu, Wenlong |
author_facet | Zhai, Zhenyu Wang, Qiantong Pan, Zongxu Gao, Zhentong Hu, Wenlong |
author_sort | Zhai, Zhenyu |
collection | PubMed |
description | Continuous frames of point-cloud-based object detection is a new research direction. Currently, most research studies fuse multi-frame point clouds using concatenation-based methods. The method aligns different frames by using information on GPS, IMU, etc. However, this fusion method can only align static objects and not moving objects. In this paper, we proposed a non-local-based multi-scale feature fusion method, which can handle both moving and static objects without GPS- and IMU-based registrations. Considering that non-local methods are resource-consuming, we proposed a novel simplified non-local block based on the sparsity of the point cloud. By filtering out empty units, memory consumption decreased by 99.93%. In addition, triple attention is adopted to enhance the key information on the object and suppresses background noise, further benefiting non-local-based feature fusion methods. Finally, we verify the method based on PointPillars and CenterPoint. Experimental results show that the mAP of the proposed method improved by 3.9% and 4.1% in mAP compared with concatenation-based fusion modules, PointPillars-2 and CenterPoint-2, respectively. In addition, the proposed network outperforms powerful 3D-VID by 1.2% in mAP. |
format | Online Article Text |
id | pubmed-9570913 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-95709132022-10-17 Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection Zhai, Zhenyu Wang, Qiantong Pan, Zongxu Gao, Zhentong Hu, Wenlong Sensors (Basel) Article Continuous frames of point-cloud-based object detection is a new research direction. Currently, most research studies fuse multi-frame point clouds using concatenation-based methods. The method aligns different frames by using information on GPS, IMU, etc. However, this fusion method can only align static objects and not moving objects. In this paper, we proposed a non-local-based multi-scale feature fusion method, which can handle both moving and static objects without GPS- and IMU-based registrations. Considering that non-local methods are resource-consuming, we proposed a novel simplified non-local block based on the sparsity of the point cloud. By filtering out empty units, memory consumption decreased by 99.93%. In addition, triple attention is adopted to enhance the key information on the object and suppresses background noise, further benefiting non-local-based feature fusion methods. Finally, we verify the method based on PointPillars and CenterPoint. Experimental results show that the mAP of the proposed method improved by 3.9% and 4.1% in mAP compared with concatenation-based fusion modules, PointPillars-2 and CenterPoint-2, respectively. In addition, the proposed network outperforms powerful 3D-VID by 1.2% in mAP. MDPI 2022-10-02 /pmc/articles/PMC9570913/ /pubmed/36236572 http://dx.doi.org/10.3390/s22197473 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Zhai, Zhenyu Wang, Qiantong Pan, Zongxu Gao, Zhentong Hu, Wenlong Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection |
title | Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection |
title_full | Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection |
title_fullStr | Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection |
title_full_unstemmed | Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection |
title_short | Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection |
title_sort | muti-frame point cloud feature fusion based on attention mechanisms for 3d object detection |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570913/ https://www.ncbi.nlm.nih.gov/pubmed/36236572 http://dx.doi.org/10.3390/s22197473 |
work_keys_str_mv | AT zhaizhenyu mutiframepointcloudfeaturefusionbasedonattentionmechanismsfor3dobjectdetection AT wangqiantong mutiframepointcloudfeaturefusionbasedonattentionmechanismsfor3dobjectdetection AT panzongxu mutiframepointcloudfeaturefusionbasedonattentionmechanismsfor3dobjectdetection AT gaozhentong mutiframepointcloudfeaturefusionbasedonattentionmechanismsfor3dobjectdetection AT huwenlong mutiframepointcloudfeaturefusionbasedonattentionmechanismsfor3dobjectdetection |