Cargando…

Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection

Continuous frames of point-cloud-based object detection is a new research direction. Currently, most research studies fuse multi-frame point clouds using concatenation-based methods. The method aligns different frames by using information on GPS, IMU, etc. However, this fusion method can only align...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhai, Zhenyu, Wang, Qiantong, Pan, Zongxu, Gao, Zhentong, Hu, Wenlong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570913/
https://www.ncbi.nlm.nih.gov/pubmed/36236572
http://dx.doi.org/10.3390/s22197473
_version_ 1784810229641248768
author Zhai, Zhenyu
Wang, Qiantong
Pan, Zongxu
Gao, Zhentong
Hu, Wenlong
author_facet Zhai, Zhenyu
Wang, Qiantong
Pan, Zongxu
Gao, Zhentong
Hu, Wenlong
author_sort Zhai, Zhenyu
collection PubMed
description Continuous frames of point-cloud-based object detection is a new research direction. Currently, most research studies fuse multi-frame point clouds using concatenation-based methods. The method aligns different frames by using information on GPS, IMU, etc. However, this fusion method can only align static objects and not moving objects. In this paper, we proposed a non-local-based multi-scale feature fusion method, which can handle both moving and static objects without GPS- and IMU-based registrations. Considering that non-local methods are resource-consuming, we proposed a novel simplified non-local block based on the sparsity of the point cloud. By filtering out empty units, memory consumption decreased by 99.93%. In addition, triple attention is adopted to enhance the key information on the object and suppresses background noise, further benefiting non-local-based feature fusion methods. Finally, we verify the method based on PointPillars and CenterPoint. Experimental results show that the mAP of the proposed method improved by 3.9% and 4.1% in mAP compared with concatenation-based fusion modules, PointPillars-2 and CenterPoint-2, respectively. In addition, the proposed network outperforms powerful 3D-VID by 1.2% in mAP.
format Online
Article
Text
id pubmed-9570913
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95709132022-10-17 Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection Zhai, Zhenyu Wang, Qiantong Pan, Zongxu Gao, Zhentong Hu, Wenlong Sensors (Basel) Article Continuous frames of point-cloud-based object detection is a new research direction. Currently, most research studies fuse multi-frame point clouds using concatenation-based methods. The method aligns different frames by using information on GPS, IMU, etc. However, this fusion method can only align static objects and not moving objects. In this paper, we proposed a non-local-based multi-scale feature fusion method, which can handle both moving and static objects without GPS- and IMU-based registrations. Considering that non-local methods are resource-consuming, we proposed a novel simplified non-local block based on the sparsity of the point cloud. By filtering out empty units, memory consumption decreased by 99.93%. In addition, triple attention is adopted to enhance the key information on the object and suppresses background noise, further benefiting non-local-based feature fusion methods. Finally, we verify the method based on PointPillars and CenterPoint. Experimental results show that the mAP of the proposed method improved by 3.9% and 4.1% in mAP compared with concatenation-based fusion modules, PointPillars-2 and CenterPoint-2, respectively. In addition, the proposed network outperforms powerful 3D-VID by 1.2% in mAP. MDPI 2022-10-02 /pmc/articles/PMC9570913/ /pubmed/36236572 http://dx.doi.org/10.3390/s22197473 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhai, Zhenyu
Wang, Qiantong
Pan, Zongxu
Gao, Zhentong
Hu, Wenlong
Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection
title Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection
title_full Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection
title_fullStr Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection
title_full_unstemmed Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection
title_short Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection
title_sort muti-frame point cloud feature fusion based on attention mechanisms for 3d object detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570913/
https://www.ncbi.nlm.nih.gov/pubmed/36236572
http://dx.doi.org/10.3390/s22197473
work_keys_str_mv AT zhaizhenyu mutiframepointcloudfeaturefusionbasedonattentionmechanismsfor3dobjectdetection
AT wangqiantong mutiframepointcloudfeaturefusionbasedonattentionmechanismsfor3dobjectdetection
AT panzongxu mutiframepointcloudfeaturefusionbasedonattentionmechanismsfor3dobjectdetection
AT gaozhentong mutiframepointcloudfeaturefusionbasedonattentionmechanismsfor3dobjectdetection
AT huwenlong mutiframepointcloudfeaturefusionbasedonattentionmechanismsfor3dobjectdetection