Cargando…

Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection

Continuous frames of point-cloud-based object detection is a new research direction. Currently, most research studies fuse multi-frame point clouds using concatenation-based methods. The method aligns different frames by using information on GPS, IMU, etc. However, this fusion method can only align...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhai, Zhenyu, Wang, Qiantong, Pan, Zongxu, Gao, Zhentong, Hu, Wenlong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570913/ https://www.ncbi.nlm.nih.gov/pubmed/36236572 http://dx.doi.org/10.3390/s22197473

_version_	1784810229641248768
author	Zhai, Zhenyu Wang, Qiantong Pan, Zongxu Gao, Zhentong Hu, Wenlong
author_facet	Zhai, Zhenyu Wang, Qiantong Pan, Zongxu Gao, Zhentong Hu, Wenlong
author_sort	Zhai, Zhenyu
collection	PubMed
description	Continuous frames of point-cloud-based object detection is a new research direction. Currently, most research studies fuse multi-frame point clouds using concatenation-based methods. The method aligns different frames by using information on GPS, IMU, etc. However, this fusion method can only align static objects and not moving objects. In this paper, we proposed a non-local-based multi-scale feature fusion method, which can handle both moving and static objects without GPS- and IMU-based registrations. Considering that non-local methods are resource-consuming, we proposed a novel simplified non-local block based on the sparsity of the point cloud. By filtering out empty units, memory consumption decreased by 99.93%. In addition, triple attention is adopted to enhance the key information on the object and suppresses background noise, further benefiting non-local-based feature fusion methods. Finally, we verify the method based on PointPillars and CenterPoint. Experimental results show that the mAP of the proposed method improved by 3.9% and 4.1% in mAP compared with concatenation-based fusion modules, PointPillars-2 and CenterPoint-2, respectively. In addition, the proposed network outperforms powerful 3D-VID by 1.2% in mAP.
format	Online Article Text
id	pubmed-9570913
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-95709132022-10-17 Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection Zhai, Zhenyu Wang, Qiantong Pan, Zongxu Gao, Zhentong Hu, Wenlong Sensors (Basel) Article Continuous frames of point-cloud-based object detection is a new research direction. Currently, most research studies fuse multi-frame point clouds using concatenation-based methods. The method aligns different frames by using information on GPS, IMU, etc. However, this fusion method can only align static objects and not moving objects. In this paper, we proposed a non-local-based multi-scale feature fusion method, which can handle both moving and static objects without GPS- and IMU-based registrations. Considering that non-local methods are resource-consuming, we proposed a novel simplified non-local block based on the sparsity of the point cloud. By filtering out empty units, memory consumption decreased by 99.93%. In addition, triple attention is adopted to enhance the key information on the object and suppresses background noise, further benefiting non-local-based feature fusion methods. Finally, we verify the method based on PointPillars and CenterPoint. Experimental results show that the mAP of the proposed method improved by 3.9% and 4.1% in mAP compared with concatenation-based fusion modules, PointPillars-2 and CenterPoint-2, respectively. In addition, the proposed network outperforms powerful 3D-VID by 1.2% in mAP. MDPI 2022-10-02 /pmc/articles/PMC9570913/ /pubmed/36236572 http://dx.doi.org/10.3390/s22197473 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Zhai, Zhenyu Wang, Qiantong Pan, Zongxu Gao, Zhentong Hu, Wenlong Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection
title	Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection
title_full	Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection
title_fullStr	Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection
title_full_unstemmed	Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection
title_short	Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection
title_sort	muti-frame point cloud feature fusion based on attention mechanisms for 3d object detection
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9570913/ https://www.ncbi.nlm.nih.gov/pubmed/36236572 http://dx.doi.org/10.3390/s22197473
work_keys_str_mv	AT zhaizhenyu mutiframepointcloudfeaturefusionbasedonattentionmechanismsfor3dobjectdetection AT wangqiantong mutiframepointcloudfeaturefusionbasedonattentionmechanismsfor3dobjectdetection AT panzongxu mutiframepointcloudfeaturefusionbasedonattentionmechanismsfor3dobjectdetection AT gaozhentong mutiframepointcloudfeaturefusionbasedonattentionmechanismsfor3dobjectdetection AT huwenlong mutiframepointcloudfeaturefusionbasedonattentionmechanismsfor3dobjectdetection

Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection

Ejemplares similares