Cargando…

Refined Voting and Scene Feature Fusion for 3D Object Detection in Point Clouds

An essential task for 3D visual world understanding is 3D object detection in lidar point clouds. To predict directly bounding box parameters from point clouds, existing voting-based methods use Hough voting to obtain the centroid of each object. However, it may be difficult for the inaccurately vot...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yu, Hang, Su, Jinhe, Piao, Yingchao, Cai, Guorong, Lin, Yangbin, Liu, Niansheng, Liu, Weiquan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9815914/ https://www.ncbi.nlm.nih.gov/pubmed/36619812 http://dx.doi.org/10.1155/2022/3023934

_version_	1784864417985331200
author	Yu, Hang Su, Jinhe Piao, Yingchao Cai, Guorong Lin, Yangbin Liu, Niansheng Liu, Weiquan
author_facet	Yu, Hang Su, Jinhe Piao, Yingchao Cai, Guorong Lin, Yangbin Liu, Niansheng Liu, Weiquan
author_sort	Yu, Hang
collection	PubMed
description	An essential task for 3D visual world understanding is 3D object detection in lidar point clouds. To predict directly bounding box parameters from point clouds, existing voting-based methods use Hough voting to obtain the centroid of each object. However, it may be difficult for the inaccurately voted centers to regress boxes accurately, leading to the generation of redundant bounding boxes. For objects in indoor scenes, there are several co-occurrence patterns for objects in indoor scenes. Concurrently, semantic relations between object layouts and scenes can be used as prior context to guide object detection. We propose a simple, yet effective network, RSFF-Net, which adds refined voting and scene feature fusion for indoor 3D object detection. The RSFF-Net consists of three modules: geometric function, refined voting, and scene constraint. First, a geometric function module is used to capture the geometric features of the nearest object of the voted points. Then, the coarse votes are revoted by a refined voting module, which is based on the fused feature between the coarse votes and geometric features. Finally, a scene constraint module is used to add the association information between candidate objects and scenes. RSFF-Net achieves competitive results on indoor 3D object detection benchmarks: ScanNet V2 and SUN RGB-D.
format	Online Article Text
id	pubmed-9815914
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Hindawi
record_format	MEDLINE/PubMed
spelling	pubmed-98159142023-01-06 Refined Voting and Scene Feature Fusion for 3D Object Detection in Point Clouds Yu, Hang Su, Jinhe Piao, Yingchao Cai, Guorong Lin, Yangbin Liu, Niansheng Liu, Weiquan Comput Intell Neurosci Research Article An essential task for 3D visual world understanding is 3D object detection in lidar point clouds. To predict directly bounding box parameters from point clouds, existing voting-based methods use Hough voting to obtain the centroid of each object. However, it may be difficult for the inaccurately voted centers to regress boxes accurately, leading to the generation of redundant bounding boxes. For objects in indoor scenes, there are several co-occurrence patterns for objects in indoor scenes. Concurrently, semantic relations between object layouts and scenes can be used as prior context to guide object detection. We propose a simple, yet effective network, RSFF-Net, which adds refined voting and scene feature fusion for indoor 3D object detection. The RSFF-Net consists of three modules: geometric function, refined voting, and scene constraint. First, a geometric function module is used to capture the geometric features of the nearest object of the voted points. Then, the coarse votes are revoted by a refined voting module, which is based on the fused feature between the coarse votes and geometric features. Finally, a scene constraint module is used to add the association information between candidate objects and scenes. RSFF-Net achieves competitive results on indoor 3D object detection benchmarks: ScanNet V2 and SUN RGB-D. Hindawi 2022-12-29 /pmc/articles/PMC9815914/ /pubmed/36619812 http://dx.doi.org/10.1155/2022/3023934 Text en Copyright © 2022 Hang Yu et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Yu, Hang Su, Jinhe Piao, Yingchao Cai, Guorong Lin, Yangbin Liu, Niansheng Liu, Weiquan Refined Voting and Scene Feature Fusion for 3D Object Detection in Point Clouds
title	Refined Voting and Scene Feature Fusion for 3D Object Detection in Point Clouds
title_full	Refined Voting and Scene Feature Fusion for 3D Object Detection in Point Clouds
title_fullStr	Refined Voting and Scene Feature Fusion for 3D Object Detection in Point Clouds
title_full_unstemmed	Refined Voting and Scene Feature Fusion for 3D Object Detection in Point Clouds
title_short	Refined Voting and Scene Feature Fusion for 3D Object Detection in Point Clouds
title_sort	refined voting and scene feature fusion for 3d object detection in point clouds
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9815914/ https://www.ncbi.nlm.nih.gov/pubmed/36619812 http://dx.doi.org/10.1155/2022/3023934
work_keys_str_mv	AT yuhang refinedvotingandscenefeaturefusionfor3dobjectdetectioninpointclouds AT sujinhe refinedvotingandscenefeaturefusionfor3dobjectdetectioninpointclouds AT piaoyingchao refinedvotingandscenefeaturefusionfor3dobjectdetectioninpointclouds AT caiguorong refinedvotingandscenefeaturefusionfor3dobjectdetectioninpointclouds AT linyangbin refinedvotingandscenefeaturefusionfor3dobjectdetectioninpointclouds AT liuniansheng refinedvotingandscenefeaturefusionfor3dobjectdetectioninpointclouds AT liuweiquan refinedvotingandscenefeaturefusionfor3dobjectdetectioninpointclouds

Refined Voting and Scene Feature Fusion for 3D Object Detection in Point Clouds

Ejemplares similares