Cargando…

PointPainting: 3D Object Detection Aided by Semantic Image Information

A multi-modal 3D object-detection method, based on data from cameras and LiDAR, has become a subject of research interest. PointPainting proposes a method for improving point-cloud-based 3D object detectors using semantic information from RGB images. However, this method still needs to improve on th...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Zhentong, Wang, Qiantong, Pan, Zongxu, Zhai, Zhenyu, Long, Hui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007089/
https://www.ncbi.nlm.nih.gov/pubmed/36905069
http://dx.doi.org/10.3390/s23052868
_version_ 1784905432331976704
author Gao, Zhentong
Wang, Qiantong
Pan, Zongxu
Zhai, Zhenyu
Long, Hui
author_facet Gao, Zhentong
Wang, Qiantong
Pan, Zongxu
Zhai, Zhenyu
Long, Hui
author_sort Gao, Zhentong
collection PubMed
description A multi-modal 3D object-detection method, based on data from cameras and LiDAR, has become a subject of research interest. PointPainting proposes a method for improving point-cloud-based 3D object detectors using semantic information from RGB images. However, this method still needs to improve on the following two complications: first, there are faulty parts in the image semantic segmentation results, leading to false detections. Second, the commonly used anchor assigner only considers the intersection over union (IoU) between the anchors and ground truth boxes, meaning that some anchors contain few target LiDAR points assigned as positive anchors. In this paper, three improvements are suggested to address these complications. Specifically, a novel weighting strategy is proposed for each anchor in the classification loss. This enables the detector to pay more attention to anchors containing inaccurate semantic information. Then, SegIoU, which incorporates semantic information, instead of IoU, is proposed for the anchor assignment. SegIoU measures the similarity of the semantic information between each anchor and ground truth box, avoiding the defective anchor assignments mentioned above. In addition, a dual-attention module is introduced to enhance the voxelized point cloud. The experiments demonstrate that the proposed modules obtained significant improvements in various methods, consisting of single-stage PointPillars, two-stage SECOND-IoU, anchor-base SECOND, and an anchor-free CenterPoint on the KITTI dataset.
format Online
Article
Text
id pubmed-10007089
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100070892023-03-12 PointPainting: 3D Object Detection Aided by Semantic Image Information Gao, Zhentong Wang, Qiantong Pan, Zongxu Zhai, Zhenyu Long, Hui Sensors (Basel) Article A multi-modal 3D object-detection method, based on data from cameras and LiDAR, has become a subject of research interest. PointPainting proposes a method for improving point-cloud-based 3D object detectors using semantic information from RGB images. However, this method still needs to improve on the following two complications: first, there are faulty parts in the image semantic segmentation results, leading to false detections. Second, the commonly used anchor assigner only considers the intersection over union (IoU) between the anchors and ground truth boxes, meaning that some anchors contain few target LiDAR points assigned as positive anchors. In this paper, three improvements are suggested to address these complications. Specifically, a novel weighting strategy is proposed for each anchor in the classification loss. This enables the detector to pay more attention to anchors containing inaccurate semantic information. Then, SegIoU, which incorporates semantic information, instead of IoU, is proposed for the anchor assignment. SegIoU measures the similarity of the semantic information between each anchor and ground truth box, avoiding the defective anchor assignments mentioned above. In addition, a dual-attention module is introduced to enhance the voxelized point cloud. The experiments demonstrate that the proposed modules obtained significant improvements in various methods, consisting of single-stage PointPillars, two-stage SECOND-IoU, anchor-base SECOND, and an anchor-free CenterPoint on the KITTI dataset. MDPI 2023-03-06 /pmc/articles/PMC10007089/ /pubmed/36905069 http://dx.doi.org/10.3390/s23052868 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Gao, Zhentong
Wang, Qiantong
Pan, Zongxu
Zhai, Zhenyu
Long, Hui
PointPainting: 3D Object Detection Aided by Semantic Image Information
title PointPainting: 3D Object Detection Aided by Semantic Image Information
title_full PointPainting: 3D Object Detection Aided by Semantic Image Information
title_fullStr PointPainting: 3D Object Detection Aided by Semantic Image Information
title_full_unstemmed PointPainting: 3D Object Detection Aided by Semantic Image Information
title_short PointPainting: 3D Object Detection Aided by Semantic Image Information
title_sort pointpainting: 3d object detection aided by semantic image information
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007089/
https://www.ncbi.nlm.nih.gov/pubmed/36905069
http://dx.doi.org/10.3390/s23052868
work_keys_str_mv AT gaozhentong pointpainting3dobjectdetectionaidedbysemanticimageinformation
AT wangqiantong pointpainting3dobjectdetectionaidedbysemanticimageinformation
AT panzongxu pointpainting3dobjectdetectionaidedbysemanticimageinformation
AT zhaizhenyu pointpainting3dobjectdetectionaidedbysemanticimageinformation
AT longhui pointpainting3dobjectdetectionaidedbysemanticimageinformation