
A Two-Phase Cross-Modality Fusion Network for Robust 3D Object Detection

A two-phase cross-modality fusion detector is proposed in this study for robust and high-precision 3D object detection with RGB images and LiDAR point clouds. First, a two-stream fusion network is built into the framework of Faster RCNN to perform accurate and robust 2D detection. The visible stream...


Bibliographic Details
Main Authors:	Jiao, Yujun; Yin, Zhishuai
Format:	Online Article Text
Language:	English
Published:	MDPI 2020
Subjects:	Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7660652/ https://www.ncbi.nlm.nih.gov/pubmed/33114234 http://dx.doi.org/10.3390/s20216043

author	Jiao, Yujun; Yin, Zhishuai
collection	PubMed
description	A two-phase cross-modality fusion detector is proposed in this study for robust and high-precision 3D object detection with RGB images and LiDAR point clouds. First, a two-stream fusion network is built into the framework of Faster RCNN to perform accurate and robust 2D detection. The visible stream takes the RGB images as inputs, while the intensity stream is fed with the intensity maps which are generated by projecting the reflection intensity of point clouds to the front view. A multi-layer feature-level fusion scheme is designed to merge multi-modal features across multiple layers in order to enhance the expressiveness and robustness of the produced features upon which region proposals are generated. Second, a decision-level fusion is implemented by projecting 2D proposals to the space of the point cloud to generate 3D frustums, on the basis of which the second-phase 3D detector is built to accomplish instance segmentation and 3D-box regression on the filtered point cloud. The results on the KITTI benchmark show that features extracted from RGB images and intensity maps complement each other, and our proposed detector achieves state-of-the-art performance on 3D object detection with a substantially lower running time as compared to available competitors.
format	Online Article Text
id	pubmed-7660652
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-7660652 2020-11-13 A Two-Phase Cross-Modality Fusion Network for Robust 3D Object Detection Jiao, Yujun; Yin, Zhishuai Sensors (Basel) Article MDPI 2020-10-23 /pmc/articles/PMC7660652/ /pubmed/33114234 http://dx.doi.org/10.3390/s20216043 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
title	A Two-Phase Cross-Modality Fusion Network for Robust 3D Object Detection
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7660652/ https://www.ncbi.nlm.nih.gov/pubmed/33114234 http://dx.doi.org/10.3390/s20216043
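
The abstract names two projection steps: reflection intensity is projected to the front view to form an intensity map, and 2D proposals are projected into the point cloud to form 3D frustums. The sketch below illustrates both steps in plain Python. It is not the authors' implementation: the point layout `(x, y, z, intensity)` in a camera-aligned frame, the 3x4 projection matrix `P`, and the function names `project`, `intensity_map`, and `frustum_points` are all assumptions made for illustration.

```python
import math
from typing import List, Optional, Sequence, Tuple

# Assumed layout: (x, y, z, intensity), already expressed in the camera frame.
Point = Tuple[float, float, float, float]
# Assumed 3x4 camera projection matrix given as nested sequences.
Matrix = Sequence[Sequence[float]]


def project(point: Point, P: Matrix) -> Optional[Tuple[float, float]]:
    """Project one 3D point to pixel coordinates; None if it lies behind the camera."""
    x, y, z, _ = point
    h = (x, y, z, 1.0)  # homogeneous coordinates
    u, v, w = (sum(row[i] * h[i] for i in range(4)) for row in P)
    if w <= 0.0:
        return None
    return u / w, v / w


def intensity_map(points: Sequence[Point], P: Matrix,
                  width: int, height: int) -> List[List[float]]:
    """Rasterise reflection intensity to a front-view image; the nearest point wins."""
    image = [[0.0] * width for _ in range(height)]
    depth = [[math.inf] * width for _ in range(height)]
    for p in points:
        uv = project(p, P)
        if uv is None:
            continue
        col, row = math.floor(uv[0]), math.floor(uv[1])
        if 0 <= col < width and 0 <= row < height and p[2] < depth[row][col]:
            depth[row][col] = p[2]
            image[row][col] = p[3]
    return image


def frustum_points(points: Sequence[Point], P: Matrix,
                   box: Tuple[float, float, float, float]) -> List[Point]:
    """Keep the points whose projection falls inside a 2D box (u_min, v_min, u_max, v_max)."""
    u_min, v_min, u_max, v_max = box
    kept = []
    for p in points:
        uv = project(p, P)
        if uv is not None and u_min <= uv[0] <= u_max and v_min <= uv[1] <= v_max:
            kept.append(p)
    return kept
```

In this sketch the points returned by `frustum_points` would be the filtered point cloud passed to a second-phase 3D detector; the segmentation and box-regression networks themselves are outside its scope.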


Similar Items