Cargando…

Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud

Three-dimensional object detection utilizing LiDAR point cloud data is an indispensable part of autonomous driving perception systems. Point cloud-based 3D object detection has been a better replacement for higher accuracy than cameras during nighttime. However, most LiDAR-based 3D object methods wo...

Descripción completa

Detalles Bibliográficos
Autores principales:	Imad, Muhammad, Doukhi, Oualid, Lee, Deok-Jin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8230345/ https://www.ncbi.nlm.nih.gov/pubmed/34201390 http://dx.doi.org/10.3390/s21123964

_version_	1783713186972696576
author	Imad, Muhammad Doukhi, Oualid Lee, Deok-Jin
author_facet	Imad, Muhammad Doukhi, Oualid Lee, Deok-Jin
author_sort	Imad, Muhammad
collection	PubMed
description	Three-dimensional object detection utilizing LiDAR point cloud data is an indispensable part of autonomous driving perception systems. Point cloud-based 3D object detection has been a better replacement for higher accuracy than cameras during nighttime. However, most LiDAR-based 3D object methods work in a supervised manner, which means their state-of-the-art performance relies heavily on a large-scale and well-labeled dataset, while these annotated datasets could be expensive to obtain and only accessible in the limited scenario. Transfer learning is a promising approach to reduce the large-scale training datasets requirement, but existing transfer learning object detectors are primarily for 2D object detection rather than 3D. In this work, we utilize the 3D point cloud data more effectively by representing the birds-eye-view (BEV) scene and propose a transfer learning based point cloud semantic segmentation for 3D object detection. The proposed model minimizes the need for large-scale training datasets and consequently reduces the training time. First, a preprocessing stage filters the raw point cloud data to a BEV map within a specific field of view. Second, the transfer learning stage uses knowledge from the previously learned classification task (with more data for training) and generalizes the semantic segmentation-based 2D object detection task. Finally, 2D detection results from the BEV image have been back-projected into 3D in the postprocessing stage. We verify results on two datasets: the KITTI 3D object detection dataset and the Ouster LiDAR-64 dataset, thus demonstrating that the proposed method is highly competitive in terms of mean average precision (mAP up to [Formula: see text]) while still running at more than 30 frames per second (FPS).
format	Online Article Text
id	pubmed-8230345
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-82303452021-06-26 Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud Imad, Muhammad Doukhi, Oualid Lee, Deok-Jin Sensors (Basel) Article Three-dimensional object detection utilizing LiDAR point cloud data is an indispensable part of autonomous driving perception systems. Point cloud-based 3D object detection has been a better replacement for higher accuracy than cameras during nighttime. However, most LiDAR-based 3D object methods work in a supervised manner, which means their state-of-the-art performance relies heavily on a large-scale and well-labeled dataset, while these annotated datasets could be expensive to obtain and only accessible in the limited scenario. Transfer learning is a promising approach to reduce the large-scale training datasets requirement, but existing transfer learning object detectors are primarily for 2D object detection rather than 3D. In this work, we utilize the 3D point cloud data more effectively by representing the birds-eye-view (BEV) scene and propose a transfer learning based point cloud semantic segmentation for 3D object detection. The proposed model minimizes the need for large-scale training datasets and consequently reduces the training time. First, a preprocessing stage filters the raw point cloud data to a BEV map within a specific field of view. Second, the transfer learning stage uses knowledge from the previously learned classification task (with more data for training) and generalizes the semantic segmentation-based 2D object detection task. Finally, 2D detection results from the BEV image have been back-projected into 3D in the postprocessing stage. We verify results on two datasets: the KITTI 3D object detection dataset and the Ouster LiDAR-64 dataset, thus demonstrating that the proposed method is highly competitive in terms of mean average precision (mAP up to [Formula: see text]) while still running at more than 30 frames per second (FPS). MDPI 2021-06-08 /pmc/articles/PMC8230345/ /pubmed/34201390 http://dx.doi.org/10.3390/s21123964 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Imad, Muhammad Doukhi, Oualid Lee, Deok-Jin Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud
title	Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud
title_full	Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud
title_fullStr	Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud
title_full_unstemmed	Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud
title_short	Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud
title_sort	transfer learning based semantic segmentation for 3d object detection from point cloud
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8230345/ https://www.ncbi.nlm.nih.gov/pubmed/34201390 http://dx.doi.org/10.3390/s21123964
work_keys_str_mv	AT imadmuhammad transferlearningbasedsemanticsegmentationfor3dobjectdetectionfrompointcloud AT doukhioualid transferlearningbasedsemanticsegmentationfor3dobjectdetectionfrompointcloud AT leedeokjin transferlearningbasedsemanticsegmentationfor3dobjectdetectionfrompointcloud

Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud

Ejemplares similares