Cargando…

Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud

Three-dimensional object detection utilizing LiDAR point cloud data is an indispensable part of autonomous driving perception systems. Point cloud-based 3D object detection has been a better replacement for higher accuracy than cameras during nighttime. However, most LiDAR-based 3D object methods wo...

Descripción completa

Detalles Bibliográficos
Autores principales: Imad, Muhammad, Doukhi, Oualid, Lee, Deok-Jin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8230345/
https://www.ncbi.nlm.nih.gov/pubmed/34201390
http://dx.doi.org/10.3390/s21123964
_version_ 1783713186972696576
author Imad, Muhammad
Doukhi, Oualid
Lee, Deok-Jin
author_facet Imad, Muhammad
Doukhi, Oualid
Lee, Deok-Jin
author_sort Imad, Muhammad
collection PubMed
description Three-dimensional object detection utilizing LiDAR point cloud data is an indispensable part of autonomous driving perception systems. Point cloud-based 3D object detection has been a better replacement for higher accuracy than cameras during nighttime. However, most LiDAR-based 3D object methods work in a supervised manner, which means their state-of-the-art performance relies heavily on a large-scale and well-labeled dataset, while these annotated datasets could be expensive to obtain and only accessible in the limited scenario. Transfer learning is a promising approach to reduce the large-scale training datasets requirement, but existing transfer learning object detectors are primarily for 2D object detection rather than 3D. In this work, we utilize the 3D point cloud data more effectively by representing the birds-eye-view (BEV) scene and propose a transfer learning based point cloud semantic segmentation for 3D object detection. The proposed model minimizes the need for large-scale training datasets and consequently reduces the training time. First, a preprocessing stage filters the raw point cloud data to a BEV map within a specific field of view. Second, the transfer learning stage uses knowledge from the previously learned classification task (with more data for training) and generalizes the semantic segmentation-based 2D object detection task. Finally, 2D detection results from the BEV image have been back-projected into 3D in the postprocessing stage. We verify results on two datasets: the KITTI 3D object detection dataset and the Ouster LiDAR-64 dataset, thus demonstrating that the proposed method is highly competitive in terms of mean average precision (mAP up to [Formula: see text]) while still running at more than 30 frames per second (FPS).
format Online
Article
Text
id pubmed-8230345
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-82303452021-06-26 Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud Imad, Muhammad Doukhi, Oualid Lee, Deok-Jin Sensors (Basel) Article Three-dimensional object detection utilizing LiDAR point cloud data is an indispensable part of autonomous driving perception systems. Point cloud-based 3D object detection has been a better replacement for higher accuracy than cameras during nighttime. However, most LiDAR-based 3D object methods work in a supervised manner, which means their state-of-the-art performance relies heavily on a large-scale and well-labeled dataset, while these annotated datasets could be expensive to obtain and only accessible in the limited scenario. Transfer learning is a promising approach to reduce the large-scale training datasets requirement, but existing transfer learning object detectors are primarily for 2D object detection rather than 3D. In this work, we utilize the 3D point cloud data more effectively by representing the birds-eye-view (BEV) scene and propose a transfer learning based point cloud semantic segmentation for 3D object detection. The proposed model minimizes the need for large-scale training datasets and consequently reduces the training time. First, a preprocessing stage filters the raw point cloud data to a BEV map within a specific field of view. Second, the transfer learning stage uses knowledge from the previously learned classification task (with more data for training) and generalizes the semantic segmentation-based 2D object detection task. Finally, 2D detection results from the BEV image have been back-projected into 3D in the postprocessing stage. We verify results on two datasets: the KITTI 3D object detection dataset and the Ouster LiDAR-64 dataset, thus demonstrating that the proposed method is highly competitive in terms of mean average precision (mAP up to [Formula: see text]) while still running at more than 30 frames per second (FPS). MDPI 2021-06-08 /pmc/articles/PMC8230345/ /pubmed/34201390 http://dx.doi.org/10.3390/s21123964 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Imad, Muhammad
Doukhi, Oualid
Lee, Deok-Jin
Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud
title Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud
title_full Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud
title_fullStr Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud
title_full_unstemmed Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud
title_short Transfer Learning Based Semantic Segmentation for 3D Object Detection from Point Cloud
title_sort transfer learning based semantic segmentation for 3d object detection from point cloud
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8230345/
https://www.ncbi.nlm.nih.gov/pubmed/34201390
http://dx.doi.org/10.3390/s21123964
work_keys_str_mv AT imadmuhammad transferlearningbasedsemanticsegmentationfor3dobjectdetectionfrompointcloud
AT doukhioualid transferlearningbasedsemanticsegmentationfor3dobjectdetectionfrompointcloud
AT leedeokjin transferlearningbasedsemanticsegmentationfor3dobjectdetectionfrompointcloud