Cargando…

FWNet: Semantic Segmentation for Full-Waveform LiDAR Data Using Deep Learning

In the computer vision field, many 3D deep learning models that directly manage 3D point clouds (proposed after PointNet) have been published. Moreover, deep learning-based-techniques have demonstrated state-of-the-art performance for supervised learning tasks on 3D point cloud data, such as classif...

Descripción completa

Detalles Bibliográficos
Autores principales: Shinohara, Takayuki, Xiu, Haoyi, Matsuoka, Masashi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7349408/
https://www.ncbi.nlm.nih.gov/pubmed/32599774
http://dx.doi.org/10.3390/s20123568
_version_ 1783557057190821888
author Shinohara, Takayuki
Xiu, Haoyi
Matsuoka, Masashi
author_facet Shinohara, Takayuki
Xiu, Haoyi
Matsuoka, Masashi
author_sort Shinohara, Takayuki
collection PubMed
description In the computer vision field, many 3D deep learning models that directly manage 3D point clouds (proposed after PointNet) have been published. Moreover, deep learning-based-techniques have demonstrated state-of-the-art performance for supervised learning tasks on 3D point cloud data, such as classification and segmentation tasks for open datasets in competitions. Furthermore, many researchers have attempted to apply these deep learning-based techniques to 3D point clouds observed by aerial laser scanners (ALSs). However, most of these studies were developed for 3D point clouds without radiometric information. In this paper, we investigate the possibility of using a deep learning method to solve the semantic segmentation task of airborne full-waveform light detection and ranging (lidar) data that consists of geometric information and radiometric waveform data. Thus, we propose a data-driven semantic segmentation model called the full-waveform network (FWNet), which handles the waveform of full-waveform lidar data without any conversion process, such as projection onto a 2D grid or calculating handcrafted features. Our FWNet is based on a PointNet-based architecture, which can extract the local and global features of each input waveform data, along with its corresponding geographical coordinates. Subsequently, the classifier consists of 1D convolutional operational layers, which predict the class vector corresponding to the input waveform from the extracted local and global features. Our trained FWNet achieved higher scores in its recall, precision, and F1 score for unseen test data—higher scores than those of previously proposed methods in full-waveform lidar data analysis domain. Specifically, our FWNet achieved a mean recall of 0.73, a mean precision of 0.81, and a mean F1 score of 0.76. We further performed an ablation study, that is assessing the effectiveness of our proposed method, of the above-mentioned metric. Moreover, we investigated the effectiveness of our PointNet based local and global feature extraction method using the visualization of the feature vector. In this way, we have shown that our network for local and global feature extraction allows training with semantic segmentation without requiring expert knowledge on full-waveform lidar data or translation into 2D images or voxels.
format Online
Article
Text
id pubmed-7349408
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-73494082020-07-14 FWNet: Semantic Segmentation for Full-Waveform LiDAR Data Using Deep Learning Shinohara, Takayuki Xiu, Haoyi Matsuoka, Masashi Sensors (Basel) Article In the computer vision field, many 3D deep learning models that directly manage 3D point clouds (proposed after PointNet) have been published. Moreover, deep learning-based-techniques have demonstrated state-of-the-art performance for supervised learning tasks on 3D point cloud data, such as classification and segmentation tasks for open datasets in competitions. Furthermore, many researchers have attempted to apply these deep learning-based techniques to 3D point clouds observed by aerial laser scanners (ALSs). However, most of these studies were developed for 3D point clouds without radiometric information. In this paper, we investigate the possibility of using a deep learning method to solve the semantic segmentation task of airborne full-waveform light detection and ranging (lidar) data that consists of geometric information and radiometric waveform data. Thus, we propose a data-driven semantic segmentation model called the full-waveform network (FWNet), which handles the waveform of full-waveform lidar data without any conversion process, such as projection onto a 2D grid or calculating handcrafted features. Our FWNet is based on a PointNet-based architecture, which can extract the local and global features of each input waveform data, along with its corresponding geographical coordinates. Subsequently, the classifier consists of 1D convolutional operational layers, which predict the class vector corresponding to the input waveform from the extracted local and global features. Our trained FWNet achieved higher scores in its recall, precision, and F1 score for unseen test data—higher scores than those of previously proposed methods in full-waveform lidar data analysis domain. Specifically, our FWNet achieved a mean recall of 0.73, a mean precision of 0.81, and a mean F1 score of 0.76. We further performed an ablation study, that is assessing the effectiveness of our proposed method, of the above-mentioned metric. Moreover, we investigated the effectiveness of our PointNet based local and global feature extraction method using the visualization of the feature vector. In this way, we have shown that our network for local and global feature extraction allows training with semantic segmentation without requiring expert knowledge on full-waveform lidar data or translation into 2D images or voxels. MDPI 2020-06-24 /pmc/articles/PMC7349408/ /pubmed/32599774 http://dx.doi.org/10.3390/s20123568 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Shinohara, Takayuki
Xiu, Haoyi
Matsuoka, Masashi
FWNet: Semantic Segmentation for Full-Waveform LiDAR Data Using Deep Learning
title FWNet: Semantic Segmentation for Full-Waveform LiDAR Data Using Deep Learning
title_full FWNet: Semantic Segmentation for Full-Waveform LiDAR Data Using Deep Learning
title_fullStr FWNet: Semantic Segmentation for Full-Waveform LiDAR Data Using Deep Learning
title_full_unstemmed FWNet: Semantic Segmentation for Full-Waveform LiDAR Data Using Deep Learning
title_short FWNet: Semantic Segmentation for Full-Waveform LiDAR Data Using Deep Learning
title_sort fwnet: semantic segmentation for full-waveform lidar data using deep learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7349408/
https://www.ncbi.nlm.nih.gov/pubmed/32599774
http://dx.doi.org/10.3390/s20123568
work_keys_str_mv AT shinoharatakayuki fwnetsemanticsegmentationforfullwaveformlidardatausingdeeplearning
AT xiuhaoyi fwnetsemanticsegmentationforfullwaveformlidardatausingdeeplearning
AT matsuokamasashi fwnetsemanticsegmentationforfullwaveformlidardatausingdeeplearning