Cargando…

Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover

This paper presents an application of neural networks operating on multimodal 3D data (3D point cloud, RGB, thermal) to effectively and precisely segment human hands and objects held in hand to realize a safe human–robot object handover. We discuss the problems encountered in building a multimodal s...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Yan, Müller, Steffen, Stephan, Benedict, Gross, Horst-Michael, Notni, Gunther
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8402345/ https://www.ncbi.nlm.nih.gov/pubmed/34451117 http://dx.doi.org/10.3390/s21165676

_version_	1783745767505133568
author	Zhang, Yan Müller, Steffen Stephan, Benedict Gross, Horst-Michael Notni, Gunther
author_facet	Zhang, Yan Müller, Steffen Stephan, Benedict Gross, Horst-Michael Notni, Gunther
author_sort	Zhang, Yan
collection	PubMed
description	This paper presents an application of neural networks operating on multimodal 3D data (3D point cloud, RGB, thermal) to effectively and precisely segment human hands and objects held in hand to realize a safe human–robot object handover. We discuss the problems encountered in building a multimodal sensor system, while the focus is on the calibration and alignment of a set of cameras including RGB, thermal, and NIR cameras. We propose the use of a copper–plastic chessboard calibration target with an internal active light source (near-infrared and visible light). By brief heating, the calibration target could be simultaneously and legibly captured by all cameras. Based on the multimodal dataset captured by our sensor system, PointNet, PointNet++, and RandLA-Net are utilized to verify the effectiveness of applying multimodal point cloud data for hand–object segmentation. These networks were trained on various data modes (XYZ, XYZ-T, XYZ-RGB, and XYZ-RGB-T). The experimental results show a significant improvement in the segmentation performance of XYZ-RGB-T (mean Intersection over Union: [Formula: see text] by RandLA-Net) compared with the other three modes ([Formula: see text] by XYZ-RGB, [Formula: see text] by XYZ-T, [Formula: see text] by XYZ), in which it is worth mentioning that the Intersection over Union for the single class of hand achieves [Formula: see text].
format	Online Article Text
id	pubmed-8402345
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-84023452021-08-29 Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover Zhang, Yan Müller, Steffen Stephan, Benedict Gross, Horst-Michael Notni, Gunther Sensors (Basel) Article This paper presents an application of neural networks operating on multimodal 3D data (3D point cloud, RGB, thermal) to effectively and precisely segment human hands and objects held in hand to realize a safe human–robot object handover. We discuss the problems encountered in building a multimodal sensor system, while the focus is on the calibration and alignment of a set of cameras including RGB, thermal, and NIR cameras. We propose the use of a copper–plastic chessboard calibration target with an internal active light source (near-infrared and visible light). By brief heating, the calibration target could be simultaneously and legibly captured by all cameras. Based on the multimodal dataset captured by our sensor system, PointNet, PointNet++, and RandLA-Net are utilized to verify the effectiveness of applying multimodal point cloud data for hand–object segmentation. These networks were trained on various data modes (XYZ, XYZ-T, XYZ-RGB, and XYZ-RGB-T). The experimental results show a significant improvement in the segmentation performance of XYZ-RGB-T (mean Intersection over Union: [Formula: see text] by RandLA-Net) compared with the other three modes ([Formula: see text] by XYZ-RGB, [Formula: see text] by XYZ-T, [Formula: see text] by XYZ), in which it is worth mentioning that the Intersection over Union for the single class of hand achieves [Formula: see text]. MDPI 2021-08-23 /pmc/articles/PMC8402345/ /pubmed/34451117 http://dx.doi.org/10.3390/s21165676 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Zhang, Yan Müller, Steffen Stephan, Benedict Gross, Horst-Michael Notni, Gunther Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover
title	Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover
title_full	Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover
title_fullStr	Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover
title_full_unstemmed	Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover
title_short	Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover
title_sort	point cloud hand–object segmentation using multimodal imaging with thermal and color data for safe robotic object handover
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8402345/ https://www.ncbi.nlm.nih.gov/pubmed/34451117 http://dx.doi.org/10.3390/s21165676
work_keys_str_mv	AT zhangyan pointcloudhandobjectsegmentationusingmultimodalimagingwiththermalandcolordataforsaferoboticobjecthandover AT mullersteffen pointcloudhandobjectsegmentationusingmultimodalimagingwiththermalandcolordataforsaferoboticobjecthandover AT stephanbenedict pointcloudhandobjectsegmentationusingmultimodalimagingwiththermalandcolordataforsaferoboticobjecthandover AT grosshorstmichael pointcloudhandobjectsegmentationusingmultimodalimagingwiththermalandcolordataforsaferoboticobjecthandover AT notnigunther pointcloudhandobjectsegmentationusingmultimodalimagingwiththermalandcolordataforsaferoboticobjecthandover

Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover

Ejemplares similares