Cargando…

Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover

This paper presents an application of neural networks operating on multimodal 3D data (3D point cloud, RGB, thermal) to effectively and precisely segment human hands and objects held in hand to realize a safe human–robot object handover. We discuss the problems encountered in building a multimodal s...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yan, Müller, Steffen, Stephan, Benedict, Gross, Horst-Michael, Notni, Gunther
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8402345/
https://www.ncbi.nlm.nih.gov/pubmed/34451117
http://dx.doi.org/10.3390/s21165676
_version_ 1783745767505133568
author Zhang, Yan
Müller, Steffen
Stephan, Benedict
Gross, Horst-Michael
Notni, Gunther
author_facet Zhang, Yan
Müller, Steffen
Stephan, Benedict
Gross, Horst-Michael
Notni, Gunther
author_sort Zhang, Yan
collection PubMed
description This paper presents an application of neural networks operating on multimodal 3D data (3D point cloud, RGB, thermal) to effectively and precisely segment human hands and objects held in hand to realize a safe human–robot object handover. We discuss the problems encountered in building a multimodal sensor system, while the focus is on the calibration and alignment of a set of cameras including RGB, thermal, and NIR cameras. We propose the use of a copper–plastic chessboard calibration target with an internal active light source (near-infrared and visible light). By brief heating, the calibration target could be simultaneously and legibly captured by all cameras. Based on the multimodal dataset captured by our sensor system, PointNet, PointNet++, and RandLA-Net are utilized to verify the effectiveness of applying multimodal point cloud data for hand–object segmentation. These networks were trained on various data modes (XYZ, XYZ-T, XYZ-RGB, and XYZ-RGB-T). The experimental results show a significant improvement in the segmentation performance of XYZ-RGB-T (mean Intersection over Union: [Formula: see text] by RandLA-Net) compared with the other three modes ([Formula: see text] by XYZ-RGB, [Formula: see text] by XYZ-T, [Formula: see text] by XYZ), in which it is worth mentioning that the Intersection over Union for the single class of hand achieves [Formula: see text].
format Online
Article
Text
id pubmed-8402345
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-84023452021-08-29 Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover Zhang, Yan Müller, Steffen Stephan, Benedict Gross, Horst-Michael Notni, Gunther Sensors (Basel) Article This paper presents an application of neural networks operating on multimodal 3D data (3D point cloud, RGB, thermal) to effectively and precisely segment human hands and objects held in hand to realize a safe human–robot object handover. We discuss the problems encountered in building a multimodal sensor system, while the focus is on the calibration and alignment of a set of cameras including RGB, thermal, and NIR cameras. We propose the use of a copper–plastic chessboard calibration target with an internal active light source (near-infrared and visible light). By brief heating, the calibration target could be simultaneously and legibly captured by all cameras. Based on the multimodal dataset captured by our sensor system, PointNet, PointNet++, and RandLA-Net are utilized to verify the effectiveness of applying multimodal point cloud data for hand–object segmentation. These networks were trained on various data modes (XYZ, XYZ-T, XYZ-RGB, and XYZ-RGB-T). The experimental results show a significant improvement in the segmentation performance of XYZ-RGB-T (mean Intersection over Union: [Formula: see text] by RandLA-Net) compared with the other three modes ([Formula: see text] by XYZ-RGB, [Formula: see text] by XYZ-T, [Formula: see text] by XYZ), in which it is worth mentioning that the Intersection over Union for the single class of hand achieves [Formula: see text]. MDPI 2021-08-23 /pmc/articles/PMC8402345/ /pubmed/34451117 http://dx.doi.org/10.3390/s21165676 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zhang, Yan
Müller, Steffen
Stephan, Benedict
Gross, Horst-Michael
Notni, Gunther
Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover
title Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover
title_full Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover
title_fullStr Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover
title_full_unstemmed Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover
title_short Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover
title_sort point cloud hand–object segmentation using multimodal imaging with thermal and color data for safe robotic object handover
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8402345/
https://www.ncbi.nlm.nih.gov/pubmed/34451117
http://dx.doi.org/10.3390/s21165676
work_keys_str_mv AT zhangyan pointcloudhandobjectsegmentationusingmultimodalimagingwiththermalandcolordataforsaferoboticobjecthandover
AT mullersteffen pointcloudhandobjectsegmentationusingmultimodalimagingwiththermalandcolordataforsaferoboticobjecthandover
AT stephanbenedict pointcloudhandobjectsegmentationusingmultimodalimagingwiththermalandcolordataforsaferoboticobjecthandover
AT grosshorstmichael pointcloudhandobjectsegmentationusingmultimodalimagingwiththermalandcolordataforsaferoboticobjecthandover
AT notnigunther pointcloudhandobjectsegmentationusingmultimodalimagingwiththermalandcolordataforsaferoboticobjecthandover