Cargando…
Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover
This paper presents an application of neural networks operating on multimodal 3D data (3D point cloud, RGB, thermal) to effectively and precisely segment human hands and objects held in hand to realize a safe human–robot object handover. We discuss the problems encountered in building a multimodal s...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8402345/ https://www.ncbi.nlm.nih.gov/pubmed/34451117 http://dx.doi.org/10.3390/s21165676 |
_version_ | 1783745767505133568 |
---|---|
author | Zhang, Yan Müller, Steffen Stephan, Benedict Gross, Horst-Michael Notni, Gunther |
author_facet | Zhang, Yan Müller, Steffen Stephan, Benedict Gross, Horst-Michael Notni, Gunther |
author_sort | Zhang, Yan |
collection | PubMed |
description | This paper presents an application of neural networks operating on multimodal 3D data (3D point cloud, RGB, thermal) to effectively and precisely segment human hands and objects held in hand to realize a safe human–robot object handover. We discuss the problems encountered in building a multimodal sensor system, while the focus is on the calibration and alignment of a set of cameras including RGB, thermal, and NIR cameras. We propose the use of a copper–plastic chessboard calibration target with an internal active light source (near-infrared and visible light). By brief heating, the calibration target could be simultaneously and legibly captured by all cameras. Based on the multimodal dataset captured by our sensor system, PointNet, PointNet++, and RandLA-Net are utilized to verify the effectiveness of applying multimodal point cloud data for hand–object segmentation. These networks were trained on various data modes (XYZ, XYZ-T, XYZ-RGB, and XYZ-RGB-T). The experimental results show a significant improvement in the segmentation performance of XYZ-RGB-T (mean Intersection over Union: [Formula: see text] by RandLA-Net) compared with the other three modes ([Formula: see text] by XYZ-RGB, [Formula: see text] by XYZ-T, [Formula: see text] by XYZ), in which it is worth mentioning that the Intersection over Union for the single class of hand achieves [Formula: see text]. |
format | Online Article Text |
id | pubmed-8402345 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-84023452021-08-29 Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover Zhang, Yan Müller, Steffen Stephan, Benedict Gross, Horst-Michael Notni, Gunther Sensors (Basel) Article This paper presents an application of neural networks operating on multimodal 3D data (3D point cloud, RGB, thermal) to effectively and precisely segment human hands and objects held in hand to realize a safe human–robot object handover. We discuss the problems encountered in building a multimodal sensor system, while the focus is on the calibration and alignment of a set of cameras including RGB, thermal, and NIR cameras. We propose the use of a copper–plastic chessboard calibration target with an internal active light source (near-infrared and visible light). By brief heating, the calibration target could be simultaneously and legibly captured by all cameras. Based on the multimodal dataset captured by our sensor system, PointNet, PointNet++, and RandLA-Net are utilized to verify the effectiveness of applying multimodal point cloud data for hand–object segmentation. These networks were trained on various data modes (XYZ, XYZ-T, XYZ-RGB, and XYZ-RGB-T). The experimental results show a significant improvement in the segmentation performance of XYZ-RGB-T (mean Intersection over Union: [Formula: see text] by RandLA-Net) compared with the other three modes ([Formula: see text] by XYZ-RGB, [Formula: see text] by XYZ-T, [Formula: see text] by XYZ), in which it is worth mentioning that the Intersection over Union for the single class of hand achieves [Formula: see text]. MDPI 2021-08-23 /pmc/articles/PMC8402345/ /pubmed/34451117 http://dx.doi.org/10.3390/s21165676 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Zhang, Yan Müller, Steffen Stephan, Benedict Gross, Horst-Michael Notni, Gunther Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover |
title | Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover |
title_full | Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover |
title_fullStr | Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover |
title_full_unstemmed | Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover |
title_short | Point Cloud Hand–Object Segmentation Using Multimodal Imaging with Thermal and Color Data for Safe Robotic Object Handover |
title_sort | point cloud hand–object segmentation using multimodal imaging with thermal and color data for safe robotic object handover |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8402345/ https://www.ncbi.nlm.nih.gov/pubmed/34451117 http://dx.doi.org/10.3390/s21165676 |
work_keys_str_mv | AT zhangyan pointcloudhandobjectsegmentationusingmultimodalimagingwiththermalandcolordataforsaferoboticobjecthandover AT mullersteffen pointcloudhandobjectsegmentationusingmultimodalimagingwiththermalandcolordataforsaferoboticobjecthandover AT stephanbenedict pointcloudhandobjectsegmentationusingmultimodalimagingwiththermalandcolordataforsaferoboticobjecthandover AT grosshorstmichael pointcloudhandobjectsegmentationusingmultimodalimagingwiththermalandcolordataforsaferoboticobjecthandover AT notnigunther pointcloudhandobjectsegmentationusingmultimodalimagingwiththermalandcolordataforsaferoboticobjecthandover |