Cargando…

Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks

Deep Reinforcement Learning techniques demonstrate advances in the domain of robotics. One of the limiting factors is a large number of interaction samples usually required for training in simulated and real-world environments. In this work, we demonstrate for a set of simulated dexterous in-hand ob...

Descripción completa

Detalles Bibliográficos
Autores principales:	Melnik, Andrew, Lach, Luca, Plappert, Matthias, Korthals, Timo, Haschke, Robert, Ritter, Helge
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2021
Materias:	Robotics and AI
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275974/ https://www.ncbi.nlm.nih.gov/pubmed/34268337 http://dx.doi.org/10.3389/frobt.2021.538773

_version_	1783721821022978048
author	Melnik, Andrew Lach, Luca Plappert, Matthias Korthals, Timo Haschke, Robert Ritter, Helge
author_facet	Melnik, Andrew Lach, Luca Plappert, Matthias Korthals, Timo Haschke, Robert Ritter, Helge
author_sort	Melnik, Andrew
collection	PubMed
description	Deep Reinforcement Learning techniques demonstrate advances in the domain of robotics. One of the limiting factors is a large number of interaction samples usually required for training in simulated and real-world environments. In this work, we demonstrate for a set of simulated dexterous in-hand object manipulation tasks that tactile information can substantially increase sample efficiency for training (by up to more than threefold). We also observe an improvement in performance (up to 46%) after adding tactile information. To examine the role of tactile-sensor parameters in these improvements, we included experiments with varied sensor-measurement accuracy (ground truth continuous values, noisy continuous values, Boolean values), and varied spatial resolution of the tactile sensors (927 sensors, 92 sensors, and 16 pooled sensor areas in the hand). To facilitate further studies and comparisons, we make these touch-sensor extensions available as a part of the OpenAI Gym Shadow-Dexterous-Hand robotics environments.
format	Online Article Text
id	pubmed-8275974
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-82759742021-07-14 Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks Melnik, Andrew Lach, Luca Plappert, Matthias Korthals, Timo Haschke, Robert Ritter, Helge Front Robot AI Robotics and AI Deep Reinforcement Learning techniques demonstrate advances in the domain of robotics. One of the limiting factors is a large number of interaction samples usually required for training in simulated and real-world environments. In this work, we demonstrate for a set of simulated dexterous in-hand object manipulation tasks that tactile information can substantially increase sample efficiency for training (by up to more than threefold). We also observe an improvement in performance (up to 46%) after adding tactile information. To examine the role of tactile-sensor parameters in these improvements, we included experiments with varied sensor-measurement accuracy (ground truth continuous values, noisy continuous values, Boolean values), and varied spatial resolution of the tactile sensors (927 sensors, 92 sensors, and 16 pooled sensor areas in the hand). To facilitate further studies and comparisons, we make these touch-sensor extensions available as a part of the OpenAI Gym Shadow-Dexterous-Hand robotics environments. Frontiers Media S.A. 2021-06-29 /pmc/articles/PMC8275974/ /pubmed/34268337 http://dx.doi.org/10.3389/frobt.2021.538773 Text en Copyright © 2021 Melnik, Lach, Plappert, Korthals, Haschke and Ritter. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Robotics and AI Melnik, Andrew Lach, Luca Plappert, Matthias Korthals, Timo Haschke, Robert Ritter, Helge Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks
title	Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks
title_full	Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks
title_fullStr	Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks
title_full_unstemmed	Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks
title_short	Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks
title_sort	using tactile sensing to improve the sample efficiency and performance of deep deterministic policy gradients for simulated in-hand manipulation tasks
topic	Robotics and AI
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275974/ https://www.ncbi.nlm.nih.gov/pubmed/34268337 http://dx.doi.org/10.3389/frobt.2021.538773
work_keys_str_mv	AT melnikandrew usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks AT lachluca usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks AT plappertmatthias usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks AT korthalstimo usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks AT haschkerobert usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks AT ritterhelge usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks

Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks

Ejemplares similares