Cargando…

Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks

Deep Reinforcement Learning techniques demonstrate advances in the domain of robotics. One of the limiting factors is a large number of interaction samples usually required for training in simulated and real-world environments. In this work, we demonstrate for a set of simulated dexterous in-hand ob...

Descripción completa

Detalles Bibliográficos
Autores principales: Melnik, Andrew, Lach, Luca, Plappert, Matthias, Korthals, Timo, Haschke, Robert, Ritter, Helge
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275974/
https://www.ncbi.nlm.nih.gov/pubmed/34268337
http://dx.doi.org/10.3389/frobt.2021.538773
_version_ 1783721821022978048
author Melnik, Andrew
Lach, Luca
Plappert, Matthias
Korthals, Timo
Haschke, Robert
Ritter, Helge
author_facet Melnik, Andrew
Lach, Luca
Plappert, Matthias
Korthals, Timo
Haschke, Robert
Ritter, Helge
author_sort Melnik, Andrew
collection PubMed
description Deep Reinforcement Learning techniques demonstrate advances in the domain of robotics. One of the limiting factors is a large number of interaction samples usually required for training in simulated and real-world environments. In this work, we demonstrate for a set of simulated dexterous in-hand object manipulation tasks that tactile information can substantially increase sample efficiency for training (by up to more than threefold). We also observe an improvement in performance (up to 46%) after adding tactile information. To examine the role of tactile-sensor parameters in these improvements, we included experiments with varied sensor-measurement accuracy (ground truth continuous values, noisy continuous values, Boolean values), and varied spatial resolution of the tactile sensors (927 sensors, 92 sensors, and 16 pooled sensor areas in the hand). To facilitate further studies and comparisons, we make these touch-sensor extensions available as a part of the OpenAI Gym Shadow-Dexterous-Hand robotics environments.
format Online
Article
Text
id pubmed-8275974
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-82759742021-07-14 Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks Melnik, Andrew Lach, Luca Plappert, Matthias Korthals, Timo Haschke, Robert Ritter, Helge Front Robot AI Robotics and AI Deep Reinforcement Learning techniques demonstrate advances in the domain of robotics. One of the limiting factors is a large number of interaction samples usually required for training in simulated and real-world environments. In this work, we demonstrate for a set of simulated dexterous in-hand object manipulation tasks that tactile information can substantially increase sample efficiency for training (by up to more than threefold). We also observe an improvement in performance (up to 46%) after adding tactile information. To examine the role of tactile-sensor parameters in these improvements, we included experiments with varied sensor-measurement accuracy (ground truth continuous values, noisy continuous values, Boolean values), and varied spatial resolution of the tactile sensors (927 sensors, 92 sensors, and 16 pooled sensor areas in the hand). To facilitate further studies and comparisons, we make these touch-sensor extensions available as a part of the OpenAI Gym Shadow-Dexterous-Hand robotics environments. Frontiers Media S.A. 2021-06-29 /pmc/articles/PMC8275974/ /pubmed/34268337 http://dx.doi.org/10.3389/frobt.2021.538773 Text en Copyright © 2021 Melnik, Lach, Plappert, Korthals, Haschke and Ritter. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Robotics and AI
Melnik, Andrew
Lach, Luca
Plappert, Matthias
Korthals, Timo
Haschke, Robert
Ritter, Helge
Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks
title Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks
title_full Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks
title_fullStr Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks
title_full_unstemmed Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks
title_short Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks
title_sort using tactile sensing to improve the sample efficiency and performance of deep deterministic policy gradients for simulated in-hand manipulation tasks
topic Robotics and AI
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275974/
https://www.ncbi.nlm.nih.gov/pubmed/34268337
http://dx.doi.org/10.3389/frobt.2021.538773
work_keys_str_mv AT melnikandrew usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks
AT lachluca usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks
AT plappertmatthias usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks
AT korthalstimo usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks
AT haschkerobert usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks
AT ritterhelge usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks