Cargando…
Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks
Deep Reinforcement Learning techniques demonstrate advances in the domain of robotics. One of the limiting factors is a large number of interaction samples usually required for training in simulated and real-world environments. In this work, we demonstrate for a set of simulated dexterous in-hand ob...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275974/ https://www.ncbi.nlm.nih.gov/pubmed/34268337 http://dx.doi.org/10.3389/frobt.2021.538773 |
_version_ | 1783721821022978048 |
---|---|
author | Melnik, Andrew Lach, Luca Plappert, Matthias Korthals, Timo Haschke, Robert Ritter, Helge |
author_facet | Melnik, Andrew Lach, Luca Plappert, Matthias Korthals, Timo Haschke, Robert Ritter, Helge |
author_sort | Melnik, Andrew |
collection | PubMed |
description | Deep Reinforcement Learning techniques demonstrate advances in the domain of robotics. One of the limiting factors is a large number of interaction samples usually required for training in simulated and real-world environments. In this work, we demonstrate for a set of simulated dexterous in-hand object manipulation tasks that tactile information can substantially increase sample efficiency for training (by up to more than threefold). We also observe an improvement in performance (up to 46%) after adding tactile information. To examine the role of tactile-sensor parameters in these improvements, we included experiments with varied sensor-measurement accuracy (ground truth continuous values, noisy continuous values, Boolean values), and varied spatial resolution of the tactile sensors (927 sensors, 92 sensors, and 16 pooled sensor areas in the hand). To facilitate further studies and comparisons, we make these touch-sensor extensions available as a part of the OpenAI Gym Shadow-Dexterous-Hand robotics environments. |
format | Online Article Text |
id | pubmed-8275974 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-82759742021-07-14 Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks Melnik, Andrew Lach, Luca Plappert, Matthias Korthals, Timo Haschke, Robert Ritter, Helge Front Robot AI Robotics and AI Deep Reinforcement Learning techniques demonstrate advances in the domain of robotics. One of the limiting factors is a large number of interaction samples usually required for training in simulated and real-world environments. In this work, we demonstrate for a set of simulated dexterous in-hand object manipulation tasks that tactile information can substantially increase sample efficiency for training (by up to more than threefold). We also observe an improvement in performance (up to 46%) after adding tactile information. To examine the role of tactile-sensor parameters in these improvements, we included experiments with varied sensor-measurement accuracy (ground truth continuous values, noisy continuous values, Boolean values), and varied spatial resolution of the tactile sensors (927 sensors, 92 sensors, and 16 pooled sensor areas in the hand). To facilitate further studies and comparisons, we make these touch-sensor extensions available as a part of the OpenAI Gym Shadow-Dexterous-Hand robotics environments. Frontiers Media S.A. 2021-06-29 /pmc/articles/PMC8275974/ /pubmed/34268337 http://dx.doi.org/10.3389/frobt.2021.538773 Text en Copyright © 2021 Melnik, Lach, Plappert, Korthals, Haschke and Ritter. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Robotics and AI Melnik, Andrew Lach, Luca Plappert, Matthias Korthals, Timo Haschke, Robert Ritter, Helge Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks |
title | Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks |
title_full | Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks |
title_fullStr | Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks |
title_full_unstemmed | Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks |
title_short | Using Tactile Sensing to Improve the Sample Efficiency and Performance of Deep Deterministic Policy Gradients for Simulated In-Hand Manipulation Tasks |
title_sort | using tactile sensing to improve the sample efficiency and performance of deep deterministic policy gradients for simulated in-hand manipulation tasks |
topic | Robotics and AI |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8275974/ https://www.ncbi.nlm.nih.gov/pubmed/34268337 http://dx.doi.org/10.3389/frobt.2021.538773 |
work_keys_str_mv | AT melnikandrew usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks AT lachluca usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks AT plappertmatthias usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks AT korthalstimo usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks AT haschkerobert usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks AT ritterhelge usingtactilesensingtoimprovethesampleefficiencyandperformanceofdeepdeterministicpolicygradientsforsimulatedinhandmanipulationtasks |