Cargando…
Bootstrapping of Parameterized Skills Through Hybrid Optimization in Task and Policy Spaces
Modern robotic applications create high demands on adaptation of actions with respect to variance in a given task. Reinforcement learning is able to optimize for these changing conditions, but relearning from scratch is hardly feasible due to the high number of required rollouts. We propose a parame...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7805929/ https://www.ncbi.nlm.nih.gov/pubmed/33500934 http://dx.doi.org/10.3389/frobt.2018.00049 |
_version_ | 1783636414816059392 |
---|---|
author | Queißer, Jeffrey F. Steil, Jochen J. |
author_facet | Queißer, Jeffrey F. Steil, Jochen J. |
author_sort | Queißer, Jeffrey F. |
collection | PubMed |
description | Modern robotic applications create high demands on adaptation of actions with respect to variance in a given task. Reinforcement learning is able to optimize for these changing conditions, but relearning from scratch is hardly feasible due to the high number of required rollouts. We propose a parameterized skill that generalizes to new actions for changing task parameters, which is encoded as a meta-learner that provides parameters for task-specific dynamic motion primitives. Our work shows that utilizing parameterized skills for initialization of the optimization process leads to a more effective incremental task learning. In addition, we introduce a hybrid optimization method that combines a fast coarse optimization on a manifold of policy parameters with a fine grained parameter search in the unrestricted space of actions. The proposed algorithm reduces the number of required rollouts for adaptation to new task conditions. Application in illustrative toy scenarios, for a 10-DOF planar arm, and a humanoid robot point reaching task validate the approach. |
format | Online Article Text |
id | pubmed-7805929 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-78059292021-01-25 Bootstrapping of Parameterized Skills Through Hybrid Optimization in Task and Policy Spaces Queißer, Jeffrey F. Steil, Jochen J. Front Robot AI Robotics and AI Modern robotic applications create high demands on adaptation of actions with respect to variance in a given task. Reinforcement learning is able to optimize for these changing conditions, but relearning from scratch is hardly feasible due to the high number of required rollouts. We propose a parameterized skill that generalizes to new actions for changing task parameters, which is encoded as a meta-learner that provides parameters for task-specific dynamic motion primitives. Our work shows that utilizing parameterized skills for initialization of the optimization process leads to a more effective incremental task learning. In addition, we introduce a hybrid optimization method that combines a fast coarse optimization on a manifold of policy parameters with a fine grained parameter search in the unrestricted space of actions. The proposed algorithm reduces the number of required rollouts for adaptation to new task conditions. Application in illustrative toy scenarios, for a 10-DOF planar arm, and a humanoid robot point reaching task validate the approach. Frontiers Media S.A. 2018-06-08 /pmc/articles/PMC7805929/ /pubmed/33500934 http://dx.doi.org/10.3389/frobt.2018.00049 Text en Copyright © 2018 Queißer and Steil http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Robotics and AI Queißer, Jeffrey F. Steil, Jochen J. Bootstrapping of Parameterized Skills Through Hybrid Optimization in Task and Policy Spaces |
title | Bootstrapping of Parameterized Skills Through Hybrid Optimization in Task and Policy Spaces |
title_full | Bootstrapping of Parameterized Skills Through Hybrid Optimization in Task and Policy Spaces |
title_fullStr | Bootstrapping of Parameterized Skills Through Hybrid Optimization in Task and Policy Spaces |
title_full_unstemmed | Bootstrapping of Parameterized Skills Through Hybrid Optimization in Task and Policy Spaces |
title_short | Bootstrapping of Parameterized Skills Through Hybrid Optimization in Task and Policy Spaces |
title_sort | bootstrapping of parameterized skills through hybrid optimization in task and policy spaces |
topic | Robotics and AI |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7805929/ https://www.ncbi.nlm.nih.gov/pubmed/33500934 http://dx.doi.org/10.3389/frobt.2018.00049 |
work_keys_str_mv | AT queißerjeffreyf bootstrappingofparameterizedskillsthroughhybridoptimizationintaskandpolicyspaces AT steiljochenj bootstrappingofparameterizedskillsthroughhybridoptimizationintaskandpolicyspaces |