Cargando…

Learned graphical models for probabilistic planning provide a new class of movement primitives

Biological movement generation combines three interesting aspects: its modular organization in movement primitives (MPs), its characteristics of stochastic optimality under perturbations, and its efficiency in terms of learning. A common approach to motor skill learning is to endow the primitives wi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Rückert, Elmar A., Neumann, Gerhard, Toussaint, Marc, Maass, Wolfgang
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2013
Materias:	Neuroscience
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3534186/ https://www.ncbi.nlm.nih.gov/pubmed/23293598 http://dx.doi.org/10.3389/fncom.2012.00097

_version_	1782475289534660608
author	Rückert, Elmar A. Neumann, Gerhard Toussaint, Marc Maass, Wolfgang
author_facet	Rückert, Elmar A. Neumann, Gerhard Toussaint, Marc Maass, Wolfgang
author_sort	Rückert, Elmar A.
collection	PubMed
description	Biological movement generation combines three interesting aspects: its modular organization in movement primitives (MPs), its characteristics of stochastic optimality under perturbations, and its efficiency in terms of learning. A common approach to motor skill learning is to endow the primitives with dynamical systems. Here, the parameters of the primitive indirectly define the shape of a reference trajectory. We propose an alternative MP representation based on probabilistic inference in learned graphical models with new and interesting properties that complies with salient features of biological movement control. Instead of endowing the primitives with dynamical systems, we propose to endow MPs with an intrinsic probabilistic planning system, integrating the power of stochastic optimal control (SOC) methods within a MP. The parameterization of the primitive is a graphical model that represents the dynamics and intrinsic cost function such that inference in this graphical model yields the control policy. We parameterize the intrinsic cost function using task-relevant features, such as the importance of passing through certain via-points. The system dynamics as well as intrinsic cost function parameters are learned in a reinforcement learning (RL) setting. We evaluate our approach on a complex 4-link balancing task. Our experiments show that our movement representation facilitates learning significantly and leads to better generalization to new task settings without re-learning.
format	Online Article Text
id	pubmed-3534186
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-35341862013-01-04 Learned graphical models for probabilistic planning provide a new class of movement primitives Rückert, Elmar A. Neumann, Gerhard Toussaint, Marc Maass, Wolfgang Front Comput Neurosci Neuroscience Biological movement generation combines three interesting aspects: its modular organization in movement primitives (MPs), its characteristics of stochastic optimality under perturbations, and its efficiency in terms of learning. A common approach to motor skill learning is to endow the primitives with dynamical systems. Here, the parameters of the primitive indirectly define the shape of a reference trajectory. We propose an alternative MP representation based on probabilistic inference in learned graphical models with new and interesting properties that complies with salient features of biological movement control. Instead of endowing the primitives with dynamical systems, we propose to endow MPs with an intrinsic probabilistic planning system, integrating the power of stochastic optimal control (SOC) methods within a MP. The parameterization of the primitive is a graphical model that represents the dynamics and intrinsic cost function such that inference in this graphical model yields the control policy. We parameterize the intrinsic cost function using task-relevant features, such as the importance of passing through certain via-points. The system dynamics as well as intrinsic cost function parameters are learned in a reinforcement learning (RL) setting. We evaluate our approach on a complex 4-link balancing task. Our experiments show that our movement representation facilitates learning significantly and leads to better generalization to new task settings without re-learning. Frontiers Media S.A. 2013-01-02 /pmc/articles/PMC3534186/ /pubmed/23293598 http://dx.doi.org/10.3389/fncom.2012.00097 Text en Copyright © 2013 Rückert, Neumann, Toussaint and Maass. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.
spellingShingle	Neuroscience Rückert, Elmar A. Neumann, Gerhard Toussaint, Marc Maass, Wolfgang Learned graphical models for probabilistic planning provide a new class of movement primitives
title	Learned graphical models for probabilistic planning provide a new class of movement primitives
title_full	Learned graphical models for probabilistic planning provide a new class of movement primitives
title_fullStr	Learned graphical models for probabilistic planning provide a new class of movement primitives
title_full_unstemmed	Learned graphical models for probabilistic planning provide a new class of movement primitives
title_short	Learned graphical models for probabilistic planning provide a new class of movement primitives
title_sort	learned graphical models for probabilistic planning provide a new class of movement primitives
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3534186/ https://www.ncbi.nlm.nih.gov/pubmed/23293598 http://dx.doi.org/10.3389/fncom.2012.00097
work_keys_str_mv	AT ruckertelmara learnedgraphicalmodelsforprobabilisticplanningprovideanewclassofmovementprimitives AT neumanngerhard learnedgraphicalmodelsforprobabilisticplanningprovideanewclassofmovementprimitives AT toussaintmarc learnedgraphicalmodelsforprobabilisticplanningprovideanewclassofmovementprimitives AT maasswolfgang learnedgraphicalmodelsforprobabilisticplanningprovideanewclassofmovementprimitives

Learned graphical models for probabilistic planning provide a new class of movement primitives

Ejemplares similares