Cargando…

A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives

In this article, we propose an architecture of a bio-inspired controller that addresses the problem of learning different locomotion gaits for different robot morphologies. The modeling objective is split into two: baseline motion modeling and dynamics adaptation. Baseline motion modeling aims to ac...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Cai, Lowe, Robert, Ziemke, Tom
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4183130/
https://www.ncbi.nlm.nih.gov/pubmed/25324773
http://dx.doi.org/10.3389/fnbot.2014.00023
_version_ 1782337667558539264
author Li, Cai
Lowe, Robert
Ziemke, Tom
author_facet Li, Cai
Lowe, Robert
Ziemke, Tom
author_sort Li, Cai
collection PubMed
description In this article, we propose an architecture of a bio-inspired controller that addresses the problem of learning different locomotion gaits for different robot morphologies. The modeling objective is split into two: baseline motion modeling and dynamics adaptation. Baseline motion modeling aims to achieve fundamental functions of a certain type of locomotion and dynamics adaptation provides a “reshaping” function for adapting the baseline motion to desired motion. Based on this assumption, a three-layer architecture is developed using central pattern generators (CPGs, a bio-inspired locomotor center for the baseline motion) and dynamic motor primitives (DMPs, a model with universal “reshaping” functions). In this article, we use this architecture with the actor-critic algorithms for finding a good “reshaping” function. In order to demonstrate the learning power of the actor-critic based architecture, we tested it on two experiments: (1) learning to crawl on a humanoid and, (2) learning to gallop on a puppy robot. Two types of actor-critic algorithms (policy search and policy gradient) are compared in order to evaluate the advantages and disadvantages of different actor-critic based learning algorithms for different morphologies. Finally, based on the analysis of the experimental results, a generic view/architecture for locomotion learning is discussed in the conclusion.
format Online
Article
Text
id pubmed-4183130
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-41831302014-10-16 A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives Li, Cai Lowe, Robert Ziemke, Tom Front Neurorobot Neuroscience In this article, we propose an architecture of a bio-inspired controller that addresses the problem of learning different locomotion gaits for different robot morphologies. The modeling objective is split into two: baseline motion modeling and dynamics adaptation. Baseline motion modeling aims to achieve fundamental functions of a certain type of locomotion and dynamics adaptation provides a “reshaping” function for adapting the baseline motion to desired motion. Based on this assumption, a three-layer architecture is developed using central pattern generators (CPGs, a bio-inspired locomotor center for the baseline motion) and dynamic motor primitives (DMPs, a model with universal “reshaping” functions). In this article, we use this architecture with the actor-critic algorithms for finding a good “reshaping” function. In order to demonstrate the learning power of the actor-critic based architecture, we tested it on two experiments: (1) learning to crawl on a humanoid and, (2) learning to gallop on a puppy robot. Two types of actor-critic algorithms (policy search and policy gradient) are compared in order to evaluate the advantages and disadvantages of different actor-critic based learning algorithms for different morphologies. Finally, based on the analysis of the experimental results, a generic view/architecture for locomotion learning is discussed in the conclusion. Frontiers Media S.A. 2014-10-02 /pmc/articles/PMC4183130/ /pubmed/25324773 http://dx.doi.org/10.3389/fnbot.2014.00023 Text en Copyright © 2014 Li, Lowe and Ziemke. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Li, Cai
Lowe, Robert
Ziemke, Tom
A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives
title A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives
title_full A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives
title_fullStr A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives
title_full_unstemmed A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives
title_short A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives
title_sort novel approach to locomotion learning: actor-critic architecture using central pattern generators and dynamic motor primitives
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4183130/
https://www.ncbi.nlm.nih.gov/pubmed/25324773
http://dx.doi.org/10.3389/fnbot.2014.00023
work_keys_str_mv AT licai anovelapproachtolocomotionlearningactorcriticarchitectureusingcentralpatterngeneratorsanddynamicmotorprimitives
AT lowerobert anovelapproachtolocomotionlearningactorcriticarchitectureusingcentralpatterngeneratorsanddynamicmotorprimitives
AT ziemketom anovelapproachtolocomotionlearningactorcriticarchitectureusingcentralpatterngeneratorsanddynamicmotorprimitives
AT licai novelapproachtolocomotionlearningactorcriticarchitectureusingcentralpatterngeneratorsanddynamicmotorprimitives
AT lowerobert novelapproachtolocomotionlearningactorcriticarchitectureusingcentralpatterngeneratorsanddynamicmotorprimitives
AT ziemketom novelapproachtolocomotionlearningactorcriticarchitectureusingcentralpatterngeneratorsanddynamicmotorprimitives