Cargando…
A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives
In this article, we propose an architecture of a bio-inspired controller that addresses the problem of learning different locomotion gaits for different robot morphologies. The modeling objective is split into two: baseline motion modeling and dynamics adaptation. Baseline motion modeling aims to ac...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4183130/ https://www.ncbi.nlm.nih.gov/pubmed/25324773 http://dx.doi.org/10.3389/fnbot.2014.00023 |
_version_ | 1782337667558539264 |
---|---|
author | Li, Cai Lowe, Robert Ziemke, Tom |
author_facet | Li, Cai Lowe, Robert Ziemke, Tom |
author_sort | Li, Cai |
collection | PubMed |
description | In this article, we propose an architecture of a bio-inspired controller that addresses the problem of learning different locomotion gaits for different robot morphologies. The modeling objective is split into two: baseline motion modeling and dynamics adaptation. Baseline motion modeling aims to achieve fundamental functions of a certain type of locomotion and dynamics adaptation provides a “reshaping” function for adapting the baseline motion to desired motion. Based on this assumption, a three-layer architecture is developed using central pattern generators (CPGs, a bio-inspired locomotor center for the baseline motion) and dynamic motor primitives (DMPs, a model with universal “reshaping” functions). In this article, we use this architecture with the actor-critic algorithms for finding a good “reshaping” function. In order to demonstrate the learning power of the actor-critic based architecture, we tested it on two experiments: (1) learning to crawl on a humanoid and, (2) learning to gallop on a puppy robot. Two types of actor-critic algorithms (policy search and policy gradient) are compared in order to evaluate the advantages and disadvantages of different actor-critic based learning algorithms for different morphologies. Finally, based on the analysis of the experimental results, a generic view/architecture for locomotion learning is discussed in the conclusion. |
format | Online Article Text |
id | pubmed-4183130 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-41831302014-10-16 A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives Li, Cai Lowe, Robert Ziemke, Tom Front Neurorobot Neuroscience In this article, we propose an architecture of a bio-inspired controller that addresses the problem of learning different locomotion gaits for different robot morphologies. The modeling objective is split into two: baseline motion modeling and dynamics adaptation. Baseline motion modeling aims to achieve fundamental functions of a certain type of locomotion and dynamics adaptation provides a “reshaping” function for adapting the baseline motion to desired motion. Based on this assumption, a three-layer architecture is developed using central pattern generators (CPGs, a bio-inspired locomotor center for the baseline motion) and dynamic motor primitives (DMPs, a model with universal “reshaping” functions). In this article, we use this architecture with the actor-critic algorithms for finding a good “reshaping” function. In order to demonstrate the learning power of the actor-critic based architecture, we tested it on two experiments: (1) learning to crawl on a humanoid and, (2) learning to gallop on a puppy robot. Two types of actor-critic algorithms (policy search and policy gradient) are compared in order to evaluate the advantages and disadvantages of different actor-critic based learning algorithms for different morphologies. Finally, based on the analysis of the experimental results, a generic view/architecture for locomotion learning is discussed in the conclusion. Frontiers Media S.A. 2014-10-02 /pmc/articles/PMC4183130/ /pubmed/25324773 http://dx.doi.org/10.3389/fnbot.2014.00023 Text en Copyright © 2014 Li, Lowe and Ziemke. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Li, Cai Lowe, Robert Ziemke, Tom A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives |
title | A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives |
title_full | A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives |
title_fullStr | A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives |
title_full_unstemmed | A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives |
title_short | A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives |
title_sort | novel approach to locomotion learning: actor-critic architecture using central pattern generators and dynamic motor primitives |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4183130/ https://www.ncbi.nlm.nih.gov/pubmed/25324773 http://dx.doi.org/10.3389/fnbot.2014.00023 |
work_keys_str_mv | AT licai anovelapproachtolocomotionlearningactorcriticarchitectureusingcentralpatterngeneratorsanddynamicmotorprimitives AT lowerobert anovelapproachtolocomotionlearningactorcriticarchitectureusingcentralpatterngeneratorsanddynamicmotorprimitives AT ziemketom anovelapproachtolocomotionlearningactorcriticarchitectureusingcentralpatterngeneratorsanddynamicmotorprimitives AT licai novelapproachtolocomotionlearningactorcriticarchitectureusingcentralpatterngeneratorsanddynamicmotorprimitives AT lowerobert novelapproachtolocomotionlearningactorcriticarchitectureusingcentralpatterngeneratorsanddynamicmotorprimitives AT ziemketom novelapproachtolocomotionlearningactorcriticarchitectureusingcentralpatterngeneratorsanddynamicmotorprimitives |