A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives

In this article, we propose an architecture of a bio-inspired controller that addresses the problem of learning different locomotion gaits for different robot morphologies. The modeling objective is split into two: baseline motion modeling and dynamics adaptation. Baseline motion modeling aims to ac...

Bibliographic Details
Main Authors:	Li, Cai, Lowe, Robert, Ziemke, Tom
Format:	Online Article Text
Language:	English
Published:	Frontiers Media S.A. 2014
Subjects:	Neuroscience
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4183130/ https://www.ncbi.nlm.nih.gov/pubmed/25324773 http://dx.doi.org/10.3389/fnbot.2014.00023

author	Li, Cai Lowe, Robert Ziemke, Tom
collection	PubMed
description	In this article, we propose an architecture of a bio-inspired controller that addresses the problem of learning different locomotion gaits for different robot morphologies. The modeling objective is split into two: baseline motion modeling and dynamics adaptation. Baseline motion modeling aims to achieve fundamental functions of a certain type of locomotion and dynamics adaptation provides a “reshaping” function for adapting the baseline motion to desired motion. Based on this assumption, a three-layer architecture is developed using central pattern generators (CPGs, a bio-inspired locomotor center for the baseline motion) and dynamic motor primitives (DMPs, a model with universal “reshaping” functions). In this article, we use this architecture with the actor-critic algorithms for finding a good “reshaping” function. In order to demonstrate the learning power of the actor-critic based architecture, we tested it in two experiments: (1) learning to crawl on a humanoid and (2) learning to gallop on a puppy robot. Two types of actor-critic algorithms (policy search and policy gradient) are compared in order to evaluate the advantages and disadvantages of different actor-critic based learning algorithms for different morphologies. Finally, based on the analysis of the experimental results, a generic view/architecture for locomotion learning is discussed in the conclusion.
format	Online Article Text
id	pubmed-4183130
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-4183130 2014-10-16 Front Neurorobot Neuroscience Frontiers Media S.A. 2014-10-02 /pmc/articles/PMC4183130/ /pubmed/25324773 http://dx.doi.org/10.3389/fnbot.2014.00023 Text en Copyright © 2014 Li, Lowe and Ziemke. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title	A novel approach to locomotion learning: Actor-Critic architecture using central pattern generators and dynamic motor primitives
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4183130/ https://www.ncbi.nlm.nih.gov/pubmed/25324773 http://dx.doi.org/10.3389/fnbot.2014.00023
