Cargando…

Stability Control of a Biped Robot on a Dynamic Platform Based on Hybrid Reinforcement Learning

In this work, we introduced a novel hybrid reinforcement learning scheme to balance a biped robot (NAO) on an oscillating platform, where the rotation of the platform is considered as the external disturbance to the robot. The platform had two degrees of freedom in rotation, pitch and roll. The stat...

Descripción completa

Detalles Bibliográficos
Autores principales:	Xi, Ao, Chen, Chao
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7472320/ https://www.ncbi.nlm.nih.gov/pubmed/32785092 http://dx.doi.org/10.3390/s20164468

_version_	1783578961509351424
author	Xi, Ao Chen, Chao
author_facet	Xi, Ao Chen, Chao
author_sort	Xi, Ao
collection	PubMed
description	In this work, we introduced a novel hybrid reinforcement learning scheme to balance a biped robot (NAO) on an oscillating platform, where the rotation of the platform is considered as the external disturbance to the robot. The platform had two degrees of freedom in rotation, pitch and roll. The state space comprised the position of center of pressure, and joint angles and joint velocities of two legs. The action space consisted of the joint angles of ankles, knees, and hips. By adding the inverse kinematics techniques, the dimension of action space was significantly reduced. Then, a model-based system estimator was employed during the offline training procedure to estimate the dynamics model of the system by using novel hierarchical Gaussian processes, and to provide initial control inputs, after which the reduced action space of each joint was obtained by minimizing the cost of reaching the desired stable state. Finally, a model-free optimizer based on DQN (λ) was introduced to fine tune the initial control inputs, where the optimal control inputs were obtained for each joint at any state. The proposed reinforcement learning not only successfully avoided the distribution mismatch problem, but also improved the sample efficiency. Simulation results showed that the proposed hybrid reinforcement learning mechanism enabled the NAO robot to balance on an oscillating platform with different frequencies and magnitudes. Both control performance and robustness were guaranteed during the experiments.
format	Online Article Text
id	pubmed-7472320
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-74723202020-09-04 Stability Control of a Biped Robot on a Dynamic Platform Based on Hybrid Reinforcement Learning Xi, Ao Chen, Chao Sensors (Basel) Article In this work, we introduced a novel hybrid reinforcement learning scheme to balance a biped robot (NAO) on an oscillating platform, where the rotation of the platform is considered as the external disturbance to the robot. The platform had two degrees of freedom in rotation, pitch and roll. The state space comprised the position of center of pressure, and joint angles and joint velocities of two legs. The action space consisted of the joint angles of ankles, knees, and hips. By adding the inverse kinematics techniques, the dimension of action space was significantly reduced. Then, a model-based system estimator was employed during the offline training procedure to estimate the dynamics model of the system by using novel hierarchical Gaussian processes, and to provide initial control inputs, after which the reduced action space of each joint was obtained by minimizing the cost of reaching the desired stable state. Finally, a model-free optimizer based on DQN (λ) was introduced to fine tune the initial control inputs, where the optimal control inputs were obtained for each joint at any state. The proposed reinforcement learning not only successfully avoided the distribution mismatch problem, but also improved the sample efficiency. Simulation results showed that the proposed hybrid reinforcement learning mechanism enabled the NAO robot to balance on an oscillating platform with different frequencies and magnitudes. Both control performance and robustness were guaranteed during the experiments. MDPI 2020-08-10 /pmc/articles/PMC7472320/ /pubmed/32785092 http://dx.doi.org/10.3390/s20164468 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Xi, Ao Chen, Chao Stability Control of a Biped Robot on a Dynamic Platform Based on Hybrid Reinforcement Learning
title	Stability Control of a Biped Robot on a Dynamic Platform Based on Hybrid Reinforcement Learning
title_full	Stability Control of a Biped Robot on a Dynamic Platform Based on Hybrid Reinforcement Learning
title_fullStr	Stability Control of a Biped Robot on a Dynamic Platform Based on Hybrid Reinforcement Learning
title_full_unstemmed	Stability Control of a Biped Robot on a Dynamic Platform Based on Hybrid Reinforcement Learning
title_short	Stability Control of a Biped Robot on a Dynamic Platform Based on Hybrid Reinforcement Learning
title_sort	stability control of a biped robot on a dynamic platform based on hybrid reinforcement learning
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7472320/ https://www.ncbi.nlm.nih.gov/pubmed/32785092 http://dx.doi.org/10.3390/s20164468
work_keys_str_mv	AT xiao stabilitycontrolofabipedrobotonadynamicplatformbasedonhybridreinforcementlearning AT chenchao stabilitycontrolofabipedrobotonadynamicplatformbasedonhybridreinforcementlearning

Stability Control of a Biped Robot on a Dynamic Platform Based on Hybrid Reinforcement Learning

Ejemplares similares