Adaptive Quadruped Balance Control for Dynamic Environments Using Maximum-Entropy Reinforcement Learning
External disturbance poses the primary threat to robot balance in dynamic environments. This paper presents a learning-based control architecture for quadrupedal self-balancing that adapts to multiple unpredictable scenarios of continuous external disturbance. Unlike conventional methods...
Main Authors: | Sun, Haoran; Fu, Tingting; Ling, Yuanhuai; He, Chaoming |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2021 |
Subjects: | Article |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434611/ https://www.ncbi.nlm.nih.gov/pubmed/34502796 http://dx.doi.org/10.3390/s21175907 |
_version_ | 1783751641017614336 |
---|---|
author | Sun, Haoran Fu, Tingting Ling, Yuanhuai He, Chaoming |
author_facet | Sun, Haoran Fu, Tingting Ling, Yuanhuai He, Chaoming |
author_sort | Sun, Haoran |
collection | PubMed |
description | External disturbance poses the primary threat to robot balance in dynamic environments. This paper presents a learning-based control architecture for quadrupedal self-balancing that adapts to multiple unpredictable scenarios of continuous external disturbance. Unlike conventional methods, which construct analytical models that explicitly reason about the balancing process, our work uses reinforcement learning and an artificial neural network to avoid intractable mathematical modeling. The control policy is composed of a neural network with a tanh-Gaussian action distribution, which implicitly establishes a fuzzy mapping from proprioceptive signals to action commands. During training, a maximum-entropy method (the soft actor-critic algorithm) is employed to endow the policy with strong exploration and generalization ability. The trained policy is validated in both simulation and real-world experiments on a customized quadruped robot. The results demonstrate that the policy can be easily transferred to the real world without elaborate configuration. Moreover, although the policy is trained under only one specific vibration condition, it remains robust under conditions never encountered during training. |
format | Online Article Text |
id | pubmed-8434611 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-8434611 2021-09-12 Adaptive Quadruped Balance Control for Dynamic Environments Using Maximum-Entropy Reinforcement Learning Sun, Haoran Fu, Tingting Ling, Yuanhuai He, Chaoming Sensors (Basel) Article External disturbance poses the primary threat to robot balance in dynamic environments. This paper presents a learning-based control architecture for quadrupedal self-balancing that adapts to multiple unpredictable scenarios of continuous external disturbance. Unlike conventional methods, which construct analytical models that explicitly reason about the balancing process, our work uses reinforcement learning and an artificial neural network to avoid intractable mathematical modeling. The control policy is composed of a neural network with a tanh-Gaussian action distribution, which implicitly establishes a fuzzy mapping from proprioceptive signals to action commands. During training, a maximum-entropy method (the soft actor-critic algorithm) is employed to endow the policy with strong exploration and generalization ability. The trained policy is validated in both simulation and real-world experiments on a customized quadruped robot. The results demonstrate that the policy can be easily transferred to the real world without elaborate configuration. Moreover, although the policy is trained under only one specific vibration condition, it remains robust under conditions never encountered during training. MDPI 2021-09-02 /pmc/articles/PMC8434611/ /pubmed/34502796 http://dx.doi.org/10.3390/s21175907 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/ Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Sun, Haoran Fu, Tingting Ling, Yuanhuai He, Chaoming Adaptive Quadruped Balance Control for Dynamic Environments Using Maximum-Entropy Reinforcement Learning |
title | Adaptive Quadruped Balance Control for Dynamic Environments Using Maximum-Entropy Reinforcement Learning |
title_full | Adaptive Quadruped Balance Control for Dynamic Environments Using Maximum-Entropy Reinforcement Learning |
title_fullStr | Adaptive Quadruped Balance Control for Dynamic Environments Using Maximum-Entropy Reinforcement Learning |
title_full_unstemmed | Adaptive Quadruped Balance Control for Dynamic Environments Using Maximum-Entropy Reinforcement Learning |
title_short | Adaptive Quadruped Balance Control for Dynamic Environments Using Maximum-Entropy Reinforcement Learning |
title_sort | adaptive quadruped balance control for dynamic environments using maximum-entropy reinforcement learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434611/ https://www.ncbi.nlm.nih.gov/pubmed/34502796 http://dx.doi.org/10.3390/s21175907 |
work_keys_str_mv | AT sunhaoran adaptivequadrupedbalancecontrolfordynamicenvironmentsusingmaximumentropyreinforcementlearning AT futingting adaptivequadrupedbalancecontrolfordynamicenvironmentsusingmaximumentropyreinforcementlearning AT lingyuanhuai adaptivequadrupedbalancecontrolfordynamicenvironmentsusingmaximumentropyreinforcementlearning AT hechaoming adaptivequadrupedbalancecontrolfordynamicenvironmentsusingmaximumentropyreinforcementlearning |
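The description above outlines a control policy built from a neural network with a tanh-Gaussian action distribution, trained with the soft actor-critic algorithm. Below is a minimal sketch of such a tanh-Gaussian policy head, assuming PyTorch; the layer sizes, class name, and observation/action dimensions are illustrative assumptions and are not taken from the paper.

```python
# Illustrative sketch of a tanh-Gaussian policy head of the kind described in
# the abstract (proprioceptive observation in, bounded action command out).
# Architecture details are assumptions, not the authors' implementation.
import torch
import torch.nn as nn

class TanhGaussianPolicy(nn.Module):
    def __init__(self, obs_dim: int, act_dim: int, hidden: int = 256):
        super().__init__()
        # Shared trunk mapping proprioceptive signals to a latent feature.
        self.trunk = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        # Heads for the mean and log-std of a diagonal Gaussian.
        self.mu = nn.Linear(hidden, act_dim)
        self.log_std = nn.Linear(hidden, act_dim)

    def forward(self, obs: torch.Tensor):
        h = self.trunk(obs)
        mu = self.mu(h)
        log_std = self.log_std(h).clamp(-20.0, 2.0)  # keep std in a sane range
        std = log_std.exp()
        # Reparameterized sample, squashed with tanh to bound the action.
        dist = torch.distributions.Normal(mu, std)
        pre_tanh = dist.rsample()
        action = torch.tanh(pre_tanh)
        # Log-probability with the tanh change-of-variables correction used by SAC.
        log_prob = dist.log_prob(pre_tanh) - torch.log(1.0 - action.pow(2) + 1e-6)
        return action, log_prob.sum(dim=-1)

# Example: sample a bounded action for a single observation vector.
policy = TanhGaussianPolicy(obs_dim=24, act_dim=12)
action, log_prob = policy(torch.randn(1, 24))
```

In a full soft actor-critic setup, the summed log-probability returned here would feed the entropy term of the actor loss, which is what gives the maximum-entropy method its exploration behavior.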