Cargando…

A Multitasking-Oriented Robot Arm Motion Planning Scheme Based on Deep Reinforcement Learning and Twin Synchro-Control

Humanoid robots are equipped with humanoid arms to make them more acceptable to the general public. Humanoid robots are a great challenge in robotics. The concept of digital twin technology complies with the guiding ideology of not only Industry 4.0, but also Made in China 2025. This paper proposes...

Descripción completa

Detalles Bibliográficos
Autores principales:	Liu, Chuzhao, Gao, Junyao, Bi, Yuanzhen, Shi, Xuanyang, Tian, Dingkui
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7349783/ https://www.ncbi.nlm.nih.gov/pubmed/32575907 http://dx.doi.org/10.3390/s20123515

_version_	1783557134816903168
author	Liu, Chuzhao Gao, Junyao Bi, Yuanzhen Shi, Xuanyang Tian, Dingkui
author_facet	Liu, Chuzhao Gao, Junyao Bi, Yuanzhen Shi, Xuanyang Tian, Dingkui
author_sort	Liu, Chuzhao
collection	PubMed
description	Humanoid robots are equipped with humanoid arms to make them more acceptable to the general public. Humanoid robots are a great challenge in robotics. The concept of digital twin technology complies with the guiding ideology of not only Industry 4.0, but also Made in China 2025. This paper proposes a scheme that combines deep reinforcement learning (DRL) with digital twin technology for controlling humanoid robot arms. For rapid and stable motion planning for humanoid robots, multitasking-oriented training using the twin synchro-control (TSC) scheme with DRL is proposed. For switching between tasks, the robot arm training must be quick and diverse. In this work, an approach for obtaining a priori knowledge as input to DRL is developed and verified using simulations. Two simple examples are developed in a simulation environment. We developed a data acquisition system to generate angle data efficiently and automatically. These data are used to improve the reward function of the deep deterministic policy gradient (DDPG) and quickly train the robot for a task. The approach is applied to a model of the humanoid robot BHR-6, a humanoid robot with multiple-motion mode and a sophisticated mechanical structure. Using the policies trained in the simulations, the humanoid robot can perform tasks that are not possible to train with existing methods. The training is fast and allows the robot to perform multiple tasks. Our approach utilizes human joint angle data collected by the data acquisition system to solve the problem of a sparse reward in DRL for two simple tasks. A comparison with simulation results for controllers trained using the vanilla DDPG show that the designed controller developed using the DDPG with the TSC scheme have great advantages in terms of learning stability and convergence speed.
format	Online Article Text
id	pubmed-7349783
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-73497832020-07-15 A Multitasking-Oriented Robot Arm Motion Planning Scheme Based on Deep Reinforcement Learning and Twin Synchro-Control Liu, Chuzhao Gao, Junyao Bi, Yuanzhen Shi, Xuanyang Tian, Dingkui Sensors (Basel) Article Humanoid robots are equipped with humanoid arms to make them more acceptable to the general public. Humanoid robots are a great challenge in robotics. The concept of digital twin technology complies with the guiding ideology of not only Industry 4.0, but also Made in China 2025. This paper proposes a scheme that combines deep reinforcement learning (DRL) with digital twin technology for controlling humanoid robot arms. For rapid and stable motion planning for humanoid robots, multitasking-oriented training using the twin synchro-control (TSC) scheme with DRL is proposed. For switching between tasks, the robot arm training must be quick and diverse. In this work, an approach for obtaining a priori knowledge as input to DRL is developed and verified using simulations. Two simple examples are developed in a simulation environment. We developed a data acquisition system to generate angle data efficiently and automatically. These data are used to improve the reward function of the deep deterministic policy gradient (DDPG) and quickly train the robot for a task. The approach is applied to a model of the humanoid robot BHR-6, a humanoid robot with multiple-motion mode and a sophisticated mechanical structure. Using the policies trained in the simulations, the humanoid robot can perform tasks that are not possible to train with existing methods. The training is fast and allows the robot to perform multiple tasks. Our approach utilizes human joint angle data collected by the data acquisition system to solve the problem of a sparse reward in DRL for two simple tasks. A comparison with simulation results for controllers trained using the vanilla DDPG show that the designed controller developed using the DDPG with the TSC scheme have great advantages in terms of learning stability and convergence speed. MDPI 2020-06-21 /pmc/articles/PMC7349783/ /pubmed/32575907 http://dx.doi.org/10.3390/s20123515 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Liu, Chuzhao Gao, Junyao Bi, Yuanzhen Shi, Xuanyang Tian, Dingkui A Multitasking-Oriented Robot Arm Motion Planning Scheme Based on Deep Reinforcement Learning and Twin Synchro-Control
title	A Multitasking-Oriented Robot Arm Motion Planning Scheme Based on Deep Reinforcement Learning and Twin Synchro-Control
title_full	A Multitasking-Oriented Robot Arm Motion Planning Scheme Based on Deep Reinforcement Learning and Twin Synchro-Control
title_fullStr	A Multitasking-Oriented Robot Arm Motion Planning Scheme Based on Deep Reinforcement Learning and Twin Synchro-Control
title_full_unstemmed	A Multitasking-Oriented Robot Arm Motion Planning Scheme Based on Deep Reinforcement Learning and Twin Synchro-Control
title_short	A Multitasking-Oriented Robot Arm Motion Planning Scheme Based on Deep Reinforcement Learning and Twin Synchro-Control
title_sort	multitasking-oriented robot arm motion planning scheme based on deep reinforcement learning and twin synchro-control
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7349783/ https://www.ncbi.nlm.nih.gov/pubmed/32575907 http://dx.doi.org/10.3390/s20123515
work_keys_str_mv	AT liuchuzhao amultitaskingorientedrobotarmmotionplanningschemebasedondeepreinforcementlearningandtwinsynchrocontrol AT gaojunyao amultitaskingorientedrobotarmmotionplanningschemebasedondeepreinforcementlearningandtwinsynchrocontrol AT biyuanzhen amultitaskingorientedrobotarmmotionplanningschemebasedondeepreinforcementlearningandtwinsynchrocontrol AT shixuanyang amultitaskingorientedrobotarmmotionplanningschemebasedondeepreinforcementlearningandtwinsynchrocontrol AT tiandingkui amultitaskingorientedrobotarmmotionplanningschemebasedondeepreinforcementlearningandtwinsynchrocontrol AT liuchuzhao multitaskingorientedrobotarmmotionplanningschemebasedondeepreinforcementlearningandtwinsynchrocontrol AT gaojunyao multitaskingorientedrobotarmmotionplanningschemebasedondeepreinforcementlearningandtwinsynchrocontrol AT biyuanzhen multitaskingorientedrobotarmmotionplanningschemebasedondeepreinforcementlearningandtwinsynchrocontrol AT shixuanyang multitaskingorientedrobotarmmotionplanningschemebasedondeepreinforcementlearningandtwinsynchrocontrol AT tiandingkui multitaskingorientedrobotarmmotionplanningschemebasedondeepreinforcementlearningandtwinsynchrocontrol

A Multitasking-Oriented Robot Arm Motion Planning Scheme Based on Deep Reinforcement Learning and Twin Synchro-Control

Ejemplares similares