Cargando…

Peg-in-hole assembly skill imitation learning method based on ProMPs under task geometric representation

INTRODUCTION: Behavioral Cloning (BC) is a common imitation learning method which utilizes neural networks to approximate the demonstration action samples for task manipulation skill learning. However, in the real world, the demonstration trajectories from human are often sparse and imperfect, which...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zang, Yajing, Wang, Pengfei, Zha, Fusheng, Guo, Wei, Zheng, Chao, Sun, Lining
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2023
Materias:	Neuroscience
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10666750/ https://www.ncbi.nlm.nih.gov/pubmed/38023454 http://dx.doi.org/10.3389/fnbot.2023.1320251

_version_	1785139109683003392
author	Zang, Yajing Wang, Pengfei Zha, Fusheng Guo, Wei Zheng, Chao Sun, Lining
author_facet	Zang, Yajing Wang, Pengfei Zha, Fusheng Guo, Wei Zheng, Chao Sun, Lining
author_sort	Zang, Yajing
collection	PubMed
description	INTRODUCTION: Behavioral Cloning (BC) is a common imitation learning method which utilizes neural networks to approximate the demonstration action samples for task manipulation skill learning. However, in the real world, the demonstration trajectories from human are often sparse and imperfect, which makes it challenging to comprehensively learn directly from the demonstration action samples. Therefore, in this paper, we proposes a streamlined imitation learning method under the terse geometric representation to take good advantage of the demonstration data, and then realize the manipulation skill learning of assembly tasks. METHODS: We map the demonstration trajectories into the geometric feature space. Then we align the demonstration trajectories by Dynamic Time Warping (DTW) method to get the unified data sequence so we can segment them into several time stages. The Probability Movement Primitives (ProMPs) of the demonstration trajectories are then extracted, so we can generate a lot of task trajectories to be the global strategy action samples for training the neural networks. Notalby, we regard the current state of the assembly task as the via point of the ProMPs model to get the generated trajectories, while the time point of the via point is calculated according to the probability model of the different time stages. And we get the action of the current state according to the target position of the next time state. Finally, we train the neural network to obtain the global assembly strategy by Behavioral Cloning. RESULTS: We applied the proposed method to the peg-in-hole assembly task in the simulation environment based on Pybullet + Gym to test its task skill learning performance. And the learned assembly strategy was also executed on a real robotic platform to verify the feasibility of the method further. DISCUSSION: According to the result of the experiment, the proposed method achieves higher success rates compared to traditional imitation learning methods while exhibiting reasonable generalization capabilities. It shows that the ProMPs under geometric representation can help the BC method make better use of the demonstration trajectory and thus better learn the task skills.
format	Online Article Text
id	pubmed-10666750
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-106667502023-01-01 Peg-in-hole assembly skill imitation learning method based on ProMPs under task geometric representation Zang, Yajing Wang, Pengfei Zha, Fusheng Guo, Wei Zheng, Chao Sun, Lining Front Neurorobot Neuroscience INTRODUCTION: Behavioral Cloning (BC) is a common imitation learning method which utilizes neural networks to approximate the demonstration action samples for task manipulation skill learning. However, in the real world, the demonstration trajectories from human are often sparse and imperfect, which makes it challenging to comprehensively learn directly from the demonstration action samples. Therefore, in this paper, we proposes a streamlined imitation learning method under the terse geometric representation to take good advantage of the demonstration data, and then realize the manipulation skill learning of assembly tasks. METHODS: We map the demonstration trajectories into the geometric feature space. Then we align the demonstration trajectories by Dynamic Time Warping (DTW) method to get the unified data sequence so we can segment them into several time stages. The Probability Movement Primitives (ProMPs) of the demonstration trajectories are then extracted, so we can generate a lot of task trajectories to be the global strategy action samples for training the neural networks. Notalby, we regard the current state of the assembly task as the via point of the ProMPs model to get the generated trajectories, while the time point of the via point is calculated according to the probability model of the different time stages. And we get the action of the current state according to the target position of the next time state. Finally, we train the neural network to obtain the global assembly strategy by Behavioral Cloning. RESULTS: We applied the proposed method to the peg-in-hole assembly task in the simulation environment based on Pybullet + Gym to test its task skill learning performance. And the learned assembly strategy was also executed on a real robotic platform to verify the feasibility of the method further. DISCUSSION: According to the result of the experiment, the proposed method achieves higher success rates compared to traditional imitation learning methods while exhibiting reasonable generalization capabilities. It shows that the ProMPs under geometric representation can help the BC method make better use of the demonstration trajectory and thus better learn the task skills. Frontiers Media S.A. 2023-11-09 /pmc/articles/PMC10666750/ /pubmed/38023454 http://dx.doi.org/10.3389/fnbot.2023.1320251 Text en Copyright © 2023 Zang, Wang, Zha, Guo, Zheng and Sun. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Neuroscience Zang, Yajing Wang, Pengfei Zha, Fusheng Guo, Wei Zheng, Chao Sun, Lining Peg-in-hole assembly skill imitation learning method based on ProMPs under task geometric representation
title	Peg-in-hole assembly skill imitation learning method based on ProMPs under task geometric representation
title_full	Peg-in-hole assembly skill imitation learning method based on ProMPs under task geometric representation
title_fullStr	Peg-in-hole assembly skill imitation learning method based on ProMPs under task geometric representation
title_full_unstemmed	Peg-in-hole assembly skill imitation learning method based on ProMPs under task geometric representation
title_short	Peg-in-hole assembly skill imitation learning method based on ProMPs under task geometric representation
title_sort	peg-in-hole assembly skill imitation learning method based on promps under task geometric representation
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10666750/ https://www.ncbi.nlm.nih.gov/pubmed/38023454 http://dx.doi.org/10.3389/fnbot.2023.1320251
work_keys_str_mv	AT zangyajing peginholeassemblyskillimitationlearningmethodbasedonprompsundertaskgeometricrepresentation AT wangpengfei peginholeassemblyskillimitationlearningmethodbasedonprompsundertaskgeometricrepresentation AT zhafusheng peginholeassemblyskillimitationlearningmethodbasedonprompsundertaskgeometricrepresentation AT guowei peginholeassemblyskillimitationlearningmethodbasedonprompsundertaskgeometricrepresentation AT zhengchao peginholeassemblyskillimitationlearningmethodbasedonprompsundertaskgeometricrepresentation AT sunlining peginholeassemblyskillimitationlearningmethodbasedonprompsundertaskgeometricrepresentation

Peg-in-hole assembly skill imitation learning method based on ProMPs under task geometric representation

Ejemplares similares