Cargando…

Stage-Wise Learning of Reaching Using Little Prior Knowledge

In some manipulation robotics environments, because of the difficulty of precisely modeling dynamics and computing features which describe well the variety of scene appearances, hand-programming a robot behavior is often intractable. Deep reinforcement learning methods partially alleviate this probl...

Descripción completa

Detalles Bibliográficos
Autores principales:	de La Bourdonnaye, François, Teulière, Céline, Triesch, Jochen, Chateau, Thierry
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2018
Materias:	Robotics and AI
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7806066/ https://www.ncbi.nlm.nih.gov/pubmed/33500989 http://dx.doi.org/10.3389/frobt.2018.00110

_version_	1783636448497369088
author	de La Bourdonnaye, François Teulière, Céline Triesch, Jochen Chateau, Thierry
author_facet	de La Bourdonnaye, François Teulière, Céline Triesch, Jochen Chateau, Thierry
author_sort	de La Bourdonnaye, François
collection	PubMed
description	In some manipulation robotics environments, because of the difficulty of precisely modeling dynamics and computing features which describe well the variety of scene appearances, hand-programming a robot behavior is often intractable. Deep reinforcement learning methods partially alleviate this problem in that they can dispense with hand-crafted features for the state representation and do not need pre-computed dynamics. However, they often use prior information in the task definition in the form of shaping rewards which guide the robot toward goal state areas but require engineering or human supervision and can lead to sub-optimal behavior. In this work we consider a complex robot reaching task with a large range of initial object positions and initial arm positions and propose a new learning approach with minimal supervision. Inspired by developmental robotics, our method consists of a weakly-supervised stage-wise procedure of three tasks. First, the robot learns to fixate the object with a 2-camera system. Second, it learns hand-eye coordination by learning to fixate its end-effector. Third, using the knowledge acquired in the previous steps, it learns to reach the object at different positions and from a large set of initial robot joint angles. Experiments in a simulated environment show that our stage-wise framework yields similar reaching performances, compared with a supervised setting without using kinematic models, hand-crafted features, calibration parameters or supervised visual modules.
format	Online Article Text
id	pubmed-7806066
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-78060662021-01-25 Stage-Wise Learning of Reaching Using Little Prior Knowledge de La Bourdonnaye, François Teulière, Céline Triesch, Jochen Chateau, Thierry Front Robot AI Robotics and AI In some manipulation robotics environments, because of the difficulty of precisely modeling dynamics and computing features which describe well the variety of scene appearances, hand-programming a robot behavior is often intractable. Deep reinforcement learning methods partially alleviate this problem in that they can dispense with hand-crafted features for the state representation and do not need pre-computed dynamics. However, they often use prior information in the task definition in the form of shaping rewards which guide the robot toward goal state areas but require engineering or human supervision and can lead to sub-optimal behavior. In this work we consider a complex robot reaching task with a large range of initial object positions and initial arm positions and propose a new learning approach with minimal supervision. Inspired by developmental robotics, our method consists of a weakly-supervised stage-wise procedure of three tasks. First, the robot learns to fixate the object with a 2-camera system. Second, it learns hand-eye coordination by learning to fixate its end-effector. Third, using the knowledge acquired in the previous steps, it learns to reach the object at different positions and from a large set of initial robot joint angles. Experiments in a simulated environment show that our stage-wise framework yields similar reaching performances, compared with a supervised setting without using kinematic models, hand-crafted features, calibration parameters or supervised visual modules. Frontiers Media S.A. 2018-10-01 /pmc/articles/PMC7806066/ /pubmed/33500989 http://dx.doi.org/10.3389/frobt.2018.00110 Text en Copyright © 2018 de La Bourdonnaye, Teulière, Triesch and Chateau. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Robotics and AI de La Bourdonnaye, François Teulière, Céline Triesch, Jochen Chateau, Thierry Stage-Wise Learning of Reaching Using Little Prior Knowledge
title	Stage-Wise Learning of Reaching Using Little Prior Knowledge
title_full	Stage-Wise Learning of Reaching Using Little Prior Knowledge
title_fullStr	Stage-Wise Learning of Reaching Using Little Prior Knowledge
title_full_unstemmed	Stage-Wise Learning of Reaching Using Little Prior Knowledge
title_short	Stage-Wise Learning of Reaching Using Little Prior Knowledge
title_sort	stage-wise learning of reaching using little prior knowledge
topic	Robotics and AI
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7806066/ https://www.ncbi.nlm.nih.gov/pubmed/33500989 http://dx.doi.org/10.3389/frobt.2018.00110
work_keys_str_mv	AT delabourdonnayefrancois stagewiselearningofreachingusinglittlepriorknowledge AT teuliereceline stagewiselearningofreachingusinglittlepriorknowledge AT trieschjochen stagewiselearningofreachingusinglittlepriorknowledge AT chateauthierry stagewiselearningofreachingusinglittlepriorknowledge

Stage-Wise Learning of Reaching Using Little Prior Knowledge

Ejemplares similares