Cargando…

Multi-Channel Interactive Reinforcement Learning for Sequential Tasks

The ability to learn new tasks by sequencing already known skills is an important requirement for future robots. Reinforcement learning is a powerful tool for this as it allows for a robot to learn and improve on how to combine skills for sequential tasks. However, in real robotic applications, the...

Descripción completa

Detalles Bibliográficos
Autores principales:	Koert, Dorothea, Kircher, Maximilian, Salikutluk, Vildan, D'Eramo, Carlo, Peters, Jan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2020
Materias:	Robotics and AI
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7805623/ https://www.ncbi.nlm.nih.gov/pubmed/33501264 http://dx.doi.org/10.3389/frobt.2020.00097

_version_	1783636341625454592
author	Koert, Dorothea Kircher, Maximilian Salikutluk, Vildan D'Eramo, Carlo Peters, Jan
author_facet	Koert, Dorothea Kircher, Maximilian Salikutluk, Vildan D'Eramo, Carlo Peters, Jan
author_sort	Koert, Dorothea
collection	PubMed
description	The ability to learn new tasks by sequencing already known skills is an important requirement for future robots. Reinforcement learning is a powerful tool for this as it allows for a robot to learn and improve on how to combine skills for sequential tasks. However, in real robotic applications, the cost of sample collection and exploration prevent the application of reinforcement learning for a variety of tasks. To overcome these limitations, human input during reinforcement can be beneficial to speed up learning, guide the exploration and prevent the choice of disastrous actions. Nevertheless, there is a lack of experimental evaluations of multi-channel interactive reinforcement learning systems solving robotic tasks with input from inexperienced human users, in particular for cases where human input might be partially wrong. Therefore, in this paper, we present an approach that incorporates multiple human input channels for interactive reinforcement learning in a unified framework and evaluate it on two robotic tasks with 20 inexperienced human subjects. To enable the robot to also handle potentially incorrect human input we incorporate a novel concept for self-confidence, which allows the robot to question human input after an initial learning phase. The second robotic task is specifically designed to investigate if this self-confidence can enable the robot to achieve learning progress even if the human input is partially incorrect. Further, we evaluate how humans react to suggestions of the robot, once the robot notices human input might be wrong. Our experimental evaluations show that our approach can successfully incorporate human input to accelerate the learning process in both robotic tasks even if it is partially wrong. However, not all humans were willing to accept the robot's suggestions or its questioning of their input, particularly if they do not understand the learning process and the reasons behind the robot's suggestions. We believe that the findings from this experimental evaluation can be beneficial for the future design of algorithms and interfaces of interactive reinforcement learning systems used by inexperienced users.
format	Online Article Text
id	pubmed-7805623
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-78056232021-01-25 Multi-Channel Interactive Reinforcement Learning for Sequential Tasks Koert, Dorothea Kircher, Maximilian Salikutluk, Vildan D'Eramo, Carlo Peters, Jan Front Robot AI Robotics and AI The ability to learn new tasks by sequencing already known skills is an important requirement for future robots. Reinforcement learning is a powerful tool for this as it allows for a robot to learn and improve on how to combine skills for sequential tasks. However, in real robotic applications, the cost of sample collection and exploration prevent the application of reinforcement learning for a variety of tasks. To overcome these limitations, human input during reinforcement can be beneficial to speed up learning, guide the exploration and prevent the choice of disastrous actions. Nevertheless, there is a lack of experimental evaluations of multi-channel interactive reinforcement learning systems solving robotic tasks with input from inexperienced human users, in particular for cases where human input might be partially wrong. Therefore, in this paper, we present an approach that incorporates multiple human input channels for interactive reinforcement learning in a unified framework and evaluate it on two robotic tasks with 20 inexperienced human subjects. To enable the robot to also handle potentially incorrect human input we incorporate a novel concept for self-confidence, which allows the robot to question human input after an initial learning phase. The second robotic task is specifically designed to investigate if this self-confidence can enable the robot to achieve learning progress even if the human input is partially incorrect. Further, we evaluate how humans react to suggestions of the robot, once the robot notices human input might be wrong. Our experimental evaluations show that our approach can successfully incorporate human input to accelerate the learning process in both robotic tasks even if it is partially wrong. However, not all humans were willing to accept the robot's suggestions or its questioning of their input, particularly if they do not understand the learning process and the reasons behind the robot's suggestions. We believe that the findings from this experimental evaluation can be beneficial for the future design of algorithms and interfaces of interactive reinforcement learning systems used by inexperienced users. Frontiers Media S.A. 2020-09-24 /pmc/articles/PMC7805623/ /pubmed/33501264 http://dx.doi.org/10.3389/frobt.2020.00097 Text en Copyright © 2020 Koert, Kircher, Salikutluk, D'Eramo and Peters. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Robotics and AI Koert, Dorothea Kircher, Maximilian Salikutluk, Vildan D'Eramo, Carlo Peters, Jan Multi-Channel Interactive Reinforcement Learning for Sequential Tasks
title	Multi-Channel Interactive Reinforcement Learning for Sequential Tasks
title_full	Multi-Channel Interactive Reinforcement Learning for Sequential Tasks
title_fullStr	Multi-Channel Interactive Reinforcement Learning for Sequential Tasks
title_full_unstemmed	Multi-Channel Interactive Reinforcement Learning for Sequential Tasks
title_short	Multi-Channel Interactive Reinforcement Learning for Sequential Tasks
title_sort	multi-channel interactive reinforcement learning for sequential tasks
topic	Robotics and AI
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7805623/ https://www.ncbi.nlm.nih.gov/pubmed/33501264 http://dx.doi.org/10.3389/frobt.2020.00097
work_keys_str_mv	AT koertdorothea multichannelinteractivereinforcementlearningforsequentialtasks AT kirchermaximilian multichannelinteractivereinforcementlearningforsequentialtasks AT salikutlukvildan multichannelinteractivereinforcementlearningforsequentialtasks AT deramocarlo multichannelinteractivereinforcementlearningforsequentialtasks AT petersjan multichannelinteractivereinforcementlearningforsequentialtasks

Multi-Channel Interactive Reinforcement Learning for Sequential Tasks

Ejemplares similares