Cargando…

Multi-Channel Interactive Reinforcement Learning for Sequential Tasks

The ability to learn new tasks by sequencing already known skills is an important requirement for future robots. Reinforcement learning is a powerful tool for this as it allows for a robot to learn and improve on how to combine skills for sequential tasks. However, in real robotic applications, the...

Descripción completa

Detalles Bibliográficos
Autores principales: Koert, Dorothea, Kircher, Maximilian, Salikutluk, Vildan, D'Eramo, Carlo, Peters, Jan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7805623/
https://www.ncbi.nlm.nih.gov/pubmed/33501264
http://dx.doi.org/10.3389/frobt.2020.00097
_version_ 1783636341625454592
author Koert, Dorothea
Kircher, Maximilian
Salikutluk, Vildan
D'Eramo, Carlo
Peters, Jan
author_facet Koert, Dorothea
Kircher, Maximilian
Salikutluk, Vildan
D'Eramo, Carlo
Peters, Jan
author_sort Koert, Dorothea
collection PubMed
description The ability to learn new tasks by sequencing already known skills is an important requirement for future robots. Reinforcement learning is a powerful tool for this as it allows for a robot to learn and improve on how to combine skills for sequential tasks. However, in real robotic applications, the cost of sample collection and exploration prevent the application of reinforcement learning for a variety of tasks. To overcome these limitations, human input during reinforcement can be beneficial to speed up learning, guide the exploration and prevent the choice of disastrous actions. Nevertheless, there is a lack of experimental evaluations of multi-channel interactive reinforcement learning systems solving robotic tasks with input from inexperienced human users, in particular for cases where human input might be partially wrong. Therefore, in this paper, we present an approach that incorporates multiple human input channels for interactive reinforcement learning in a unified framework and evaluate it on two robotic tasks with 20 inexperienced human subjects. To enable the robot to also handle potentially incorrect human input we incorporate a novel concept for self-confidence, which allows the robot to question human input after an initial learning phase. The second robotic task is specifically designed to investigate if this self-confidence can enable the robot to achieve learning progress even if the human input is partially incorrect. Further, we evaluate how humans react to suggestions of the robot, once the robot notices human input might be wrong. Our experimental evaluations show that our approach can successfully incorporate human input to accelerate the learning process in both robotic tasks even if it is partially wrong. However, not all humans were willing to accept the robot's suggestions or its questioning of their input, particularly if they do not understand the learning process and the reasons behind the robot's suggestions. We believe that the findings from this experimental evaluation can be beneficial for the future design of algorithms and interfaces of interactive reinforcement learning systems used by inexperienced users.
format Online
Article
Text
id pubmed-7805623
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-78056232021-01-25 Multi-Channel Interactive Reinforcement Learning for Sequential Tasks Koert, Dorothea Kircher, Maximilian Salikutluk, Vildan D'Eramo, Carlo Peters, Jan Front Robot AI Robotics and AI The ability to learn new tasks by sequencing already known skills is an important requirement for future robots. Reinforcement learning is a powerful tool for this as it allows for a robot to learn and improve on how to combine skills for sequential tasks. However, in real robotic applications, the cost of sample collection and exploration prevent the application of reinforcement learning for a variety of tasks. To overcome these limitations, human input during reinforcement can be beneficial to speed up learning, guide the exploration and prevent the choice of disastrous actions. Nevertheless, there is a lack of experimental evaluations of multi-channel interactive reinforcement learning systems solving robotic tasks with input from inexperienced human users, in particular for cases where human input might be partially wrong. Therefore, in this paper, we present an approach that incorporates multiple human input channels for interactive reinforcement learning in a unified framework and evaluate it on two robotic tasks with 20 inexperienced human subjects. To enable the robot to also handle potentially incorrect human input we incorporate a novel concept for self-confidence, which allows the robot to question human input after an initial learning phase. The second robotic task is specifically designed to investigate if this self-confidence can enable the robot to achieve learning progress even if the human input is partially incorrect. Further, we evaluate how humans react to suggestions of the robot, once the robot notices human input might be wrong. Our experimental evaluations show that our approach can successfully incorporate human input to accelerate the learning process in both robotic tasks even if it is partially wrong. However, not all humans were willing to accept the robot's suggestions or its questioning of their input, particularly if they do not understand the learning process and the reasons behind the robot's suggestions. We believe that the findings from this experimental evaluation can be beneficial for the future design of algorithms and interfaces of interactive reinforcement learning systems used by inexperienced users. Frontiers Media S.A. 2020-09-24 /pmc/articles/PMC7805623/ /pubmed/33501264 http://dx.doi.org/10.3389/frobt.2020.00097 Text en Copyright © 2020 Koert, Kircher, Salikutluk, D'Eramo and Peters. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Robotics and AI
Koert, Dorothea
Kircher, Maximilian
Salikutluk, Vildan
D'Eramo, Carlo
Peters, Jan
Multi-Channel Interactive Reinforcement Learning for Sequential Tasks
title Multi-Channel Interactive Reinforcement Learning for Sequential Tasks
title_full Multi-Channel Interactive Reinforcement Learning for Sequential Tasks
title_fullStr Multi-Channel Interactive Reinforcement Learning for Sequential Tasks
title_full_unstemmed Multi-Channel Interactive Reinforcement Learning for Sequential Tasks
title_short Multi-Channel Interactive Reinforcement Learning for Sequential Tasks
title_sort multi-channel interactive reinforcement learning for sequential tasks
topic Robotics and AI
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7805623/
https://www.ncbi.nlm.nih.gov/pubmed/33501264
http://dx.doi.org/10.3389/frobt.2020.00097
work_keys_str_mv AT koertdorothea multichannelinteractivereinforcementlearningforsequentialtasks
AT kirchermaximilian multichannelinteractivereinforcementlearningforsequentialtasks
AT salikutlukvildan multichannelinteractivereinforcementlearningforsequentialtasks
AT deramocarlo multichannelinteractivereinforcementlearningforsequentialtasks
AT petersjan multichannelinteractivereinforcementlearningforsequentialtasks