Cargando…

Koopman Operator–Based Knowledge-Guided Reinforcement Learning for Safe Human–Robot Interaction

We developed a novel framework for deep reinforcement learning (DRL) algorithms in task constrained path generation problems of robotic manipulators leveraging human demonstrated trajectories. The main contribution of this article is to design a reward function that can be used with generic reinforc...

Descripción completa

Detalles Bibliográficos
Autores principales: Sinha, Anirban, Wang, Yue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245025/
https://www.ncbi.nlm.nih.gov/pubmed/35783024
http://dx.doi.org/10.3389/frobt.2022.779194
_version_ 1784738658610315264
author Sinha, Anirban
Wang, Yue
author_facet Sinha, Anirban
Wang, Yue
author_sort Sinha, Anirban
collection PubMed
description We developed a novel framework for deep reinforcement learning (DRL) algorithms in task constrained path generation problems of robotic manipulators leveraging human demonstrated trajectories. The main contribution of this article is to design a reward function that can be used with generic reinforcement learning algorithms by utilizing the Koopman operator theory to build a human intent model from the human demonstrated trajectories. In order to ensure that the developed reward function produces the correct reward, the demonstrated trajectories are further used to create a trust domain within which the Koopman operator–based human intent prediction is considered. Otherwise, the proposed algorithm asks for human feedback to receive rewards. The designed reward function is incorporated inside the deep Q-learning (DQN) framework, which results in a modified DQN algorithm. The effectiveness of the proposed learning algorithm is demonstrated using a simulated robotic arm to learn the paths for constrained end-effector motion and considering the safety of the human in the surroundings of the robot.
format Online
Article
Text
id pubmed-9245025
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-92450252022-07-01 Koopman Operator–Based Knowledge-Guided Reinforcement Learning for Safe Human–Robot Interaction Sinha, Anirban Wang, Yue Front Robot AI Robotics and AI We developed a novel framework for deep reinforcement learning (DRL) algorithms in task constrained path generation problems of robotic manipulators leveraging human demonstrated trajectories. The main contribution of this article is to design a reward function that can be used with generic reinforcement learning algorithms by utilizing the Koopman operator theory to build a human intent model from the human demonstrated trajectories. In order to ensure that the developed reward function produces the correct reward, the demonstrated trajectories are further used to create a trust domain within which the Koopman operator–based human intent prediction is considered. Otherwise, the proposed algorithm asks for human feedback to receive rewards. The designed reward function is incorporated inside the deep Q-learning (DQN) framework, which results in a modified DQN algorithm. The effectiveness of the proposed learning algorithm is demonstrated using a simulated robotic arm to learn the paths for constrained end-effector motion and considering the safety of the human in the surroundings of the robot. Frontiers Media S.A. 2022-06-16 /pmc/articles/PMC9245025/ /pubmed/35783024 http://dx.doi.org/10.3389/frobt.2022.779194 Text en Copyright © 2022 Sinha and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Robotics and AI
Sinha, Anirban
Wang, Yue
Koopman Operator–Based Knowledge-Guided Reinforcement Learning for Safe Human–Robot Interaction
title Koopman Operator–Based Knowledge-Guided Reinforcement Learning for Safe Human–Robot Interaction
title_full Koopman Operator–Based Knowledge-Guided Reinforcement Learning for Safe Human–Robot Interaction
title_fullStr Koopman Operator–Based Knowledge-Guided Reinforcement Learning for Safe Human–Robot Interaction
title_full_unstemmed Koopman Operator–Based Knowledge-Guided Reinforcement Learning for Safe Human–Robot Interaction
title_short Koopman Operator–Based Knowledge-Guided Reinforcement Learning for Safe Human–Robot Interaction
title_sort koopman operator–based knowledge-guided reinforcement learning for safe human–robot interaction
topic Robotics and AI
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245025/
https://www.ncbi.nlm.nih.gov/pubmed/35783024
http://dx.doi.org/10.3389/frobt.2022.779194
work_keys_str_mv AT sinhaanirban koopmanoperatorbasedknowledgeguidedreinforcementlearningforsafehumanrobotinteraction
AT wangyue koopmanoperatorbasedknowledgeguidedreinforcementlearningforsafehumanrobotinteraction