Cargando…
Koopman Operator–Based Knowledge-Guided Reinforcement Learning for Safe Human–Robot Interaction
We developed a novel framework for deep reinforcement learning (DRL) algorithms in task constrained path generation problems of robotic manipulators leveraging human demonstrated trajectories. The main contribution of this article is to design a reward function that can be used with generic reinforc...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245025/ https://www.ncbi.nlm.nih.gov/pubmed/35783024 http://dx.doi.org/10.3389/frobt.2022.779194 |
_version_ | 1784738658610315264 |
---|---|
author | Sinha, Anirban Wang, Yue |
author_facet | Sinha, Anirban Wang, Yue |
author_sort | Sinha, Anirban |
collection | PubMed |
description | We developed a novel framework for deep reinforcement learning (DRL) algorithms in task constrained path generation problems of robotic manipulators leveraging human demonstrated trajectories. The main contribution of this article is to design a reward function that can be used with generic reinforcement learning algorithms by utilizing the Koopman operator theory to build a human intent model from the human demonstrated trajectories. In order to ensure that the developed reward function produces the correct reward, the demonstrated trajectories are further used to create a trust domain within which the Koopman operator–based human intent prediction is considered. Otherwise, the proposed algorithm asks for human feedback to receive rewards. The designed reward function is incorporated inside the deep Q-learning (DQN) framework, which results in a modified DQN algorithm. The effectiveness of the proposed learning algorithm is demonstrated using a simulated robotic arm to learn the paths for constrained end-effector motion and considering the safety of the human in the surroundings of the robot. |
format | Online Article Text |
id | pubmed-9245025 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-92450252022-07-01 Koopman Operator–Based Knowledge-Guided Reinforcement Learning for Safe Human–Robot Interaction Sinha, Anirban Wang, Yue Front Robot AI Robotics and AI We developed a novel framework for deep reinforcement learning (DRL) algorithms in task constrained path generation problems of robotic manipulators leveraging human demonstrated trajectories. The main contribution of this article is to design a reward function that can be used with generic reinforcement learning algorithms by utilizing the Koopman operator theory to build a human intent model from the human demonstrated trajectories. In order to ensure that the developed reward function produces the correct reward, the demonstrated trajectories are further used to create a trust domain within which the Koopman operator–based human intent prediction is considered. Otherwise, the proposed algorithm asks for human feedback to receive rewards. The designed reward function is incorporated inside the deep Q-learning (DQN) framework, which results in a modified DQN algorithm. The effectiveness of the proposed learning algorithm is demonstrated using a simulated robotic arm to learn the paths for constrained end-effector motion and considering the safety of the human in the surroundings of the robot. Frontiers Media S.A. 2022-06-16 /pmc/articles/PMC9245025/ /pubmed/35783024 http://dx.doi.org/10.3389/frobt.2022.779194 Text en Copyright © 2022 Sinha and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Robotics and AI Sinha, Anirban Wang, Yue Koopman Operator–Based Knowledge-Guided Reinforcement Learning for Safe Human–Robot Interaction |
title | Koopman Operator–Based Knowledge-Guided Reinforcement Learning for Safe Human–Robot Interaction |
title_full | Koopman Operator–Based Knowledge-Guided Reinforcement Learning for Safe Human–Robot Interaction |
title_fullStr | Koopman Operator–Based Knowledge-Guided Reinforcement Learning for Safe Human–Robot Interaction |
title_full_unstemmed | Koopman Operator–Based Knowledge-Guided Reinforcement Learning for Safe Human–Robot Interaction |
title_short | Koopman Operator–Based Knowledge-Guided Reinforcement Learning for Safe Human–Robot Interaction |
title_sort | koopman operator–based knowledge-guided reinforcement learning for safe human–robot interaction |
topic | Robotics and AI |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9245025/ https://www.ncbi.nlm.nih.gov/pubmed/35783024 http://dx.doi.org/10.3389/frobt.2022.779194 |
work_keys_str_mv | AT sinhaanirban koopmanoperatorbasedknowledgeguidedreinforcementlearningforsafehumanrobotinteraction AT wangyue koopmanoperatorbasedknowledgeguidedreinforcementlearningforsafehumanrobotinteraction |