Cargando…
A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction
This paper offers a new hybrid probably approximately correct (PAC) reinforcement learning (RL) algorithm for Markov decision processes (MDPs) that intelligently maintains favorable features of both model-based and model-free methodologies. The designed algorithm, referred to as the Dyna-Delayed Q-l...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982074/ https://www.ncbi.nlm.nih.gov/pubmed/35391942 http://dx.doi.org/10.3389/frobt.2022.797213 |
_version_ | 1784681732763549696 |
---|---|
author | Zehfroosh, Ashkan Tanner , Herbert G. |
author_facet | Zehfroosh, Ashkan Tanner , Herbert G. |
author_sort | Zehfroosh, Ashkan |
collection | PubMed |
description | This paper offers a new hybrid probably approximately correct (PAC) reinforcement learning (RL) algorithm for Markov decision processes (MDPs) that intelligently maintains favorable features of both model-based and model-free methodologies. The designed algorithm, referred to as the Dyna-Delayed Q-learning (DDQ) algorithm, combines model-free Delayed Q-learning and model-based R-max algorithms while outperforming both in most cases. The paper includes a PAC analysis of the DDQ algorithm and a derivation of its sample complexity. Numerical results are provided to support the claim regarding the new algorithm’s sample efficiency compared to its parents as well as the best known PAC model-free and model-based algorithms in application. A real-world experimental implementation of DDQ in the context of pediatric motor rehabilitation facilitated by infant-robot interaction highlights the potential benefits of the reported method. |
format | Online Article Text |
id | pubmed-8982074 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-89820742022-04-06 A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction Zehfroosh, Ashkan Tanner , Herbert G. Front Robot AI Robotics and AI This paper offers a new hybrid probably approximately correct (PAC) reinforcement learning (RL) algorithm for Markov decision processes (MDPs) that intelligently maintains favorable features of both model-based and model-free methodologies. The designed algorithm, referred to as the Dyna-Delayed Q-learning (DDQ) algorithm, combines model-free Delayed Q-learning and model-based R-max algorithms while outperforming both in most cases. The paper includes a PAC analysis of the DDQ algorithm and a derivation of its sample complexity. Numerical results are provided to support the claim regarding the new algorithm’s sample efficiency compared to its parents as well as the best known PAC model-free and model-based algorithms in application. A real-world experimental implementation of DDQ in the context of pediatric motor rehabilitation facilitated by infant-robot interaction highlights the potential benefits of the reported method. Frontiers Media S.A. 2022-03-09 /pmc/articles/PMC8982074/ /pubmed/35391942 http://dx.doi.org/10.3389/frobt.2022.797213 Text en Copyright © 2022 Zehfroosh and Tanner. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Robotics and AI Zehfroosh, Ashkan Tanner , Herbert G. A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction |
title | A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction |
title_full | A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction |
title_fullStr | A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction |
title_full_unstemmed | A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction |
title_short | A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction |
title_sort | hybrid pac reinforcement learning algorithm for human-robot interaction |
topic | Robotics and AI |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982074/ https://www.ncbi.nlm.nih.gov/pubmed/35391942 http://dx.doi.org/10.3389/frobt.2022.797213 |
work_keys_str_mv | AT zehfrooshashkan ahybridpacreinforcementlearningalgorithmforhumanrobotinteraction AT tannerherbertg ahybridpacreinforcementlearningalgorithmforhumanrobotinteraction AT zehfrooshashkan hybridpacreinforcementlearningalgorithmforhumanrobotinteraction AT tannerherbertg hybridpacreinforcementlearningalgorithmforhumanrobotinteraction |