Cargando…

A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction

This paper offers a new hybrid probably approximately correct (PAC) reinforcement learning (RL) algorithm for Markov decision processes (MDPs) that intelligently maintains favorable features of both model-based and model-free methodologies. The designed algorithm, referred to as the Dyna-Delayed Q-l...

Descripción completa

Detalles Bibliográficos
Autores principales: Zehfroosh, Ashkan, Tanner , Herbert G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982074/
https://www.ncbi.nlm.nih.gov/pubmed/35391942
http://dx.doi.org/10.3389/frobt.2022.797213
_version_ 1784681732763549696
author Zehfroosh, Ashkan
Tanner , Herbert G.
author_facet Zehfroosh, Ashkan
Tanner , Herbert G.
author_sort Zehfroosh, Ashkan
collection PubMed
description This paper offers a new hybrid probably approximately correct (PAC) reinforcement learning (RL) algorithm for Markov decision processes (MDPs) that intelligently maintains favorable features of both model-based and model-free methodologies. The designed algorithm, referred to as the Dyna-Delayed Q-learning (DDQ) algorithm, combines model-free Delayed Q-learning and model-based R-max algorithms while outperforming both in most cases. The paper includes a PAC analysis of the DDQ algorithm and a derivation of its sample complexity. Numerical results are provided to support the claim regarding the new algorithm’s sample efficiency compared to its parents as well as the best known PAC model-free and model-based algorithms in application. A real-world experimental implementation of DDQ in the context of pediatric motor rehabilitation facilitated by infant-robot interaction highlights the potential benefits of the reported method.
format Online
Article
Text
id pubmed-8982074
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-89820742022-04-06 A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction Zehfroosh, Ashkan Tanner , Herbert G. Front Robot AI Robotics and AI This paper offers a new hybrid probably approximately correct (PAC) reinforcement learning (RL) algorithm for Markov decision processes (MDPs) that intelligently maintains favorable features of both model-based and model-free methodologies. The designed algorithm, referred to as the Dyna-Delayed Q-learning (DDQ) algorithm, combines model-free Delayed Q-learning and model-based R-max algorithms while outperforming both in most cases. The paper includes a PAC analysis of the DDQ algorithm and a derivation of its sample complexity. Numerical results are provided to support the claim regarding the new algorithm’s sample efficiency compared to its parents as well as the best known PAC model-free and model-based algorithms in application. A real-world experimental implementation of DDQ in the context of pediatric motor rehabilitation facilitated by infant-robot interaction highlights the potential benefits of the reported method. Frontiers Media S.A. 2022-03-09 /pmc/articles/PMC8982074/ /pubmed/35391942 http://dx.doi.org/10.3389/frobt.2022.797213 Text en Copyright © 2022 Zehfroosh and Tanner. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Robotics and AI
Zehfroosh, Ashkan
Tanner , Herbert G.
A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction
title A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction
title_full A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction
title_fullStr A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction
title_full_unstemmed A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction
title_short A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction
title_sort hybrid pac reinforcement learning algorithm for human-robot interaction
topic Robotics and AI
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982074/
https://www.ncbi.nlm.nih.gov/pubmed/35391942
http://dx.doi.org/10.3389/frobt.2022.797213
work_keys_str_mv AT zehfrooshashkan ahybridpacreinforcementlearningalgorithmforhumanrobotinteraction
AT tannerherbertg ahybridpacreinforcementlearningalgorithmforhumanrobotinteraction
AT zehfrooshashkan hybridpacreinforcementlearningalgorithmforhumanrobotinteraction
AT tannerherbertg hybridpacreinforcementlearningalgorithmforhumanrobotinteraction