Cargando…

A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction

This paper offers a new hybrid probably approximately correct (PAC) reinforcement learning (RL) algorithm for Markov decision processes (MDPs) that intelligently maintains favorable features of both model-based and model-free methodologies. The designed algorithm, referred to as the Dyna-Delayed Q-l...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zehfroosh, Ashkan, Tanner , Herbert G.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2022
Materias:	Robotics and AI
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982074/ https://www.ncbi.nlm.nih.gov/pubmed/35391942 http://dx.doi.org/10.3389/frobt.2022.797213

_version_	1784681732763549696
author	Zehfroosh, Ashkan Tanner , Herbert G.
author_facet	Zehfroosh, Ashkan Tanner , Herbert G.
author_sort	Zehfroosh, Ashkan
collection	PubMed
description	This paper offers a new hybrid probably approximately correct (PAC) reinforcement learning (RL) algorithm for Markov decision processes (MDPs) that intelligently maintains favorable features of both model-based and model-free methodologies. The designed algorithm, referred to as the Dyna-Delayed Q-learning (DDQ) algorithm, combines model-free Delayed Q-learning and model-based R-max algorithms while outperforming both in most cases. The paper includes a PAC analysis of the DDQ algorithm and a derivation of its sample complexity. Numerical results are provided to support the claim regarding the new algorithm’s sample efficiency compared to its parents as well as the best known PAC model-free and model-based algorithms in application. A real-world experimental implementation of DDQ in the context of pediatric motor rehabilitation facilitated by infant-robot interaction highlights the potential benefits of the reported method.
format	Online Article Text
id	pubmed-8982074
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-89820742022-04-06 A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction Zehfroosh, Ashkan Tanner , Herbert G. Front Robot AI Robotics and AI This paper offers a new hybrid probably approximately correct (PAC) reinforcement learning (RL) algorithm for Markov decision processes (MDPs) that intelligently maintains favorable features of both model-based and model-free methodologies. The designed algorithm, referred to as the Dyna-Delayed Q-learning (DDQ) algorithm, combines model-free Delayed Q-learning and model-based R-max algorithms while outperforming both in most cases. The paper includes a PAC analysis of the DDQ algorithm and a derivation of its sample complexity. Numerical results are provided to support the claim regarding the new algorithm’s sample efficiency compared to its parents as well as the best known PAC model-free and model-based algorithms in application. A real-world experimental implementation of DDQ in the context of pediatric motor rehabilitation facilitated by infant-robot interaction highlights the potential benefits of the reported method. Frontiers Media S.A. 2022-03-09 /pmc/articles/PMC8982074/ /pubmed/35391942 http://dx.doi.org/10.3389/frobt.2022.797213 Text en Copyright © 2022 Zehfroosh and Tanner. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Robotics and AI Zehfroosh, Ashkan Tanner , Herbert G. A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction
title	A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction
title_full	A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction
title_fullStr	A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction
title_full_unstemmed	A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction
title_short	A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction
title_sort	hybrid pac reinforcement learning algorithm for human-robot interaction
topic	Robotics and AI
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982074/ https://www.ncbi.nlm.nih.gov/pubmed/35391942 http://dx.doi.org/10.3389/frobt.2022.797213
work_keys_str_mv	AT zehfrooshashkan ahybridpacreinforcementlearningalgorithmforhumanrobotinteraction AT tannerherbertg ahybridpacreinforcementlearningalgorithmforhumanrobotinteraction AT zehfrooshashkan hybridpacreinforcementlearningalgorithmforhumanrobotinteraction AT tannerherbertg hybridpacreinforcementlearningalgorithmforhumanrobotinteraction

A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction

Ejemplares similares