Cargando…

A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction

This paper offers a new hybrid probably approximately correct (PAC) reinforcement learning (RL) algorithm for Markov decision processes (MDPs) that intelligently maintains favorable features of both model-based and model-free methodologies. The designed algorithm, referred to as the Dyna-Delayed Q-l...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zehfroosh, Ashkan, Tanner , Herbert G.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2022
Materias:	Robotics and AI
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982074/ https://www.ncbi.nlm.nih.gov/pubmed/35391942 http://dx.doi.org/10.3389/frobt.2022.797213

Internet

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982074/
https://www.ncbi.nlm.nih.gov/pubmed/35391942
http://dx.doi.org/10.3389/frobt.2022.797213

A Hybrid PAC Reinforcement Learning Algorithm for Human-Robot Interaction

Internet

Ejemplares similares