Cargando…

Scaling Up Q-Learning via Exploiting State–Action Equivalence

Recent success stories in reinforcement learning have demonstrated that leveraging structural properties of the underlying environment is key in devising viable methods capable of solving complex tasks. We study off-policy learning in discounted reinforcement learning, where some equivalence relatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Lyu, Yunlian, Côme, Aymeric, Zhang, Yijie, Talebi, Mohammad Sadegh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10137898/
https://www.ncbi.nlm.nih.gov/pubmed/37190372
http://dx.doi.org/10.3390/e25040584