Cargando…

Scaling Up Q-Learning via Exploiting State–Action Equivalence

Recent success stories in reinforcement learning have demonstrated that leveraging structural properties of the underlying environment is key in devising viable methods capable of solving complex tasks. We study off-policy learning in discounted reinforcement learning, where some equivalence relatio...

Descripción completa

Detalles Bibliográficos
Autores principales:	Lyu, Yunlian, Côme, Aymeric, Zhang, Yijie, Talebi, Mohammad Sadegh
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10137898/ https://www.ncbi.nlm.nih.gov/pubmed/37190372 http://dx.doi.org/10.3390/e25040584

Internet

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10137898/
https://www.ncbi.nlm.nih.gov/pubmed/37190372
http://dx.doi.org/10.3390/e25040584

Scaling Up Q-Learning via Exploiting State–Action Equivalence

Internet

Ejemplares similares