Cargando…
Scaling Up Q-Learning via Exploiting State–Action Equivalence
Recent success stories in reinforcement learning have demonstrated that leveraging structural properties of the underlying environment is key in devising viable methods capable of solving complex tasks. We study off-policy learning in discounted reinforcement learning, where some equivalence relatio...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10137898/ https://www.ncbi.nlm.nih.gov/pubmed/37190372 http://dx.doi.org/10.3390/e25040584 |