Cargando…

Balancing Exploration and Exploitation in Self-imitation Learning

Sparse reward tasks are always challenging in reinforcement learning. Learning such tasks requires both efficient exploitation and exploration to reduce the sample complexity. One line of research called self-imitation learning is recently proposed, which encourages the agent to do more exploitation...

Descripción completa

Detalles Bibliográficos
Autores principales: Kang, Chun-Yao, Chen, Ming-Syan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206262/
http://dx.doi.org/10.1007/978-3-030-47436-2_21