Cargando…

Diversity Evolutionary Policy Deep Reinforcement Learning

The reinforcement learning algorithms based on policy gradient may fall into local optimal due to gradient disappearance during the update process, which in turn affects the exploration ability of the reinforcement learning agent. In order to solve the above problem, in this paper, the cross-entropy...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Jian, Feng, Liming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8357468/
https://www.ncbi.nlm.nih.gov/pubmed/34394336
http://dx.doi.org/10.1155/2021/5300189