Cargando…
Diversity Evolutionary Policy Deep Reinforcement Learning
The reinforcement learning algorithms based on policy gradient may fall into local optimal due to gradient disappearance during the update process, which in turn affects the exploration ability of the reinforcement learning agent. In order to solve the above problem, in this paper, the cross-entropy...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8357468/ https://www.ncbi.nlm.nih.gov/pubmed/34394336 http://dx.doi.org/10.1155/2021/5300189 |