Cargando…
Network Architecture for Optimizing Deep Deterministic Policy Gradient Algorithms
The traditional Deep Deterministic Policy Gradient (DDPG) algorithm has been widely used in continuous action spaces, but it still suffers from the problems of easily falling into local optima and large error fluctuations. Aiming at these deficiencies, this paper proposes a dual-actor-dual-critic DD...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9699738/ https://www.ncbi.nlm.nih.gov/pubmed/36438689 http://dx.doi.org/10.1155/2022/1117781 |