Cargando…

Network Architecture for Optimizing Deep Deterministic Policy Gradient Algorithms

The traditional Deep Deterministic Policy Gradient (DDPG) algorithm has been widely used in continuous action spaces, but it still suffers from the problems of easily falling into local optima and large error fluctuations. Aiming at these deficiencies, this paper proposes a dual-actor-dual-critic DD...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Haifei, Xu, Jian, Zhang, Jian, Liu, Quan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9699738/
https://www.ncbi.nlm.nih.gov/pubmed/36438689
http://dx.doi.org/10.1155/2022/1117781