Efficient Actor-Critic Algorithm with Hierarchical Model Learning and Planning

To improve the convergence rate and sample efficiency, two efficient learning methods, AC-HMLP and RAC-HMLP (AC-HMLP with ℓ₂-regularization), are proposed by combining the actor-critic algorithm with hierarchical model learning and planning. The hierarchical models consisting of the local and the g...

Bibliographic Details
Main authors: Zhong, Shan; Liu, Quan; Fu, QiMing
Format: Online Article Text
Language: English
Published: Hindawi Publishing Corporation 2016
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5066029/
https://www.ncbi.nlm.nih.gov/pubmed/27795704
http://dx.doi.org/10.1155/2016/4824072