Realistic Actor-Critic: A framework for balance between value overestimation and underestimation
INTRODUCTION: The value approximation bias is known to lead to suboptimal policies or catastrophic overestimation bias accumulation that prevent the agent from making the right decisions between exploration and exploitation. Algorithms have been proposed to mitigate the above contradiction. However,...
Main authors: | Li, Sicen; Tang, Qinyun; Pang, Yiming; Ma, Xinmeng; Wang, Gang |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Frontiers Media S.A. 2023 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9868235/ https://www.ncbi.nlm.nih.gov/pubmed/36699950 http://dx.doi.org/10.3389/fnbot.2022.1081242 |
Similar items
- Combining backpropagation with Equilibrium Propagation to improve an Actor-Critic reinforcement learning framework
  by: Kubo, Yoshimasa, et al.
  Published: (2022)
- Self-Assessment of Medical Knowledge: Do Physicians Overestimate or Underestimate?
  by: Jankowski, Janusz, et al.
  Published: (1991)
- Possible overestimation of chest wall driving pressure and underestimation of airway closure
  by: Nakayama, Ryuichi, et al.
  Published: (2022)
- Underestimation of metabolic unhealthiness and overestimation of non-alcoholic fatty liver disease
  by: Sarathi, Vijaya, et al.
  Published: (2023)
- Overestimated lead times in cancer screening has led to substantial underestimation of overdiagnosis
  by: Zahl, P-H, et al.
  Published: (2013)