Cargando…

Mesolimbic dopamine adapts the rate of learning from action

Recent success in training artificial agents and robots derives from a combination of direct learning of behavioural policies and indirect learning through value functions(1–3). Policy learning and value learning use distinct algorithms that optimize behavioural performance and reward prediction, re...

Descripción completa

Detalles Bibliográficos
Autores principales: Coddington, Luke T., Lindo, Sarah E., Dudman, Joshua T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9908546/
https://www.ncbi.nlm.nih.gov/pubmed/36653450
http://dx.doi.org/10.1038/s41586-022-05614-z