Cargando…
Mesolimbic dopamine adapts the rate of learning from action
Recent success in training artificial agents and robots derives from a combination of direct learning of behavioural policies and indirect learning through value functions(1–3). Policy learning and value learning use distinct algorithms that optimize behavioural performance and reward prediction, re...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9908546/ https://www.ncbi.nlm.nih.gov/pubmed/36653450 http://dx.doi.org/10.1038/s41586-022-05614-z |