Cargando…
Learning the payoffs and costs of actions
A set of sub-cortical nuclei called basal ganglia is critical for learning the values of actions. The basal ganglia include two pathways, which have been associated with approach and avoid behavior respectively and are differentially modulated by dopamine projections from the midbrain. Inspired by t...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6413954/ https://www.ncbi.nlm.nih.gov/pubmed/30818357 http://dx.doi.org/10.1371/journal.pcbi.1006285 |
_version_ | 1783402913607974912 |
---|---|
author | Möller, Moritz Bogacz, Rafal |
author_facet | Möller, Moritz Bogacz, Rafal |
author_sort | Möller, Moritz |
collection | PubMed |
description | A set of sub-cortical nuclei called basal ganglia is critical for learning the values of actions. The basal ganglia include two pathways, which have been associated with approach and avoid behavior respectively and are differentially modulated by dopamine projections from the midbrain. Inspired by the influential opponent actor learning model, we demonstrate that, under certain circumstances, these pathways may represent learned estimates of the positive and negative consequences (payoffs and costs) of individual actions. In the model, the level of dopamine activity encodes the motivational state and controls to what extent payoffs and costs enter the overall evaluation of actions. We show that a set of previously proposed plasticity rules is suitable to extract payoffs and costs from a prediction error signal if they occur at different moments in time. For those plasticity rules, successful learning requires differential effects of positive and negative outcome prediction errors on the two pathways and a weak decay of synaptic weights over trials. We also confirm through simulations that the model reproduces drug-induced changes of willingness to work, as observed in classical experiments with the D2-antagonist haloperidol. |
format | Online Article Text |
id | pubmed-6413954 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-64139542019-04-01 Learning the payoffs and costs of actions Möller, Moritz Bogacz, Rafal PLoS Comput Biol Research Article A set of sub-cortical nuclei called basal ganglia is critical for learning the values of actions. The basal ganglia include two pathways, which have been associated with approach and avoid behavior respectively and are differentially modulated by dopamine projections from the midbrain. Inspired by the influential opponent actor learning model, we demonstrate that, under certain circumstances, these pathways may represent learned estimates of the positive and negative consequences (payoffs and costs) of individual actions. In the model, the level of dopamine activity encodes the motivational state and controls to what extent payoffs and costs enter the overall evaluation of actions. We show that a set of previously proposed plasticity rules is suitable to extract payoffs and costs from a prediction error signal if they occur at different moments in time. For those plasticity rules, successful learning requires differential effects of positive and negative outcome prediction errors on the two pathways and a weak decay of synaptic weights over trials. We also confirm through simulations that the model reproduces drug-induced changes of willingness to work, as observed in classical experiments with the D2-antagonist haloperidol. Public Library of Science 2019-02-28 /pmc/articles/PMC6413954/ /pubmed/30818357 http://dx.doi.org/10.1371/journal.pcbi.1006285 Text en © 2019 Möller, Bogacz http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Möller, Moritz Bogacz, Rafal Learning the payoffs and costs of actions |
title | Learning the payoffs and costs of actions |
title_full | Learning the payoffs and costs of actions |
title_fullStr | Learning the payoffs and costs of actions |
title_full_unstemmed | Learning the payoffs and costs of actions |
title_short | Learning the payoffs and costs of actions |
title_sort | learning the payoffs and costs of actions |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6413954/ https://www.ncbi.nlm.nih.gov/pubmed/30818357 http://dx.doi.org/10.1371/journal.pcbi.1006285 |
work_keys_str_mv | AT mollermoritz learningthepayoffsandcostsofactions AT bogaczrafal learningthepayoffsandcostsofactions |