Cargando…
Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts
Right brain damaged patients show impairments in sequential decision making tasks for which healthy people do not show any difficulty. We hypothesized that this difficulty could be due to the failure of right brain damage patients to develop well-matched models of the world. Our motivation is the id...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3991603/ https://www.ncbi.nlm.nih.gov/pubmed/24747416 http://dx.doi.org/10.1371/journal.pone.0094308 |
_version_ | 1782312466708955136 |
---|---|
author | Mohammadi Sepahvand, Nazanin Stöttinger, Elisabeth Danckert, James Anderson, Britt |
author_facet | Mohammadi Sepahvand, Nazanin Stöttinger, Elisabeth Danckert, James Anderson, Britt |
author_sort | Mohammadi Sepahvand, Nazanin |
collection | PubMed |
description | Right brain damaged patients show impairments in sequential decision making tasks for which healthy people do not show any difficulty. We hypothesized that this difficulty could be due to the failure of right brain damage patients to develop well-matched models of the world. Our motivation is the idea that to navigate uncertainty, humans use models of the world to direct the decisions they make when interacting with their environment. The better the model is, the better their decisions are. To explore the model building and updating process in humans and the basis for impairment after brain injury, we used a computational model of non-stationary sequence learning. RELPH (Reinforcement and Entropy Learned Pruned Hypothesis space) was able to qualitatively and quantitatively reproduce the results of left and right brain damaged patient groups and healthy controls playing a sequential version of Rock, Paper, Scissors. Our results suggests that, in general, humans employ a sub-optimal reinforcement based learning method rather than an objectively better statistical learning approach, and that differences between right brain damaged and healthy control groups can be explained by different exploration policies, rather than qualitatively different learning mechanisms. |
format | Online Article Text |
id | pubmed-3991603 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-39916032014-04-21 Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts Mohammadi Sepahvand, Nazanin Stöttinger, Elisabeth Danckert, James Anderson, Britt PLoS One Research Article Right brain damaged patients show impairments in sequential decision making tasks for which healthy people do not show any difficulty. We hypothesized that this difficulty could be due to the failure of right brain damage patients to develop well-matched models of the world. Our motivation is the idea that to navigate uncertainty, humans use models of the world to direct the decisions they make when interacting with their environment. The better the model is, the better their decisions are. To explore the model building and updating process in humans and the basis for impairment after brain injury, we used a computational model of non-stationary sequence learning. RELPH (Reinforcement and Entropy Learned Pruned Hypothesis space) was able to qualitatively and quantitatively reproduce the results of left and right brain damaged patient groups and healthy controls playing a sequential version of Rock, Paper, Scissors. Our results suggests that, in general, humans employ a sub-optimal reinforcement based learning method rather than an objectively better statistical learning approach, and that differences between right brain damaged and healthy control groups can be explained by different exploration policies, rather than qualitatively different learning mechanisms. Public Library of Science 2014-04-18 /pmc/articles/PMC3991603/ /pubmed/24747416 http://dx.doi.org/10.1371/journal.pone.0094308 Text en © 2014 Mohammadi Sepahvand et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Mohammadi Sepahvand, Nazanin Stöttinger, Elisabeth Danckert, James Anderson, Britt Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts |
title | Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts |
title_full | Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts |
title_fullStr | Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts |
title_full_unstemmed | Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts |
title_short | Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts |
title_sort | sequential decisions: a computational comparison of observational and reinforcement accounts |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3991603/ https://www.ncbi.nlm.nih.gov/pubmed/24747416 http://dx.doi.org/10.1371/journal.pone.0094308 |
work_keys_str_mv | AT mohammadisepahvandnazanin sequentialdecisionsacomputationalcomparisonofobservationalandreinforcementaccounts AT stottingerelisabeth sequentialdecisionsacomputationalcomparisonofobservationalandreinforcementaccounts AT danckertjames sequentialdecisionsacomputationalcomparisonofobservationalandreinforcementaccounts AT andersonbritt sequentialdecisionsacomputationalcomparisonofobservationalandreinforcementaccounts |