Cargando…

Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts

Right brain damaged patients show impairments in sequential decision making tasks for which healthy people do not show any difficulty. We hypothesized that this difficulty could be due to the failure of right brain damage patients to develop well-matched models of the world. Our motivation is the id...

Descripción completa

Detalles Bibliográficos
Autores principales: Mohammadi Sepahvand, Nazanin, Stöttinger, Elisabeth, Danckert, James, Anderson, Britt
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3991603/
https://www.ncbi.nlm.nih.gov/pubmed/24747416
http://dx.doi.org/10.1371/journal.pone.0094308
_version_ 1782312466708955136
author Mohammadi Sepahvand, Nazanin
Stöttinger, Elisabeth
Danckert, James
Anderson, Britt
author_facet Mohammadi Sepahvand, Nazanin
Stöttinger, Elisabeth
Danckert, James
Anderson, Britt
author_sort Mohammadi Sepahvand, Nazanin
collection PubMed
description Right brain damaged patients show impairments in sequential decision making tasks for which healthy people do not show any difficulty. We hypothesized that this difficulty could be due to the failure of right brain damage patients to develop well-matched models of the world. Our motivation is the idea that to navigate uncertainty, humans use models of the world to direct the decisions they make when interacting with their environment. The better the model is, the better their decisions are. To explore the model building and updating process in humans and the basis for impairment after brain injury, we used a computational model of non-stationary sequence learning. RELPH (Reinforcement and Entropy Learned Pruned Hypothesis space) was able to qualitatively and quantitatively reproduce the results of left and right brain damaged patient groups and healthy controls playing a sequential version of Rock, Paper, Scissors. Our results suggests that, in general, humans employ a sub-optimal reinforcement based learning method rather than an objectively better statistical learning approach, and that differences between right brain damaged and healthy control groups can be explained by different exploration policies, rather than qualitatively different learning mechanisms.
format Online
Article
Text
id pubmed-3991603
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39916032014-04-21 Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts Mohammadi Sepahvand, Nazanin Stöttinger, Elisabeth Danckert, James Anderson, Britt PLoS One Research Article Right brain damaged patients show impairments in sequential decision making tasks for which healthy people do not show any difficulty. We hypothesized that this difficulty could be due to the failure of right brain damage patients to develop well-matched models of the world. Our motivation is the idea that to navigate uncertainty, humans use models of the world to direct the decisions they make when interacting with their environment. The better the model is, the better their decisions are. To explore the model building and updating process in humans and the basis for impairment after brain injury, we used a computational model of non-stationary sequence learning. RELPH (Reinforcement and Entropy Learned Pruned Hypothesis space) was able to qualitatively and quantitatively reproduce the results of left and right brain damaged patient groups and healthy controls playing a sequential version of Rock, Paper, Scissors. Our results suggests that, in general, humans employ a sub-optimal reinforcement based learning method rather than an objectively better statistical learning approach, and that differences between right brain damaged and healthy control groups can be explained by different exploration policies, rather than qualitatively different learning mechanisms. Public Library of Science 2014-04-18 /pmc/articles/PMC3991603/ /pubmed/24747416 http://dx.doi.org/10.1371/journal.pone.0094308 Text en © 2014 Mohammadi Sepahvand et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Mohammadi Sepahvand, Nazanin
Stöttinger, Elisabeth
Danckert, James
Anderson, Britt
Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts
title Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts
title_full Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts
title_fullStr Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts
title_full_unstemmed Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts
title_short Sequential Decisions: A Computational Comparison of Observational and Reinforcement Accounts
title_sort sequential decisions: a computational comparison of observational and reinforcement accounts
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3991603/
https://www.ncbi.nlm.nih.gov/pubmed/24747416
http://dx.doi.org/10.1371/journal.pone.0094308
work_keys_str_mv AT mohammadisepahvandnazanin sequentialdecisionsacomputationalcomparisonofobservationalandreinforcementaccounts
AT stottingerelisabeth sequentialdecisionsacomputationalcomparisonofobservationalandreinforcementaccounts
AT danckertjames sequentialdecisionsacomputationalcomparisonofobservationalandreinforcementaccounts
AT andersonbritt sequentialdecisionsacomputationalcomparisonofobservationalandreinforcementaccounts