Cargando…
The influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses
Previous research has shown that trial ordering affects cognitive performance, but this has not been tested using category-learning tasks that differentiate learning from reward and punishment. Here, we tested two groups of healthy young adults using a probabilistic category learning task of reward...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4513240/ https://www.ncbi.nlm.nih.gov/pubmed/26257616 http://dx.doi.org/10.3389/fnbeh.2015.00153 |
_version_ | 1782382614640852992 |
---|---|
author | Moustafa, Ahmed A. Gluck, Mark A. Herzallah, Mohammad M. Myers, Catherine E. |
author_facet | Moustafa, Ahmed A. Gluck, Mark A. Herzallah, Mohammad M. Myers, Catherine E. |
author_sort | Moustafa, Ahmed A. |
collection | PubMed |
description | Previous research has shown that trial ordering affects cognitive performance, but this has not been tested using category-learning tasks that differentiate learning from reward and punishment. Here, we tested two groups of healthy young adults using a probabilistic category learning task of reward and punishment in which there are two types of trials (reward, punishment) and three possible outcomes: (1) positive feedback for correct responses in reward trials; (2) negative feedback for incorrect responses in punishment trials; and (3) no feedback for incorrect answers in reward trials and correct answers in punishment trials. Hence, trials without feedback are ambiguous, and may represent either successful avoidance of punishment or failure to obtain reward. In Experiment 1, the first group of subjects received an intermixed task in which reward and punishment trials were presented in the same block, as a standard baseline task. In Experiment 2, a second group completed the separated task, in which reward and punishment trials were presented in separate blocks. Additionally, in order to understand the mechanisms underlying performance in the experimental conditions, we fit individual data using a Q-learning model. Results from Experiment 1 show that subjects who completed the intermixed task paradoxically valued the no-feedback outcome as a reinforcer when it occurred on reinforcement-based trials, and as a punisher when it occurred on punishment-based trials. This is supported by patterns of empirical responding, where subjects showed more win-stay behavior following an explicit reward than following an omission of punishment, and more lose-shift behavior following an explicit punisher than following an omission of reward. In Experiment 2, results showed similar performance whether subjects received reward-based or punishment-based trials first. However, when the Q-learning model was applied to these data, there were differences between subjects in the reward-first and punishment-first conditions on the relative weighting of neutral feedback. Specifically, early training on reward-based trials led to omission of reward being treated as similar to punishment, but prior training on punishment-based trials led to omission of reward being treated more neutrally. This suggests that early training on one type of trials, specifically reward-based trials, can create a bias in how neutral feedback is processed, relative to those receiving early punishment-based training or training that mixes positive and negative outcomes. |
format | Online Article Text |
id | pubmed-4513240 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-45132402015-08-07 The influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses Moustafa, Ahmed A. Gluck, Mark A. Herzallah, Mohammad M. Myers, Catherine E. Front Behav Neurosci Neuroscience Previous research has shown that trial ordering affects cognitive performance, but this has not been tested using category-learning tasks that differentiate learning from reward and punishment. Here, we tested two groups of healthy young adults using a probabilistic category learning task of reward and punishment in which there are two types of trials (reward, punishment) and three possible outcomes: (1) positive feedback for correct responses in reward trials; (2) negative feedback for incorrect responses in punishment trials; and (3) no feedback for incorrect answers in reward trials and correct answers in punishment trials. Hence, trials without feedback are ambiguous, and may represent either successful avoidance of punishment or failure to obtain reward. In Experiment 1, the first group of subjects received an intermixed task in which reward and punishment trials were presented in the same block, as a standard baseline task. In Experiment 2, a second group completed the separated task, in which reward and punishment trials were presented in separate blocks. Additionally, in order to understand the mechanisms underlying performance in the experimental conditions, we fit individual data using a Q-learning model. Results from Experiment 1 show that subjects who completed the intermixed task paradoxically valued the no-feedback outcome as a reinforcer when it occurred on reinforcement-based trials, and as a punisher when it occurred on punishment-based trials. This is supported by patterns of empirical responding, where subjects showed more win-stay behavior following an explicit reward than following an omission of punishment, and more lose-shift behavior following an explicit punisher than following an omission of reward. In Experiment 2, results showed similar performance whether subjects received reward-based or punishment-based trials first. However, when the Q-learning model was applied to these data, there were differences between subjects in the reward-first and punishment-first conditions on the relative weighting of neutral feedback. Specifically, early training on reward-based trials led to omission of reward being treated as similar to punishment, but prior training on punishment-based trials led to omission of reward being treated more neutrally. This suggests that early training on one type of trials, specifically reward-based trials, can create a bias in how neutral feedback is processed, relative to those receiving early punishment-based training or training that mixes positive and negative outcomes. Frontiers Media S.A. 2015-07-24 /pmc/articles/PMC4513240/ /pubmed/26257616 http://dx.doi.org/10.3389/fnbeh.2015.00153 Text en Copyright © 2015 Moustafa, Gluck, Herzallah and Myers. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution and reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Moustafa, Ahmed A. Gluck, Mark A. Herzallah, Mohammad M. Myers, Catherine E. The influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses |
title | The influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses |
title_full | The influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses |
title_fullStr | The influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses |
title_full_unstemmed | The influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses |
title_short | The influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses |
title_sort | influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4513240/ https://www.ncbi.nlm.nih.gov/pubmed/26257616 http://dx.doi.org/10.3389/fnbeh.2015.00153 |
work_keys_str_mv | AT moustafaahmeda theinfluenceoftrialorderonlearningfromrewardvspunishmentinaprobabilisticcategorizationtaskexperimentalandcomputationalanalyses AT gluckmarka theinfluenceoftrialorderonlearningfromrewardvspunishmentinaprobabilisticcategorizationtaskexperimentalandcomputationalanalyses AT herzallahmohammadm theinfluenceoftrialorderonlearningfromrewardvspunishmentinaprobabilisticcategorizationtaskexperimentalandcomputationalanalyses AT myerscatherinee theinfluenceoftrialorderonlearningfromrewardvspunishmentinaprobabilisticcategorizationtaskexperimentalandcomputationalanalyses AT moustafaahmeda influenceoftrialorderonlearningfromrewardvspunishmentinaprobabilisticcategorizationtaskexperimentalandcomputationalanalyses AT gluckmarka influenceoftrialorderonlearningfromrewardvspunishmentinaprobabilisticcategorizationtaskexperimentalandcomputationalanalyses AT herzallahmohammadm influenceoftrialorderonlearningfromrewardvspunishmentinaprobabilisticcategorizationtaskexperimentalandcomputationalanalyses AT myerscatherinee influenceoftrialorderonlearningfromrewardvspunishmentinaprobabilisticcategorizationtaskexperimentalandcomputationalanalyses |