Cargando…

The influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses

Previous research has shown that trial ordering affects cognitive performance, but this has not been tested using category-learning tasks that differentiate learning from reward and punishment. Here, we tested two groups of healthy young adults using a probabilistic category learning task of reward...

Descripción completa

Detalles Bibliográficos
Autores principales: Moustafa, Ahmed A., Gluck, Mark A., Herzallah, Mohammad M., Myers, Catherine E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4513240/
https://www.ncbi.nlm.nih.gov/pubmed/26257616
http://dx.doi.org/10.3389/fnbeh.2015.00153
_version_ 1782382614640852992
author Moustafa, Ahmed A.
Gluck, Mark A.
Herzallah, Mohammad M.
Myers, Catherine E.
author_facet Moustafa, Ahmed A.
Gluck, Mark A.
Herzallah, Mohammad M.
Myers, Catherine E.
author_sort Moustafa, Ahmed A.
collection PubMed
description Previous research has shown that trial ordering affects cognitive performance, but this has not been tested using category-learning tasks that differentiate learning from reward and punishment. Here, we tested two groups of healthy young adults using a probabilistic category learning task of reward and punishment in which there are two types of trials (reward, punishment) and three possible outcomes: (1) positive feedback for correct responses in reward trials; (2) negative feedback for incorrect responses in punishment trials; and (3) no feedback for incorrect answers in reward trials and correct answers in punishment trials. Hence, trials without feedback are ambiguous, and may represent either successful avoidance of punishment or failure to obtain reward. In Experiment 1, the first group of subjects received an intermixed task in which reward and punishment trials were presented in the same block, as a standard baseline task. In Experiment 2, a second group completed the separated task, in which reward and punishment trials were presented in separate blocks. Additionally, in order to understand the mechanisms underlying performance in the experimental conditions, we fit individual data using a Q-learning model. Results from Experiment 1 show that subjects who completed the intermixed task paradoxically valued the no-feedback outcome as a reinforcer when it occurred on reinforcement-based trials, and as a punisher when it occurred on punishment-based trials. This is supported by patterns of empirical responding, where subjects showed more win-stay behavior following an explicit reward than following an omission of punishment, and more lose-shift behavior following an explicit punisher than following an omission of reward. In Experiment 2, results showed similar performance whether subjects received reward-based or punishment-based trials first. However, when the Q-learning model was applied to these data, there were differences between subjects in the reward-first and punishment-first conditions on the relative weighting of neutral feedback. Specifically, early training on reward-based trials led to omission of reward being treated as similar to punishment, but prior training on punishment-based trials led to omission of reward being treated more neutrally. This suggests that early training on one type of trials, specifically reward-based trials, can create a bias in how neutral feedback is processed, relative to those receiving early punishment-based training or training that mixes positive and negative outcomes.
format Online
Article
Text
id pubmed-4513240
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-45132402015-08-07 The influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses Moustafa, Ahmed A. Gluck, Mark A. Herzallah, Mohammad M. Myers, Catherine E. Front Behav Neurosci Neuroscience Previous research has shown that trial ordering affects cognitive performance, but this has not been tested using category-learning tasks that differentiate learning from reward and punishment. Here, we tested two groups of healthy young adults using a probabilistic category learning task of reward and punishment in which there are two types of trials (reward, punishment) and three possible outcomes: (1) positive feedback for correct responses in reward trials; (2) negative feedback for incorrect responses in punishment trials; and (3) no feedback for incorrect answers in reward trials and correct answers in punishment trials. Hence, trials without feedback are ambiguous, and may represent either successful avoidance of punishment or failure to obtain reward. In Experiment 1, the first group of subjects received an intermixed task in which reward and punishment trials were presented in the same block, as a standard baseline task. In Experiment 2, a second group completed the separated task, in which reward and punishment trials were presented in separate blocks. Additionally, in order to understand the mechanisms underlying performance in the experimental conditions, we fit individual data using a Q-learning model. Results from Experiment 1 show that subjects who completed the intermixed task paradoxically valued the no-feedback outcome as a reinforcer when it occurred on reinforcement-based trials, and as a punisher when it occurred on punishment-based trials. This is supported by patterns of empirical responding, where subjects showed more win-stay behavior following an explicit reward than following an omission of punishment, and more lose-shift behavior following an explicit punisher than following an omission of reward. In Experiment 2, results showed similar performance whether subjects received reward-based or punishment-based trials first. However, when the Q-learning model was applied to these data, there were differences between subjects in the reward-first and punishment-first conditions on the relative weighting of neutral feedback. Specifically, early training on reward-based trials led to omission of reward being treated as similar to punishment, but prior training on punishment-based trials led to omission of reward being treated more neutrally. This suggests that early training on one type of trials, specifically reward-based trials, can create a bias in how neutral feedback is processed, relative to those receiving early punishment-based training or training that mixes positive and negative outcomes. Frontiers Media S.A. 2015-07-24 /pmc/articles/PMC4513240/ /pubmed/26257616 http://dx.doi.org/10.3389/fnbeh.2015.00153 Text en Copyright © 2015 Moustafa, Gluck, Herzallah and Myers. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution and reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Moustafa, Ahmed A.
Gluck, Mark A.
Herzallah, Mohammad M.
Myers, Catherine E.
The influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses
title The influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses
title_full The influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses
title_fullStr The influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses
title_full_unstemmed The influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses
title_short The influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses
title_sort influence of trial order on learning from reward vs. punishment in a probabilistic categorization task: experimental and computational analyses
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4513240/
https://www.ncbi.nlm.nih.gov/pubmed/26257616
http://dx.doi.org/10.3389/fnbeh.2015.00153
work_keys_str_mv AT moustafaahmeda theinfluenceoftrialorderonlearningfromrewardvspunishmentinaprobabilisticcategorizationtaskexperimentalandcomputationalanalyses
AT gluckmarka theinfluenceoftrialorderonlearningfromrewardvspunishmentinaprobabilisticcategorizationtaskexperimentalandcomputationalanalyses
AT herzallahmohammadm theinfluenceoftrialorderonlearningfromrewardvspunishmentinaprobabilisticcategorizationtaskexperimentalandcomputationalanalyses
AT myerscatherinee theinfluenceoftrialorderonlearningfromrewardvspunishmentinaprobabilisticcategorizationtaskexperimentalandcomputationalanalyses
AT moustafaahmeda influenceoftrialorderonlearningfromrewardvspunishmentinaprobabilisticcategorizationtaskexperimentalandcomputationalanalyses
AT gluckmarka influenceoftrialorderonlearningfromrewardvspunishmentinaprobabilisticcategorizationtaskexperimentalandcomputationalanalyses
AT herzallahmohammadm influenceoftrialorderonlearningfromrewardvspunishmentinaprobabilisticcategorizationtaskexperimentalandcomputationalanalyses
AT myerscatherinee influenceoftrialorderonlearningfromrewardvspunishmentinaprobabilisticcategorizationtaskexperimentalandcomputationalanalyses