Cargando…

Use of Frontal Lobe Hemodynamics as Reinforcement Signals to an Adaptive Controller

Decision-making ability in the frontal lobe (among other brain structures) relies on the assignment of value to states of the animal and its environment. Then higher valued states can be pursued and lower (or negative) valued states avoided. The same principle forms the basis for computational reinf...

Descripción completa

Detalles Bibliográficos
Autores principales: DiStasio, Marcello M., Francis, Joseph T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3718814/
https://www.ncbi.nlm.nih.gov/pubmed/23894500
http://dx.doi.org/10.1371/journal.pone.0069541
_version_ 1782277828308369408
author DiStasio, Marcello M.
Francis, Joseph T.
author_facet DiStasio, Marcello M.
Francis, Joseph T.
author_sort DiStasio, Marcello M.
collection PubMed
description Decision-making ability in the frontal lobe (among other brain structures) relies on the assignment of value to states of the animal and its environment. Then higher valued states can be pursued and lower (or negative) valued states avoided. The same principle forms the basis for computational reinforcement learning controllers, which have been fruitfully applied both as models of value estimation in the brain, and as artificial controllers in their own right. This work shows how state desirability signals decoded from frontal lobe hemodynamics, as measured with near-infrared spectroscopy (NIRS), can be applied as reinforcers to an adaptable artificial learning agent in order to guide its acquisition of skills. A set of experiments carried out on an alert macaque demonstrate that both oxy- and deoxyhemoglobin concentrations in the frontal lobe show differences in response to both primarily and secondarily desirable (versus undesirable) stimuli. This difference allows a NIRS signal classifier to serve successfully as a reinforcer for an adaptive controller performing a virtual tool-retrieval task. The agent's adaptability allows its performance to exceed the limits of the NIRS classifier decoding accuracy. We also show that decoding state desirabilities is more accurate when using relative concentrations of both oxyhemoglobin and deoxyhemoglobin, rather than either species alone.
format Online
Article
Text
id pubmed-3718814
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37188142013-07-26 Use of Frontal Lobe Hemodynamics as Reinforcement Signals to an Adaptive Controller DiStasio, Marcello M. Francis, Joseph T. PLoS One Research Article Decision-making ability in the frontal lobe (among other brain structures) relies on the assignment of value to states of the animal and its environment. Then higher valued states can be pursued and lower (or negative) valued states avoided. The same principle forms the basis for computational reinforcement learning controllers, which have been fruitfully applied both as models of value estimation in the brain, and as artificial controllers in their own right. This work shows how state desirability signals decoded from frontal lobe hemodynamics, as measured with near-infrared spectroscopy (NIRS), can be applied as reinforcers to an adaptable artificial learning agent in order to guide its acquisition of skills. A set of experiments carried out on an alert macaque demonstrate that both oxy- and deoxyhemoglobin concentrations in the frontal lobe show differences in response to both primarily and secondarily desirable (versus undesirable) stimuli. This difference allows a NIRS signal classifier to serve successfully as a reinforcer for an adaptive controller performing a virtual tool-retrieval task. The agent's adaptability allows its performance to exceed the limits of the NIRS classifier decoding accuracy. We also show that decoding state desirabilities is more accurate when using relative concentrations of both oxyhemoglobin and deoxyhemoglobin, rather than either species alone. Public Library of Science 2013-07-22 /pmc/articles/PMC3718814/ /pubmed/23894500 http://dx.doi.org/10.1371/journal.pone.0069541 Text en © 2013 DiStasio, Francis http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
DiStasio, Marcello M.
Francis, Joseph T.
Use of Frontal Lobe Hemodynamics as Reinforcement Signals to an Adaptive Controller
title Use of Frontal Lobe Hemodynamics as Reinforcement Signals to an Adaptive Controller
title_full Use of Frontal Lobe Hemodynamics as Reinforcement Signals to an Adaptive Controller
title_fullStr Use of Frontal Lobe Hemodynamics as Reinforcement Signals to an Adaptive Controller
title_full_unstemmed Use of Frontal Lobe Hemodynamics as Reinforcement Signals to an Adaptive Controller
title_short Use of Frontal Lobe Hemodynamics as Reinforcement Signals to an Adaptive Controller
title_sort use of frontal lobe hemodynamics as reinforcement signals to an adaptive controller
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3718814/
https://www.ncbi.nlm.nih.gov/pubmed/23894500
http://dx.doi.org/10.1371/journal.pone.0069541
work_keys_str_mv AT distasiomarcellom useoffrontallobehemodynamicsasreinforcementsignalstoanadaptivecontroller
AT francisjosepht useoffrontallobehemodynamicsasreinforcementsignalstoanadaptivecontroller