Cargando…

Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics

Experience replay is widely used in AI to bootstrap reinforcement learning (RL) by enabling an agent to remember and reuse past experiences. Classical techniques include shuffled-, reversed-ordered- and prioritized-memory buffers, which have different properties and advantages depending on the natur...

Descripción completa

Detalles Bibliográficos
Autores principales: Massi, Elisa, Barthélemy, Jeanne, Mailly, Juliane, Dromnelle, Rémi, Canitrot, Julien, Poniatowski, Esther, Girard, Benoît, Khamassi, Mehdi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9263850/
https://www.ncbi.nlm.nih.gov/pubmed/35812782
http://dx.doi.org/10.3389/fnbot.2022.864380
_version_ 1784742840299945984
author Massi, Elisa
Barthélemy, Jeanne
Mailly, Juliane
Dromnelle, Rémi
Canitrot, Julien
Poniatowski, Esther
Girard, Benoît
Khamassi, Mehdi
author_facet Massi, Elisa
Barthélemy, Jeanne
Mailly, Juliane
Dromnelle, Rémi
Canitrot, Julien
Poniatowski, Esther
Girard, Benoît
Khamassi, Mehdi
author_sort Massi, Elisa
collection PubMed
description Experience replay is widely used in AI to bootstrap reinforcement learning (RL) by enabling an agent to remember and reuse past experiences. Classical techniques include shuffled-, reversed-ordered- and prioritized-memory buffers, which have different properties and advantages depending on the nature of the data and problem. Interestingly, recent computational neuroscience work has shown that these techniques are relevant to model hippocampal reactivations recorded during rodent navigation. Nevertheless, the brain mechanisms for orchestrating hippocampal replay are still unclear. In this paper, we present recent neurorobotics research aiming to endow a navigating robot with a neuro-inspired RL architecture (including different learning strategies, such as model-based (MB) and model-free (MF), and different replay techniques). We illustrate through a series of numerical simulations how the specificities of robotic experimentation (e.g., autonomous state decomposition by the robot, noisy perception, state transition uncertainty, non-stationarity) can shed new lights on which replay techniques turn out to be more efficient in different situations. Finally, we close the loop by raising new hypotheses for neuroscience from such robotic models of hippocampal replay.
format Online
Article
Text
id pubmed-9263850
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-92638502022-07-09 Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics Massi, Elisa Barthélemy, Jeanne Mailly, Juliane Dromnelle, Rémi Canitrot, Julien Poniatowski, Esther Girard, Benoît Khamassi, Mehdi Front Neurorobot Neuroscience Experience replay is widely used in AI to bootstrap reinforcement learning (RL) by enabling an agent to remember and reuse past experiences. Classical techniques include shuffled-, reversed-ordered- and prioritized-memory buffers, which have different properties and advantages depending on the nature of the data and problem. Interestingly, recent computational neuroscience work has shown that these techniques are relevant to model hippocampal reactivations recorded during rodent navigation. Nevertheless, the brain mechanisms for orchestrating hippocampal replay are still unclear. In this paper, we present recent neurorobotics research aiming to endow a navigating robot with a neuro-inspired RL architecture (including different learning strategies, such as model-based (MB) and model-free (MF), and different replay techniques). We illustrate through a series of numerical simulations how the specificities of robotic experimentation (e.g., autonomous state decomposition by the robot, noisy perception, state transition uncertainty, non-stationarity) can shed new lights on which replay techniques turn out to be more efficient in different situations. Finally, we close the loop by raising new hypotheses for neuroscience from such robotic models of hippocampal replay. Frontiers Media S.A. 2022-06-24 /pmc/articles/PMC9263850/ /pubmed/35812782 http://dx.doi.org/10.3389/fnbot.2022.864380 Text en Copyright © 2022 Massi, Barthélemy, Mailly, Dromnelle, Canitrot, Poniatowski, Girard and Khamassi. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Massi, Elisa
Barthélemy, Jeanne
Mailly, Juliane
Dromnelle, Rémi
Canitrot, Julien
Poniatowski, Esther
Girard, Benoît
Khamassi, Mehdi
Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics
title Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics
title_full Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics
title_fullStr Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics
title_full_unstemmed Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics
title_short Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics
title_sort model-based and model-free replay mechanisms for reinforcement learning in neurorobotics
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9263850/
https://www.ncbi.nlm.nih.gov/pubmed/35812782
http://dx.doi.org/10.3389/fnbot.2022.864380
work_keys_str_mv AT massielisa modelbasedandmodelfreereplaymechanismsforreinforcementlearninginneurorobotics
AT barthelemyjeanne modelbasedandmodelfreereplaymechanismsforreinforcementlearninginneurorobotics
AT maillyjuliane modelbasedandmodelfreereplaymechanismsforreinforcementlearninginneurorobotics
AT dromnelleremi modelbasedandmodelfreereplaymechanismsforreinforcementlearninginneurorobotics
AT canitrotjulien modelbasedandmodelfreereplaymechanismsforreinforcementlearninginneurorobotics
AT poniatowskiesther modelbasedandmodelfreereplaymechanismsforreinforcementlearninginneurorobotics
AT girardbenoit modelbasedandmodelfreereplaymechanismsforreinforcementlearninginneurorobotics
AT khamassimehdi modelbasedandmodelfreereplaymechanismsforreinforcementlearninginneurorobotics