Cargando…
Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics
Experience replay is widely used in AI to bootstrap reinforcement learning (RL) by enabling an agent to remember and reuse past experiences. Classical techniques include shuffled-, reversed-ordered- and prioritized-memory buffers, which have different properties and advantages depending on the natur...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9263850/ https://www.ncbi.nlm.nih.gov/pubmed/35812782 http://dx.doi.org/10.3389/fnbot.2022.864380 |
_version_ | 1784742840299945984 |
---|---|
author | Massi, Elisa Barthélemy, Jeanne Mailly, Juliane Dromnelle, Rémi Canitrot, Julien Poniatowski, Esther Girard, Benoît Khamassi, Mehdi |
author_facet | Massi, Elisa Barthélemy, Jeanne Mailly, Juliane Dromnelle, Rémi Canitrot, Julien Poniatowski, Esther Girard, Benoît Khamassi, Mehdi |
author_sort | Massi, Elisa |
collection | PubMed |
description | Experience replay is widely used in AI to bootstrap reinforcement learning (RL) by enabling an agent to remember and reuse past experiences. Classical techniques include shuffled-, reversed-ordered- and prioritized-memory buffers, which have different properties and advantages depending on the nature of the data and problem. Interestingly, recent computational neuroscience work has shown that these techniques are relevant to model hippocampal reactivations recorded during rodent navigation. Nevertheless, the brain mechanisms for orchestrating hippocampal replay are still unclear. In this paper, we present recent neurorobotics research aiming to endow a navigating robot with a neuro-inspired RL architecture (including different learning strategies, such as model-based (MB) and model-free (MF), and different replay techniques). We illustrate through a series of numerical simulations how the specificities of robotic experimentation (e.g., autonomous state decomposition by the robot, noisy perception, state transition uncertainty, non-stationarity) can shed new lights on which replay techniques turn out to be more efficient in different situations. Finally, we close the loop by raising new hypotheses for neuroscience from such robotic models of hippocampal replay. |
format | Online Article Text |
id | pubmed-9263850 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-92638502022-07-09 Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics Massi, Elisa Barthélemy, Jeanne Mailly, Juliane Dromnelle, Rémi Canitrot, Julien Poniatowski, Esther Girard, Benoît Khamassi, Mehdi Front Neurorobot Neuroscience Experience replay is widely used in AI to bootstrap reinforcement learning (RL) by enabling an agent to remember and reuse past experiences. Classical techniques include shuffled-, reversed-ordered- and prioritized-memory buffers, which have different properties and advantages depending on the nature of the data and problem. Interestingly, recent computational neuroscience work has shown that these techniques are relevant to model hippocampal reactivations recorded during rodent navigation. Nevertheless, the brain mechanisms for orchestrating hippocampal replay are still unclear. In this paper, we present recent neurorobotics research aiming to endow a navigating robot with a neuro-inspired RL architecture (including different learning strategies, such as model-based (MB) and model-free (MF), and different replay techniques). We illustrate through a series of numerical simulations how the specificities of robotic experimentation (e.g., autonomous state decomposition by the robot, noisy perception, state transition uncertainty, non-stationarity) can shed new lights on which replay techniques turn out to be more efficient in different situations. Finally, we close the loop by raising new hypotheses for neuroscience from such robotic models of hippocampal replay. Frontiers Media S.A. 2022-06-24 /pmc/articles/PMC9263850/ /pubmed/35812782 http://dx.doi.org/10.3389/fnbot.2022.864380 Text en Copyright © 2022 Massi, Barthélemy, Mailly, Dromnelle, Canitrot, Poniatowski, Girard and Khamassi. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Massi, Elisa Barthélemy, Jeanne Mailly, Juliane Dromnelle, Rémi Canitrot, Julien Poniatowski, Esther Girard, Benoît Khamassi, Mehdi Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics |
title | Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics |
title_full | Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics |
title_fullStr | Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics |
title_full_unstemmed | Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics |
title_short | Model-Based and Model-Free Replay Mechanisms for Reinforcement Learning in Neurorobotics |
title_sort | model-based and model-free replay mechanisms for reinforcement learning in neurorobotics |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9263850/ https://www.ncbi.nlm.nih.gov/pubmed/35812782 http://dx.doi.org/10.3389/fnbot.2022.864380 |
work_keys_str_mv | AT massielisa modelbasedandmodelfreereplaymechanismsforreinforcementlearninginneurorobotics AT barthelemyjeanne modelbasedandmodelfreereplaymechanismsforreinforcementlearninginneurorobotics AT maillyjuliane modelbasedandmodelfreereplaymechanismsforreinforcementlearninginneurorobotics AT dromnelleremi modelbasedandmodelfreereplaymechanismsforreinforcementlearninginneurorobotics AT canitrotjulien modelbasedandmodelfreereplaymechanismsforreinforcementlearninginneurorobotics AT poniatowskiesther modelbasedandmodelfreereplaymechanismsforreinforcementlearninginneurorobotics AT girardbenoit modelbasedandmodelfreereplaymechanismsforreinforcementlearninginneurorobotics AT khamassimehdi modelbasedandmodelfreereplaymechanismsforreinforcementlearninginneurorobotics |