Cargando…
Embodied Synaptic Plasticity With Online Reinforcement Learning
The endeavor to understand the brain involves multiple collaborating research fields. Classically, synaptic plasticity rules derived by theoretical neuroscientists are evaluated in isolation on pattern classification tasks. This contrasts with the biological brain which purpose is to control a body...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6786305/ https://www.ncbi.nlm.nih.gov/pubmed/31632262 http://dx.doi.org/10.3389/fnbot.2019.00081 |
_version_ | 1783458049779826688 |
---|---|
author | Kaiser, Jacques Hoff, Michael Konle, Andreas Vasquez Tieck, J. Camilo Kappel, David Reichard, Daniel Subramoney, Anand Legenstein, Robert Roennau, Arne Maass, Wolfgang Dillmann, Rüdiger |
author_facet | Kaiser, Jacques Hoff, Michael Konle, Andreas Vasquez Tieck, J. Camilo Kappel, David Reichard, Daniel Subramoney, Anand Legenstein, Robert Roennau, Arne Maass, Wolfgang Dillmann, Rüdiger |
author_sort | Kaiser, Jacques |
collection | PubMed |
description | The endeavor to understand the brain involves multiple collaborating research fields. Classically, synaptic plasticity rules derived by theoretical neuroscientists are evaluated in isolation on pattern classification tasks. This contrasts with the biological brain which purpose is to control a body in closed-loop. This paper contributes to bringing the fields of computational neuroscience and robotics closer together by integrating open-source software components from these two fields. The resulting framework allows to evaluate the validity of biologically-plausibe plasticity models in closed-loop robotics environments. We demonstrate this framework to evaluate Synaptic Plasticity with Online REinforcement learning (SPORE), a reward-learning rule based on synaptic sampling, on two visuomotor tasks: reaching and lane following. We show that SPORE is capable of learning to perform policies within the course of simulated hours for both tasks. Provisional parameter explorations indicate that the learning rate and the temperature driving the stochastic processes that govern synaptic learning dynamics need to be regulated for performance improvements to be retained. We conclude by discussing the recent deep reinforcement learning techniques which would be beneficial to increase the functionality of SPORE on visuomotor tasks. |
format | Online Article Text |
id | pubmed-6786305 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-67863052019-10-18 Embodied Synaptic Plasticity With Online Reinforcement Learning Kaiser, Jacques Hoff, Michael Konle, Andreas Vasquez Tieck, J. Camilo Kappel, David Reichard, Daniel Subramoney, Anand Legenstein, Robert Roennau, Arne Maass, Wolfgang Dillmann, Rüdiger Front Neurorobot Neuroscience The endeavor to understand the brain involves multiple collaborating research fields. Classically, synaptic plasticity rules derived by theoretical neuroscientists are evaluated in isolation on pattern classification tasks. This contrasts with the biological brain which purpose is to control a body in closed-loop. This paper contributes to bringing the fields of computational neuroscience and robotics closer together by integrating open-source software components from these two fields. The resulting framework allows to evaluate the validity of biologically-plausibe plasticity models in closed-loop robotics environments. We demonstrate this framework to evaluate Synaptic Plasticity with Online REinforcement learning (SPORE), a reward-learning rule based on synaptic sampling, on two visuomotor tasks: reaching and lane following. We show that SPORE is capable of learning to perform policies within the course of simulated hours for both tasks. Provisional parameter explorations indicate that the learning rate and the temperature driving the stochastic processes that govern synaptic learning dynamics need to be regulated for performance improvements to be retained. We conclude by discussing the recent deep reinforcement learning techniques which would be beneficial to increase the functionality of SPORE on visuomotor tasks. Frontiers Media S.A. 2019-10-03 /pmc/articles/PMC6786305/ /pubmed/31632262 http://dx.doi.org/10.3389/fnbot.2019.00081 Text en Copyright © 2019 Kaiser, Hoff, Konle, Vasquez Tieck, Kappel, Reichard, Subramoney, Legenstein, Roennau, Maass and Dillmann. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Kaiser, Jacques Hoff, Michael Konle, Andreas Vasquez Tieck, J. Camilo Kappel, David Reichard, Daniel Subramoney, Anand Legenstein, Robert Roennau, Arne Maass, Wolfgang Dillmann, Rüdiger Embodied Synaptic Plasticity With Online Reinforcement Learning |
title | Embodied Synaptic Plasticity With Online Reinforcement Learning |
title_full | Embodied Synaptic Plasticity With Online Reinforcement Learning |
title_fullStr | Embodied Synaptic Plasticity With Online Reinforcement Learning |
title_full_unstemmed | Embodied Synaptic Plasticity With Online Reinforcement Learning |
title_short | Embodied Synaptic Plasticity With Online Reinforcement Learning |
title_sort | embodied synaptic plasticity with online reinforcement learning |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6786305/ https://www.ncbi.nlm.nih.gov/pubmed/31632262 http://dx.doi.org/10.3389/fnbot.2019.00081 |
work_keys_str_mv | AT kaiserjacques embodiedsynapticplasticitywithonlinereinforcementlearning AT hoffmichael embodiedsynapticplasticitywithonlinereinforcementlearning AT konleandreas embodiedsynapticplasticitywithonlinereinforcementlearning AT vasqueztieckjcamilo embodiedsynapticplasticitywithonlinereinforcementlearning AT kappeldavid embodiedsynapticplasticitywithonlinereinforcementlearning AT reicharddaniel embodiedsynapticplasticitywithonlinereinforcementlearning AT subramoneyanand embodiedsynapticplasticitywithonlinereinforcementlearning AT legensteinrobert embodiedsynapticplasticitywithonlinereinforcementlearning AT roennauarne embodiedsynapticplasticitywithonlinereinforcementlearning AT maasswolfgang embodiedsynapticplasticitywithonlinereinforcementlearning AT dillmannrudiger embodiedsynapticplasticitywithonlinereinforcementlearning |