Cargando…

Embodied Synaptic Plasticity With Online Reinforcement Learning

The endeavor to understand the brain involves multiple collaborating research fields. Classically, synaptic plasticity rules derived by theoretical neuroscientists are evaluated in isolation on pattern classification tasks. This contrasts with the biological brain which purpose is to control a body...

Descripción completa

Detalles Bibliográficos
Autores principales: Kaiser, Jacques, Hoff, Michael, Konle, Andreas, Vasquez Tieck, J. Camilo, Kappel, David, Reichard, Daniel, Subramoney, Anand, Legenstein, Robert, Roennau, Arne, Maass, Wolfgang, Dillmann, Rüdiger
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6786305/
https://www.ncbi.nlm.nih.gov/pubmed/31632262
http://dx.doi.org/10.3389/fnbot.2019.00081
_version_ 1783458049779826688
author Kaiser, Jacques
Hoff, Michael
Konle, Andreas
Vasquez Tieck, J. Camilo
Kappel, David
Reichard, Daniel
Subramoney, Anand
Legenstein, Robert
Roennau, Arne
Maass, Wolfgang
Dillmann, Rüdiger
author_facet Kaiser, Jacques
Hoff, Michael
Konle, Andreas
Vasquez Tieck, J. Camilo
Kappel, David
Reichard, Daniel
Subramoney, Anand
Legenstein, Robert
Roennau, Arne
Maass, Wolfgang
Dillmann, Rüdiger
author_sort Kaiser, Jacques
collection PubMed
description The endeavor to understand the brain involves multiple collaborating research fields. Classically, synaptic plasticity rules derived by theoretical neuroscientists are evaluated in isolation on pattern classification tasks. This contrasts with the biological brain which purpose is to control a body in closed-loop. This paper contributes to bringing the fields of computational neuroscience and robotics closer together by integrating open-source software components from these two fields. The resulting framework allows to evaluate the validity of biologically-plausibe plasticity models in closed-loop robotics environments. We demonstrate this framework to evaluate Synaptic Plasticity with Online REinforcement learning (SPORE), a reward-learning rule based on synaptic sampling, on two visuomotor tasks: reaching and lane following. We show that SPORE is capable of learning to perform policies within the course of simulated hours for both tasks. Provisional parameter explorations indicate that the learning rate and the temperature driving the stochastic processes that govern synaptic learning dynamics need to be regulated for performance improvements to be retained. We conclude by discussing the recent deep reinforcement learning techniques which would be beneficial to increase the functionality of SPORE on visuomotor tasks.
format Online
Article
Text
id pubmed-6786305
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-67863052019-10-18 Embodied Synaptic Plasticity With Online Reinforcement Learning Kaiser, Jacques Hoff, Michael Konle, Andreas Vasquez Tieck, J. Camilo Kappel, David Reichard, Daniel Subramoney, Anand Legenstein, Robert Roennau, Arne Maass, Wolfgang Dillmann, Rüdiger Front Neurorobot Neuroscience The endeavor to understand the brain involves multiple collaborating research fields. Classically, synaptic plasticity rules derived by theoretical neuroscientists are evaluated in isolation on pattern classification tasks. This contrasts with the biological brain which purpose is to control a body in closed-loop. This paper contributes to bringing the fields of computational neuroscience and robotics closer together by integrating open-source software components from these two fields. The resulting framework allows to evaluate the validity of biologically-plausibe plasticity models in closed-loop robotics environments. We demonstrate this framework to evaluate Synaptic Plasticity with Online REinforcement learning (SPORE), a reward-learning rule based on synaptic sampling, on two visuomotor tasks: reaching and lane following. We show that SPORE is capable of learning to perform policies within the course of simulated hours for both tasks. Provisional parameter explorations indicate that the learning rate and the temperature driving the stochastic processes that govern synaptic learning dynamics need to be regulated for performance improvements to be retained. We conclude by discussing the recent deep reinforcement learning techniques which would be beneficial to increase the functionality of SPORE on visuomotor tasks. Frontiers Media S.A. 2019-10-03 /pmc/articles/PMC6786305/ /pubmed/31632262 http://dx.doi.org/10.3389/fnbot.2019.00081 Text en Copyright © 2019 Kaiser, Hoff, Konle, Vasquez Tieck, Kappel, Reichard, Subramoney, Legenstein, Roennau, Maass and Dillmann. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Kaiser, Jacques
Hoff, Michael
Konle, Andreas
Vasquez Tieck, J. Camilo
Kappel, David
Reichard, Daniel
Subramoney, Anand
Legenstein, Robert
Roennau, Arne
Maass, Wolfgang
Dillmann, Rüdiger
Embodied Synaptic Plasticity With Online Reinforcement Learning
title Embodied Synaptic Plasticity With Online Reinforcement Learning
title_full Embodied Synaptic Plasticity With Online Reinforcement Learning
title_fullStr Embodied Synaptic Plasticity With Online Reinforcement Learning
title_full_unstemmed Embodied Synaptic Plasticity With Online Reinforcement Learning
title_short Embodied Synaptic Plasticity With Online Reinforcement Learning
title_sort embodied synaptic plasticity with online reinforcement learning
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6786305/
https://www.ncbi.nlm.nih.gov/pubmed/31632262
http://dx.doi.org/10.3389/fnbot.2019.00081
work_keys_str_mv AT kaiserjacques embodiedsynapticplasticitywithonlinereinforcementlearning
AT hoffmichael embodiedsynapticplasticitywithonlinereinforcementlearning
AT konleandreas embodiedsynapticplasticitywithonlinereinforcementlearning
AT vasqueztieckjcamilo embodiedsynapticplasticitywithonlinereinforcementlearning
AT kappeldavid embodiedsynapticplasticitywithonlinereinforcementlearning
AT reicharddaniel embodiedsynapticplasticitywithonlinereinforcementlearning
AT subramoneyanand embodiedsynapticplasticitywithonlinereinforcementlearning
AT legensteinrobert embodiedsynapticplasticitywithonlinereinforcementlearning
AT roennauarne embodiedsynapticplasticitywithonlinereinforcementlearning
AT maasswolfgang embodiedsynapticplasticitywithonlinereinforcementlearning
AT dillmannrudiger embodiedsynapticplasticitywithonlinereinforcementlearning