Cargando…

Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle

Spiking neural networks (SNNs) offer many advantages over traditional artificial neural networks (ANNs) such as biological plausibility, fast information processing, and energy efficiency. Although SNNs have been used to solve a variety of control tasks using the Spike-Timing-Dependent Plasticity (S...

Descripción completa

Detalles Bibliográficos
Autores principales: Bing, Zhenshan, Baumann, Ivan, Jiang, Zhuangyi, Huang, Kai, Cai, Caixia, Knoll, Alois
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6509616/
https://www.ncbi.nlm.nih.gov/pubmed/31130854
http://dx.doi.org/10.3389/fnbot.2019.00018
_version_ 1783417279472467968
author Bing, Zhenshan
Baumann, Ivan
Jiang, Zhuangyi
Huang, Kai
Cai, Caixia
Knoll, Alois
author_facet Bing, Zhenshan
Baumann, Ivan
Jiang, Zhuangyi
Huang, Kai
Cai, Caixia
Knoll, Alois
author_sort Bing, Zhenshan
collection PubMed
description Spiking neural networks (SNNs) offer many advantages over traditional artificial neural networks (ANNs) such as biological plausibility, fast information processing, and energy efficiency. Although SNNs have been used to solve a variety of control tasks using the Spike-Timing-Dependent Plasticity (STDP) learning rule, existing solutions usually involve hard-coded network architectures solving specific tasks rather than solving different kinds of tasks generally. This results in neglecting one of the biggest advantages of ANNs, i.e., being general-purpose and easy-to-use due to their simple network architecture, which usually consists of an input layer, one or multiple hidden layers and an output layer. This paper addresses the problem by introducing an end-to-end learning approach of spiking neural networks constructed with one hidden layer and reward-modulated Spike-Timing-Dependent Plasticity (R-STDP) synapses in an all-to-all fashion. We use the supervised reward-modulated Spike-Timing-Dependent-Plasticity learning rule to train two different SNN-based sub-controllers to replicate a desired obstacle avoiding and goal approaching behavior, provided by pre-generated datasets. Together they make up a target-reaching controller, which is used to control a simulated mobile robot to reach a target area while avoiding obstacles in its path. We demonstrate the performance and effectiveness of our trained SNNs to achieve target reaching tasks in different unknown scenarios.
format Online
Article
Text
id pubmed-6509616
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-65096162019-05-24 Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle Bing, Zhenshan Baumann, Ivan Jiang, Zhuangyi Huang, Kai Cai, Caixia Knoll, Alois Front Neurorobot Neuroscience Spiking neural networks (SNNs) offer many advantages over traditional artificial neural networks (ANNs) such as biological plausibility, fast information processing, and energy efficiency. Although SNNs have been used to solve a variety of control tasks using the Spike-Timing-Dependent Plasticity (STDP) learning rule, existing solutions usually involve hard-coded network architectures solving specific tasks rather than solving different kinds of tasks generally. This results in neglecting one of the biggest advantages of ANNs, i.e., being general-purpose and easy-to-use due to their simple network architecture, which usually consists of an input layer, one or multiple hidden layers and an output layer. This paper addresses the problem by introducing an end-to-end learning approach of spiking neural networks constructed with one hidden layer and reward-modulated Spike-Timing-Dependent Plasticity (R-STDP) synapses in an all-to-all fashion. We use the supervised reward-modulated Spike-Timing-Dependent-Plasticity learning rule to train two different SNN-based sub-controllers to replicate a desired obstacle avoiding and goal approaching behavior, provided by pre-generated datasets. Together they make up a target-reaching controller, which is used to control a simulated mobile robot to reach a target area while avoiding obstacles in its path. We demonstrate the performance and effectiveness of our trained SNNs to achieve target reaching tasks in different unknown scenarios. Frontiers Media S.A. 2019-05-03 /pmc/articles/PMC6509616/ /pubmed/31130854 http://dx.doi.org/10.3389/fnbot.2019.00018 Text en Copyright © 2019 Bing, Baumann, Jiang, Huang, Cai and Knoll. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Bing, Zhenshan
Baumann, Ivan
Jiang, Zhuangyi
Huang, Kai
Cai, Caixia
Knoll, Alois
Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle
title Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle
title_full Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle
title_fullStr Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle
title_full_unstemmed Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle
title_short Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle
title_sort supervised learning in snn via reward-modulated spike-timing-dependent plasticity for a target reaching vehicle
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6509616/
https://www.ncbi.nlm.nih.gov/pubmed/31130854
http://dx.doi.org/10.3389/fnbot.2019.00018
work_keys_str_mv AT bingzhenshan supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle
AT baumannivan supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle
AT jiangzhuangyi supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle
AT huangkai supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle
AT caicaixia supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle
AT knollalois supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle