Cargando…
Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle
Spiking neural networks (SNNs) offer many advantages over traditional artificial neural networks (ANNs) such as biological plausibility, fast information processing, and energy efficiency. Although SNNs have been used to solve a variety of control tasks using the Spike-Timing-Dependent Plasticity (S...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6509616/ https://www.ncbi.nlm.nih.gov/pubmed/31130854 http://dx.doi.org/10.3389/fnbot.2019.00018 |
_version_ | 1783417279472467968 |
---|---|
author | Bing, Zhenshan Baumann, Ivan Jiang, Zhuangyi Huang, Kai Cai, Caixia Knoll, Alois |
author_facet | Bing, Zhenshan Baumann, Ivan Jiang, Zhuangyi Huang, Kai Cai, Caixia Knoll, Alois |
author_sort | Bing, Zhenshan |
collection | PubMed |
description | Spiking neural networks (SNNs) offer many advantages over traditional artificial neural networks (ANNs) such as biological plausibility, fast information processing, and energy efficiency. Although SNNs have been used to solve a variety of control tasks using the Spike-Timing-Dependent Plasticity (STDP) learning rule, existing solutions usually involve hard-coded network architectures solving specific tasks rather than solving different kinds of tasks generally. This results in neglecting one of the biggest advantages of ANNs, i.e., being general-purpose and easy-to-use due to their simple network architecture, which usually consists of an input layer, one or multiple hidden layers and an output layer. This paper addresses the problem by introducing an end-to-end learning approach of spiking neural networks constructed with one hidden layer and reward-modulated Spike-Timing-Dependent Plasticity (R-STDP) synapses in an all-to-all fashion. We use the supervised reward-modulated Spike-Timing-Dependent-Plasticity learning rule to train two different SNN-based sub-controllers to replicate a desired obstacle avoiding and goal approaching behavior, provided by pre-generated datasets. Together they make up a target-reaching controller, which is used to control a simulated mobile robot to reach a target area while avoiding obstacles in its path. We demonstrate the performance and effectiveness of our trained SNNs to achieve target reaching tasks in different unknown scenarios. |
format | Online Article Text |
id | pubmed-6509616 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-65096162019-05-24 Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle Bing, Zhenshan Baumann, Ivan Jiang, Zhuangyi Huang, Kai Cai, Caixia Knoll, Alois Front Neurorobot Neuroscience Spiking neural networks (SNNs) offer many advantages over traditional artificial neural networks (ANNs) such as biological plausibility, fast information processing, and energy efficiency. Although SNNs have been used to solve a variety of control tasks using the Spike-Timing-Dependent Plasticity (STDP) learning rule, existing solutions usually involve hard-coded network architectures solving specific tasks rather than solving different kinds of tasks generally. This results in neglecting one of the biggest advantages of ANNs, i.e., being general-purpose and easy-to-use due to their simple network architecture, which usually consists of an input layer, one or multiple hidden layers and an output layer. This paper addresses the problem by introducing an end-to-end learning approach of spiking neural networks constructed with one hidden layer and reward-modulated Spike-Timing-Dependent Plasticity (R-STDP) synapses in an all-to-all fashion. We use the supervised reward-modulated Spike-Timing-Dependent-Plasticity learning rule to train two different SNN-based sub-controllers to replicate a desired obstacle avoiding and goal approaching behavior, provided by pre-generated datasets. Together they make up a target-reaching controller, which is used to control a simulated mobile robot to reach a target area while avoiding obstacles in its path. We demonstrate the performance and effectiveness of our trained SNNs to achieve target reaching tasks in different unknown scenarios. Frontiers Media S.A. 2019-05-03 /pmc/articles/PMC6509616/ /pubmed/31130854 http://dx.doi.org/10.3389/fnbot.2019.00018 Text en Copyright © 2019 Bing, Baumann, Jiang, Huang, Cai and Knoll. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Bing, Zhenshan Baumann, Ivan Jiang, Zhuangyi Huang, Kai Cai, Caixia Knoll, Alois Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle |
title | Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle |
title_full | Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle |
title_fullStr | Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle |
title_full_unstemmed | Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle |
title_short | Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle |
title_sort | supervised learning in snn via reward-modulated spike-timing-dependent plasticity for a target reaching vehicle |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6509616/ https://www.ncbi.nlm.nih.gov/pubmed/31130854 http://dx.doi.org/10.3389/fnbot.2019.00018 |
work_keys_str_mv | AT bingzhenshan supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle AT baumannivan supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle AT jiangzhuangyi supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle AT huangkai supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle AT caicaixia supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle AT knollalois supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle |