Cargando…

Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle

Spiking neural networks (SNNs) offer many advantages over traditional artificial neural networks (ANNs) such as biological plausibility, fast information processing, and energy efficiency. Although SNNs have been used to solve a variety of control tasks using the Spike-Timing-Dependent Plasticity (S...

Descripción completa

Detalles Bibliográficos
Autores principales:	Bing, Zhenshan, Baumann, Ivan, Jiang, Zhuangyi, Huang, Kai, Cai, Caixia, Knoll, Alois
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2019
Materias:	Neuroscience
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6509616/ https://www.ncbi.nlm.nih.gov/pubmed/31130854 http://dx.doi.org/10.3389/fnbot.2019.00018

_version_	1783417279472467968
author	Bing, Zhenshan Baumann, Ivan Jiang, Zhuangyi Huang, Kai Cai, Caixia Knoll, Alois
author_facet	Bing, Zhenshan Baumann, Ivan Jiang, Zhuangyi Huang, Kai Cai, Caixia Knoll, Alois
author_sort	Bing, Zhenshan
collection	PubMed
description	Spiking neural networks (SNNs) offer many advantages over traditional artificial neural networks (ANNs) such as biological plausibility, fast information processing, and energy efficiency. Although SNNs have been used to solve a variety of control tasks using the Spike-Timing-Dependent Plasticity (STDP) learning rule, existing solutions usually involve hard-coded network architectures solving specific tasks rather than solving different kinds of tasks generally. This results in neglecting one of the biggest advantages of ANNs, i.e., being general-purpose and easy-to-use due to their simple network architecture, which usually consists of an input layer, one or multiple hidden layers and an output layer. This paper addresses the problem by introducing an end-to-end learning approach of spiking neural networks constructed with one hidden layer and reward-modulated Spike-Timing-Dependent Plasticity (R-STDP) synapses in an all-to-all fashion. We use the supervised reward-modulated Spike-Timing-Dependent-Plasticity learning rule to train two different SNN-based sub-controllers to replicate a desired obstacle avoiding and goal approaching behavior, provided by pre-generated datasets. Together they make up a target-reaching controller, which is used to control a simulated mobile robot to reach a target area while avoiding obstacles in its path. We demonstrate the performance and effectiveness of our trained SNNs to achieve target reaching tasks in different unknown scenarios.
format	Online Article Text
id	pubmed-6509616
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-65096162019-05-24 Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle Bing, Zhenshan Baumann, Ivan Jiang, Zhuangyi Huang, Kai Cai, Caixia Knoll, Alois Front Neurorobot Neuroscience Spiking neural networks (SNNs) offer many advantages over traditional artificial neural networks (ANNs) such as biological plausibility, fast information processing, and energy efficiency. Although SNNs have been used to solve a variety of control tasks using the Spike-Timing-Dependent Plasticity (STDP) learning rule, existing solutions usually involve hard-coded network architectures solving specific tasks rather than solving different kinds of tasks generally. This results in neglecting one of the biggest advantages of ANNs, i.e., being general-purpose and easy-to-use due to their simple network architecture, which usually consists of an input layer, one or multiple hidden layers and an output layer. This paper addresses the problem by introducing an end-to-end learning approach of spiking neural networks constructed with one hidden layer and reward-modulated Spike-Timing-Dependent Plasticity (R-STDP) synapses in an all-to-all fashion. We use the supervised reward-modulated Spike-Timing-Dependent-Plasticity learning rule to train two different SNN-based sub-controllers to replicate a desired obstacle avoiding and goal approaching behavior, provided by pre-generated datasets. Together they make up a target-reaching controller, which is used to control a simulated mobile robot to reach a target area while avoiding obstacles in its path. We demonstrate the performance and effectiveness of our trained SNNs to achieve target reaching tasks in different unknown scenarios. Frontiers Media S.A. 2019-05-03 /pmc/articles/PMC6509616/ /pubmed/31130854 http://dx.doi.org/10.3389/fnbot.2019.00018 Text en Copyright © 2019 Bing, Baumann, Jiang, Huang, Cai and Knoll. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Neuroscience Bing, Zhenshan Baumann, Ivan Jiang, Zhuangyi Huang, Kai Cai, Caixia Knoll, Alois Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle
title	Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle
title_full	Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle
title_fullStr	Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle
title_full_unstemmed	Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle
title_short	Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle
title_sort	supervised learning in snn via reward-modulated spike-timing-dependent plasticity for a target reaching vehicle
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6509616/ https://www.ncbi.nlm.nih.gov/pubmed/31130854 http://dx.doi.org/10.3389/fnbot.2019.00018
work_keys_str_mv	AT bingzhenshan supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle AT baumannivan supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle AT jiangzhuangyi supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle AT huangkai supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle AT caicaixia supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle AT knollalois supervisedlearninginsnnviarewardmodulatedspiketimingdependentplasticityforatargetreachingvehicle

Supervised Learning in SNN via Reward-Modulated Spike-Timing-Dependent Plasticity for a Target Reaching Vehicle

Ejemplares similares