Cargando…

Reward-based learning under hardware constraints—using a RISC processor embedded in a neuromorphic substrate

In this study, we propose and analyze in simulations a new, highly flexible method of implementing synaptic plasticity in a wafer-scale, accelerated neuromorphic hardware system. The study focuses on globally modulated STDP, as a special use-case of this method. Flexibility is achieved by embedding...

Descripción completa

Detalles Bibliográficos
Autores principales:	Friedmann, Simon, Frémaux, Nicolas, Schemmel, Johannes, Gerstner, Wulfram, Meier, Karlheinz
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2013
Materias:	Neuroscience
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3778319/ https://www.ncbi.nlm.nih.gov/pubmed/24065877 http://dx.doi.org/10.3389/fnins.2013.00160

_version_	1782285096651325440
author	Friedmann, Simon Frémaux, Nicolas Schemmel, Johannes Gerstner, Wulfram Meier, Karlheinz
author_facet	Friedmann, Simon Frémaux, Nicolas Schemmel, Johannes Gerstner, Wulfram Meier, Karlheinz
author_sort	Friedmann, Simon
collection	PubMed
description	In this study, we propose and analyze in simulations a new, highly flexible method of implementing synaptic plasticity in a wafer-scale, accelerated neuromorphic hardware system. The study focuses on globally modulated STDP, as a special use-case of this method. Flexibility is achieved by embedding a general-purpose processor dedicated to plasticity into the wafer. To evaluate the suitability of the proposed system, we use a reward modulated STDP rule in a spike train learning task. A single layer of neurons is trained to fire at specific points in time with only the reward as feedback. This model is simulated to measure its performance, i.e., the increase in received reward after learning. Using this performance as baseline, we then simulate the model with various constraints imposed by the proposed implementation and compare the performance. The simulated constraints include discretized synaptic weights, a restricted interface between analog synapses and embedded processor, and mismatch of analog circuits. We find that probabilistic updates can increase the performance of low-resolution weights, a simple interface between analog synapses and processor is sufficient for learning, and performance is insensitive to mismatch. Further, we consider communication latency between wafer and the conventional control computer system that is simulating the environment. This latency increases the delay, with which the reward is sent to the embedded processor. Because of the time continuous operation of the analog synapses, delay can cause a deviation of the updates as compared to the not delayed situation. We find that for highly accelerated systems latency has to be kept to a minimum. This study demonstrates the suitability of the proposed implementation to emulate the selected reward modulated STDP learning rule. It is therefore an ideal candidate for implementation in an upgraded version of the wafer-scale system developed within the BrainScaleS project.
format	Online Article Text
id	pubmed-3778319
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-37783192013-09-24 Reward-based learning under hardware constraints—using a RISC processor embedded in a neuromorphic substrate Friedmann, Simon Frémaux, Nicolas Schemmel, Johannes Gerstner, Wulfram Meier, Karlheinz Front Neurosci Neuroscience In this study, we propose and analyze in simulations a new, highly flexible method of implementing synaptic plasticity in a wafer-scale, accelerated neuromorphic hardware system. The study focuses on globally modulated STDP, as a special use-case of this method. Flexibility is achieved by embedding a general-purpose processor dedicated to plasticity into the wafer. To evaluate the suitability of the proposed system, we use a reward modulated STDP rule in a spike train learning task. A single layer of neurons is trained to fire at specific points in time with only the reward as feedback. This model is simulated to measure its performance, i.e., the increase in received reward after learning. Using this performance as baseline, we then simulate the model with various constraints imposed by the proposed implementation and compare the performance. The simulated constraints include discretized synaptic weights, a restricted interface between analog synapses and embedded processor, and mismatch of analog circuits. We find that probabilistic updates can increase the performance of low-resolution weights, a simple interface between analog synapses and processor is sufficient for learning, and performance is insensitive to mismatch. Further, we consider communication latency between wafer and the conventional control computer system that is simulating the environment. This latency increases the delay, with which the reward is sent to the embedded processor. Because of the time continuous operation of the analog synapses, delay can cause a deviation of the updates as compared to the not delayed situation. We find that for highly accelerated systems latency has to be kept to a minimum. This study demonstrates the suitability of the proposed implementation to emulate the selected reward modulated STDP learning rule. It is therefore an ideal candidate for implementation in an upgraded version of the wafer-scale system developed within the BrainScaleS project. Frontiers Media S.A. 2013-09-20 /pmc/articles/PMC3778319/ /pubmed/24065877 http://dx.doi.org/10.3389/fnins.2013.00160 Text en Copyright © 2013 Friedmann, Frémaux, Schemmel, Gerstner and Meier. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Neuroscience Friedmann, Simon Frémaux, Nicolas Schemmel, Johannes Gerstner, Wulfram Meier, Karlheinz Reward-based learning under hardware constraints—using a RISC processor embedded in a neuromorphic substrate
title	Reward-based learning under hardware constraints—using a RISC processor embedded in a neuromorphic substrate
title_full	Reward-based learning under hardware constraints—using a RISC processor embedded in a neuromorphic substrate
title_fullStr	Reward-based learning under hardware constraints—using a RISC processor embedded in a neuromorphic substrate
title_full_unstemmed	Reward-based learning under hardware constraints—using a RISC processor embedded in a neuromorphic substrate
title_short	Reward-based learning under hardware constraints—using a RISC processor embedded in a neuromorphic substrate
title_sort	reward-based learning under hardware constraints—using a risc processor embedded in a neuromorphic substrate
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3778319/ https://www.ncbi.nlm.nih.gov/pubmed/24065877 http://dx.doi.org/10.3389/fnins.2013.00160
work_keys_str_mv	AT friedmannsimon rewardbasedlearningunderhardwareconstraintsusingariscprocessorembeddedinaneuromorphicsubstrate AT fremauxnicolas rewardbasedlearningunderhardwareconstraintsusingariscprocessorembeddedinaneuromorphicsubstrate AT schemmeljohannes rewardbasedlearningunderhardwareconstraintsusingariscprocessorembeddedinaneuromorphicsubstrate AT gerstnerwulfram rewardbasedlearningunderhardwareconstraintsusingariscprocessorembeddedinaneuromorphicsubstrate AT meierkarlheinz rewardbasedlearningunderhardwareconstraintsusingariscprocessorembeddedinaneuromorphicsubstrate

Reward-based learning under hardware constraints—using a RISC processor embedded in a neuromorphic substrate

Ejemplares similares