Cargando…

Learnable axonal delay in spiking neural networks improves spoken word recognition

Spiking neural networks (SNNs), which are composed of biologically plausible spiking neurons, and combined with bio-physically realistic auditory periphery models, offer a means to explore and understand human auditory processing-especially in tasks where precise timing is essential. However, becaus...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Pengfei, Chua, Yansong, Devos, Paul, Botteldooren, Dick
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665570/
https://www.ncbi.nlm.nih.gov/pubmed/38027508
http://dx.doi.org/10.3389/fnins.2023.1275944
_version_ 1785138855132790784
author Sun, Pengfei
Chua, Yansong
Devos, Paul
Botteldooren, Dick
author_facet Sun, Pengfei
Chua, Yansong
Devos, Paul
Botteldooren, Dick
author_sort Sun, Pengfei
collection PubMed
description Spiking neural networks (SNNs), which are composed of biologically plausible spiking neurons, and combined with bio-physically realistic auditory periphery models, offer a means to explore and understand human auditory processing-especially in tasks where precise timing is essential. However, because of the inherent temporal complexity in spike sequences, the performance of SNNs has remained less competitive compared to artificial neural networks (ANNs). To tackle this challenge, a fundamental research topic is the configuration of spike-timing and the exploration of more intricate architectures. In this work, we demonstrate a learnable axonal delay combined with local skip-connections yields state-of-the-art performance on challenging benchmarks for spoken word recognition. Additionally, we introduce an auxiliary loss term to further enhance accuracy and stability. Experiments on the neuromorphic speech benchmark datasets, NTIDIDIGITS and SHD, show improvements in performance when incorporating our delay module in comparison to vanilla feedforward SNNs. Specifically, with the integration of our delay module, the performance on NTIDIDIGITS and SHD improves by 14% and 18%, respectively. When paired with local skip-connections and the auxiliary loss, our approach surpasses both recurrent and convolutional neural networks, yet uses 10 × fewer parameters for NTIDIDIGITS and 7 × fewer for SHD.
format Online
Article
Text
id pubmed-10665570
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-106655702023-01-01 Learnable axonal delay in spiking neural networks improves spoken word recognition Sun, Pengfei Chua, Yansong Devos, Paul Botteldooren, Dick Front Neurosci Neuroscience Spiking neural networks (SNNs), which are composed of biologically plausible spiking neurons, and combined with bio-physically realistic auditory periphery models, offer a means to explore and understand human auditory processing-especially in tasks where precise timing is essential. However, because of the inherent temporal complexity in spike sequences, the performance of SNNs has remained less competitive compared to artificial neural networks (ANNs). To tackle this challenge, a fundamental research topic is the configuration of spike-timing and the exploration of more intricate architectures. In this work, we demonstrate a learnable axonal delay combined with local skip-connections yields state-of-the-art performance on challenging benchmarks for spoken word recognition. Additionally, we introduce an auxiliary loss term to further enhance accuracy and stability. Experiments on the neuromorphic speech benchmark datasets, NTIDIDIGITS and SHD, show improvements in performance when incorporating our delay module in comparison to vanilla feedforward SNNs. Specifically, with the integration of our delay module, the performance on NTIDIDIGITS and SHD improves by 14% and 18%, respectively. When paired with local skip-connections and the auxiliary loss, our approach surpasses both recurrent and convolutional neural networks, yet uses 10 × fewer parameters for NTIDIDIGITS and 7 × fewer for SHD. Frontiers Media S.A. 2023-11-09 /pmc/articles/PMC10665570/ /pubmed/38027508 http://dx.doi.org/10.3389/fnins.2023.1275944 Text en Copyright © 2023 Sun, Chua, Devos and Botteldooren. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Sun, Pengfei
Chua, Yansong
Devos, Paul
Botteldooren, Dick
Learnable axonal delay in spiking neural networks improves spoken word recognition
title Learnable axonal delay in spiking neural networks improves spoken word recognition
title_full Learnable axonal delay in spiking neural networks improves spoken word recognition
title_fullStr Learnable axonal delay in spiking neural networks improves spoken word recognition
title_full_unstemmed Learnable axonal delay in spiking neural networks improves spoken word recognition
title_short Learnable axonal delay in spiking neural networks improves spoken word recognition
title_sort learnable axonal delay in spiking neural networks improves spoken word recognition
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665570/
https://www.ncbi.nlm.nih.gov/pubmed/38027508
http://dx.doi.org/10.3389/fnins.2023.1275944
work_keys_str_mv AT sunpengfei learnableaxonaldelayinspikingneuralnetworksimprovesspokenwordrecognition
AT chuayansong learnableaxonaldelayinspikingneuralnetworksimprovesspokenwordrecognition
AT devospaul learnableaxonaldelayinspikingneuralnetworksimprovesspokenwordrecognition
AT botteldoorendick learnableaxonaldelayinspikingneuralnetworksimprovesspokenwordrecognition