Cargando…
Learnable axonal delay in spiking neural networks improves spoken word recognition
Spiking neural networks (SNNs), which are composed of biologically plausible spiking neurons, and combined with bio-physically realistic auditory periphery models, offer a means to explore and understand human auditory processing-especially in tasks where precise timing is essential. However, becaus...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665570/ https://www.ncbi.nlm.nih.gov/pubmed/38027508 http://dx.doi.org/10.3389/fnins.2023.1275944 |
_version_ | 1785138855132790784 |
---|---|
author | Sun, Pengfei Chua, Yansong Devos, Paul Botteldooren, Dick |
author_facet | Sun, Pengfei Chua, Yansong Devos, Paul Botteldooren, Dick |
author_sort | Sun, Pengfei |
collection | PubMed |
description | Spiking neural networks (SNNs), which are composed of biologically plausible spiking neurons, and combined with bio-physically realistic auditory periphery models, offer a means to explore and understand human auditory processing-especially in tasks where precise timing is essential. However, because of the inherent temporal complexity in spike sequences, the performance of SNNs has remained less competitive compared to artificial neural networks (ANNs). To tackle this challenge, a fundamental research topic is the configuration of spike-timing and the exploration of more intricate architectures. In this work, we demonstrate a learnable axonal delay combined with local skip-connections yields state-of-the-art performance on challenging benchmarks for spoken word recognition. Additionally, we introduce an auxiliary loss term to further enhance accuracy and stability. Experiments on the neuromorphic speech benchmark datasets, NTIDIDIGITS and SHD, show improvements in performance when incorporating our delay module in comparison to vanilla feedforward SNNs. Specifically, with the integration of our delay module, the performance on NTIDIDIGITS and SHD improves by 14% and 18%, respectively. When paired with local skip-connections and the auxiliary loss, our approach surpasses both recurrent and convolutional neural networks, yet uses 10 × fewer parameters for NTIDIDIGITS and 7 × fewer for SHD. |
format | Online Article Text |
id | pubmed-10665570 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-106655702023-01-01 Learnable axonal delay in spiking neural networks improves spoken word recognition Sun, Pengfei Chua, Yansong Devos, Paul Botteldooren, Dick Front Neurosci Neuroscience Spiking neural networks (SNNs), which are composed of biologically plausible spiking neurons, and combined with bio-physically realistic auditory periphery models, offer a means to explore and understand human auditory processing-especially in tasks where precise timing is essential. However, because of the inherent temporal complexity in spike sequences, the performance of SNNs has remained less competitive compared to artificial neural networks (ANNs). To tackle this challenge, a fundamental research topic is the configuration of spike-timing and the exploration of more intricate architectures. In this work, we demonstrate a learnable axonal delay combined with local skip-connections yields state-of-the-art performance on challenging benchmarks for spoken word recognition. Additionally, we introduce an auxiliary loss term to further enhance accuracy and stability. Experiments on the neuromorphic speech benchmark datasets, NTIDIDIGITS and SHD, show improvements in performance when incorporating our delay module in comparison to vanilla feedforward SNNs. Specifically, with the integration of our delay module, the performance on NTIDIDIGITS and SHD improves by 14% and 18%, respectively. When paired with local skip-connections and the auxiliary loss, our approach surpasses both recurrent and convolutional neural networks, yet uses 10 × fewer parameters for NTIDIDIGITS and 7 × fewer for SHD. Frontiers Media S.A. 2023-11-09 /pmc/articles/PMC10665570/ /pubmed/38027508 http://dx.doi.org/10.3389/fnins.2023.1275944 Text en Copyright © 2023 Sun, Chua, Devos and Botteldooren. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Sun, Pengfei Chua, Yansong Devos, Paul Botteldooren, Dick Learnable axonal delay in spiking neural networks improves spoken word recognition |
title | Learnable axonal delay in spiking neural networks improves spoken word recognition |
title_full | Learnable axonal delay in spiking neural networks improves spoken word recognition |
title_fullStr | Learnable axonal delay in spiking neural networks improves spoken word recognition |
title_full_unstemmed | Learnable axonal delay in spiking neural networks improves spoken word recognition |
title_short | Learnable axonal delay in spiking neural networks improves spoken word recognition |
title_sort | learnable axonal delay in spiking neural networks improves spoken word recognition |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665570/ https://www.ncbi.nlm.nih.gov/pubmed/38027508 http://dx.doi.org/10.3389/fnins.2023.1275944 |
work_keys_str_mv | AT sunpengfei learnableaxonaldelayinspikingneuralnetworksimprovesspokenwordrecognition AT chuayansong learnableaxonaldelayinspikingneuralnetworksimprovesspokenwordrecognition AT devospaul learnableaxonaldelayinspikingneuralnetworksimprovesspokenwordrecognition AT botteldoorendick learnableaxonaldelayinspikingneuralnetworksimprovesspokenwordrecognition |