Cargando…

A Comparison of Low-Complexity Real-Time Feature Extraction for Neuromorphic Speech Recognition

This paper presents a real-time, low-complexity neuromorphic speech recognition system using a spiking silicon cochlea, a feature extraction module and a population encoding method based Neural Engineering Framework (NEF)/Extreme Learning Machine (ELM) classifier IC. Several feature extraction metho...

Descripción completa

Detalles Bibliográficos
Autores principales: Acharya, Jyotibdha, Patil, Aakash, Li, Xiaoya, Chen, Yi, Liu, Shih-Chii, Basu, Arindam
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5882819/
https://www.ncbi.nlm.nih.gov/pubmed/29643760
http://dx.doi.org/10.3389/fnins.2018.00160
_version_ 1783311529142124544
author Acharya, Jyotibdha
Patil, Aakash
Li, Xiaoya
Chen, Yi
Liu, Shih-Chii
Basu, Arindam
author_facet Acharya, Jyotibdha
Patil, Aakash
Li, Xiaoya
Chen, Yi
Liu, Shih-Chii
Basu, Arindam
author_sort Acharya, Jyotibdha
collection PubMed
description This paper presents a real-time, low-complexity neuromorphic speech recognition system using a spiking silicon cochlea, a feature extraction module and a population encoding method based Neural Engineering Framework (NEF)/Extreme Learning Machine (ELM) classifier IC. Several feature extraction methods with varying memory and computational complexity are presented along with their corresponding classification accuracies. On the N-TIDIGITS18 dataset, we show that a fixed bin size based feature extraction method that votes across both time and spike count features can achieve an accuracy of 95% in software similar to previously report methods that use fixed number of bins per sample while using ~3× less energy and ~25× less memory for feature extraction (~1.5× less overall). Hardware measurements for the same topology show a slightly reduced accuracy of 94% that can be attributed to the extra correlations in hardware random weights. The hardware accuracy can be increased by further increasing the number of hidden nodes in ELM at the cost of memory and energy.
format Online
Article
Text
id pubmed-5882819
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-58828192018-04-11 A Comparison of Low-Complexity Real-Time Feature Extraction for Neuromorphic Speech Recognition Acharya, Jyotibdha Patil, Aakash Li, Xiaoya Chen, Yi Liu, Shih-Chii Basu, Arindam Front Neurosci Neuroscience This paper presents a real-time, low-complexity neuromorphic speech recognition system using a spiking silicon cochlea, a feature extraction module and a population encoding method based Neural Engineering Framework (NEF)/Extreme Learning Machine (ELM) classifier IC. Several feature extraction methods with varying memory and computational complexity are presented along with their corresponding classification accuracies. On the N-TIDIGITS18 dataset, we show that a fixed bin size based feature extraction method that votes across both time and spike count features can achieve an accuracy of 95% in software similar to previously report methods that use fixed number of bins per sample while using ~3× less energy and ~25× less memory for feature extraction (~1.5× less overall). Hardware measurements for the same topology show a slightly reduced accuracy of 94% that can be attributed to the extra correlations in hardware random weights. The hardware accuracy can be increased by further increasing the number of hidden nodes in ELM at the cost of memory and energy. Frontiers Media S.A. 2018-03-28 /pmc/articles/PMC5882819/ /pubmed/29643760 http://dx.doi.org/10.3389/fnins.2018.00160 Text en Copyright © 2018 Acharya, Patil, Li, Chen, Liu and Basu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Acharya, Jyotibdha
Patil, Aakash
Li, Xiaoya
Chen, Yi
Liu, Shih-Chii
Basu, Arindam
A Comparison of Low-Complexity Real-Time Feature Extraction for Neuromorphic Speech Recognition
title A Comparison of Low-Complexity Real-Time Feature Extraction for Neuromorphic Speech Recognition
title_full A Comparison of Low-Complexity Real-Time Feature Extraction for Neuromorphic Speech Recognition
title_fullStr A Comparison of Low-Complexity Real-Time Feature Extraction for Neuromorphic Speech Recognition
title_full_unstemmed A Comparison of Low-Complexity Real-Time Feature Extraction for Neuromorphic Speech Recognition
title_short A Comparison of Low-Complexity Real-Time Feature Extraction for Neuromorphic Speech Recognition
title_sort comparison of low-complexity real-time feature extraction for neuromorphic speech recognition
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5882819/
https://www.ncbi.nlm.nih.gov/pubmed/29643760
http://dx.doi.org/10.3389/fnins.2018.00160
work_keys_str_mv AT acharyajyotibdha acomparisonoflowcomplexityrealtimefeatureextractionforneuromorphicspeechrecognition
AT patilaakash acomparisonoflowcomplexityrealtimefeatureextractionforneuromorphicspeechrecognition
AT lixiaoya acomparisonoflowcomplexityrealtimefeatureextractionforneuromorphicspeechrecognition
AT chenyi acomparisonoflowcomplexityrealtimefeatureextractionforneuromorphicspeechrecognition
AT liushihchii acomparisonoflowcomplexityrealtimefeatureextractionforneuromorphicspeechrecognition
AT basuarindam acomparisonoflowcomplexityrealtimefeatureextractionforneuromorphicspeechrecognition
AT acharyajyotibdha comparisonoflowcomplexityrealtimefeatureextractionforneuromorphicspeechrecognition
AT patilaakash comparisonoflowcomplexityrealtimefeatureextractionforneuromorphicspeechrecognition
AT lixiaoya comparisonoflowcomplexityrealtimefeatureextractionforneuromorphicspeechrecognition
AT chenyi comparisonoflowcomplexityrealtimefeatureextractionforneuromorphicspeechrecognition
AT liushihchii comparisonoflowcomplexityrealtimefeatureextractionforneuromorphicspeechrecognition
AT basuarindam comparisonoflowcomplexityrealtimefeatureextractionforneuromorphicspeechrecognition