Cargando…

Sparse Nonnegative Matrix Factorization Strategy for Cochlear Implants

Current cochlear implant (CI) strategies carry speech information via the waveform envelope in frequency subbands. CIs require efficient speech processing to maximize information transfer to the brain, especially in background noise, where the speech envelope is not robust to noise interference. In...

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Hongmei, Lutman, Mark E., Ewert, Stephan D., Li, Guoping, Bleeck, Stefan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4771045/
https://www.ncbi.nlm.nih.gov/pubmed/26721919
http://dx.doi.org/10.1177/2331216515616941
_version_ 1782418358506881024
author Hu, Hongmei
Lutman, Mark E.
Ewert, Stephan D.
Li, Guoping
Bleeck, Stefan
author_facet Hu, Hongmei
Lutman, Mark E.
Ewert, Stephan D.
Li, Guoping
Bleeck, Stefan
author_sort Hu, Hongmei
collection PubMed
description Current cochlear implant (CI) strategies carry speech information via the waveform envelope in frequency subbands. CIs require efficient speech processing to maximize information transfer to the brain, especially in background noise, where the speech envelope is not robust to noise interference. In such conditions, the envelope, after decomposition into frequency bands, may be enhanced by sparse transformations, such as nonnegative matrix factorization (NMF). Here, a novel CI processing algorithm is described, which works by applying NMF to the envelope matrix (envelopogram) of 22 frequency channels in order to improve performance in noisy environments. It is evaluated for speech in eight-talker babble noise. The critical sparsity constraint parameter was first tuned using objective measures and then evaluated with subjective speech perception experiments for both normal hearing and CI subjects. Results from vocoder simulations with 10 normal hearing subjects showed that the algorithm significantly enhances speech intelligibility with the selected sparsity constraints. Results from eight CI subjects showed no significant overall improvement compared with the standard advanced combination encoder algorithm, but a trend toward improvement of word identification of about 10 percentage points at +15 dB signal-to-noise ratio (SNR) was observed in the eight CI subjects. Additionally, a considerable reduction of the spread of speech perception performance from 40% to 93% for advanced combination encoder to 80% to 100% for the suggested NMF coding strategy was observed.
format Online
Article
Text
id pubmed-4771045
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-47710452016-05-26 Sparse Nonnegative Matrix Factorization Strategy for Cochlear Implants Hu, Hongmei Lutman, Mark E. Ewert, Stephan D. Li, Guoping Bleeck, Stefan Trends Hear Special Issue Current cochlear implant (CI) strategies carry speech information via the waveform envelope in frequency subbands. CIs require efficient speech processing to maximize information transfer to the brain, especially in background noise, where the speech envelope is not robust to noise interference. In such conditions, the envelope, after decomposition into frequency bands, may be enhanced by sparse transformations, such as nonnegative matrix factorization (NMF). Here, a novel CI processing algorithm is described, which works by applying NMF to the envelope matrix (envelopogram) of 22 frequency channels in order to improve performance in noisy environments. It is evaluated for speech in eight-talker babble noise. The critical sparsity constraint parameter was first tuned using objective measures and then evaluated with subjective speech perception experiments for both normal hearing and CI subjects. Results from vocoder simulations with 10 normal hearing subjects showed that the algorithm significantly enhances speech intelligibility with the selected sparsity constraints. Results from eight CI subjects showed no significant overall improvement compared with the standard advanced combination encoder algorithm, but a trend toward improvement of word identification of about 10 percentage points at +15 dB signal-to-noise ratio (SNR) was observed in the eight CI subjects. Additionally, a considerable reduction of the spread of speech perception performance from 40% to 93% for advanced combination encoder to 80% to 100% for the suggested NMF coding strategy was observed. SAGE Publications 2015-12-16 /pmc/articles/PMC4771045/ /pubmed/26721919 http://dx.doi.org/10.1177/2331216515616941 Text en © The Author(s) 2015 http://creativecommons.org/licenses/by-nc/3.0/ This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 3.0 License (http://www.creativecommons.org/licenses/by-nc/3.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page(https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Special Issue
Hu, Hongmei
Lutman, Mark E.
Ewert, Stephan D.
Li, Guoping
Bleeck, Stefan
Sparse Nonnegative Matrix Factorization Strategy for Cochlear Implants
title Sparse Nonnegative Matrix Factorization Strategy for Cochlear Implants
title_full Sparse Nonnegative Matrix Factorization Strategy for Cochlear Implants
title_fullStr Sparse Nonnegative Matrix Factorization Strategy for Cochlear Implants
title_full_unstemmed Sparse Nonnegative Matrix Factorization Strategy for Cochlear Implants
title_short Sparse Nonnegative Matrix Factorization Strategy for Cochlear Implants
title_sort sparse nonnegative matrix factorization strategy for cochlear implants
topic Special Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4771045/
https://www.ncbi.nlm.nih.gov/pubmed/26721919
http://dx.doi.org/10.1177/2331216515616941
work_keys_str_mv AT huhongmei sparsenonnegativematrixfactorizationstrategyforcochlearimplants
AT lutmanmarke sparsenonnegativematrixfactorizationstrategyforcochlearimplants
AT ewertstephand sparsenonnegativematrixfactorizationstrategyforcochlearimplants
AT liguoping sparsenonnegativematrixfactorizationstrategyforcochlearimplants
AT bleeckstefan sparsenonnegativematrixfactorizationstrategyforcochlearimplants