Cargando…
Experimental Investigation of Acoustic Features to Optimize Intelligibility in Cochlear Implants
Although cochlear implants work well for people with hearing impairment in quiet conditions, it is well-known that they are not as effective in noisy environments. Noise reduction algorithms based on machine learning allied with appropriate speech features can be used to address this problem. The pu...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10490615/ https://www.ncbi.nlm.nih.gov/pubmed/37688009 http://dx.doi.org/10.3390/s23177553 |
_version_ | 1785103880876457984 |
---|---|
author | Henry, Fergal Parsi, Ashkan Glavin, Martin Jones, Edward |
author_facet | Henry, Fergal Parsi, Ashkan Glavin, Martin Jones, Edward |
author_sort | Henry, Fergal |
collection | PubMed |
description | Although cochlear implants work well for people with hearing impairment in quiet conditions, it is well-known that they are not as effective in noisy environments. Noise reduction algorithms based on machine learning allied with appropriate speech features can be used to address this problem. The purpose of this study is to investigate the importance of acoustic features in such algorithms. Acoustic features are extracted from speech and noise mixtures and used in conjunction with the ideal binary mask to train a deep neural network to estimate masks for speech synthesis to produce enhanced speech. The intelligibility of this speech is objectively measured using metrics such as Short-time Objective Intelligibility (STOI), Hit Rate minus False Alarm Rate (HIT-FA) and Normalized Covariance Measure (NCM) for both simulated normal-hearing and hearing-impaired scenarios. A wide range of existing features is experimentally evaluated, including features that have not been traditionally applied in this application. The results demonstrate that frequency domain features perform best. In particular, Gammatone features performed best for normal hearing over a range of signal-to-noise ratios and noise types (STOI = 0.7826). Mel spectrogram features exhibited the best overall performance for hearing impairment (NCM = 0.7314). There is a stronger correlation between STOI and NCM than HIT-FA and NCM, suggesting that the former is a better predictor of intelligibility for hearing-impaired listeners. The results of this study may be useful in the design of adaptive intelligibility enhancement systems for cochlear implants based on both the noise level and the nature of the noise (stationary or non-stationary). |
format | Online Article Text |
id | pubmed-10490615 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-104906152023-09-09 Experimental Investigation of Acoustic Features to Optimize Intelligibility in Cochlear Implants Henry, Fergal Parsi, Ashkan Glavin, Martin Jones, Edward Sensors (Basel) Article Although cochlear implants work well for people with hearing impairment in quiet conditions, it is well-known that they are not as effective in noisy environments. Noise reduction algorithms based on machine learning allied with appropriate speech features can be used to address this problem. The purpose of this study is to investigate the importance of acoustic features in such algorithms. Acoustic features are extracted from speech and noise mixtures and used in conjunction with the ideal binary mask to train a deep neural network to estimate masks for speech synthesis to produce enhanced speech. The intelligibility of this speech is objectively measured using metrics such as Short-time Objective Intelligibility (STOI), Hit Rate minus False Alarm Rate (HIT-FA) and Normalized Covariance Measure (NCM) for both simulated normal-hearing and hearing-impaired scenarios. A wide range of existing features is experimentally evaluated, including features that have not been traditionally applied in this application. The results demonstrate that frequency domain features perform best. In particular, Gammatone features performed best for normal hearing over a range of signal-to-noise ratios and noise types (STOI = 0.7826). Mel spectrogram features exhibited the best overall performance for hearing impairment (NCM = 0.7314). There is a stronger correlation between STOI and NCM than HIT-FA and NCM, suggesting that the former is a better predictor of intelligibility for hearing-impaired listeners. The results of this study may be useful in the design of adaptive intelligibility enhancement systems for cochlear implants based on both the noise level and the nature of the noise (stationary or non-stationary). MDPI 2023-08-31 /pmc/articles/PMC10490615/ /pubmed/37688009 http://dx.doi.org/10.3390/s23177553 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Henry, Fergal Parsi, Ashkan Glavin, Martin Jones, Edward Experimental Investigation of Acoustic Features to Optimize Intelligibility in Cochlear Implants |
title | Experimental Investigation of Acoustic Features to Optimize Intelligibility in Cochlear Implants |
title_full | Experimental Investigation of Acoustic Features to Optimize Intelligibility in Cochlear Implants |
title_fullStr | Experimental Investigation of Acoustic Features to Optimize Intelligibility in Cochlear Implants |
title_full_unstemmed | Experimental Investigation of Acoustic Features to Optimize Intelligibility in Cochlear Implants |
title_short | Experimental Investigation of Acoustic Features to Optimize Intelligibility in Cochlear Implants |
title_sort | experimental investigation of acoustic features to optimize intelligibility in cochlear implants |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10490615/ https://www.ncbi.nlm.nih.gov/pubmed/37688009 http://dx.doi.org/10.3390/s23177553 |
work_keys_str_mv | AT henryfergal experimentalinvestigationofacousticfeaturestooptimizeintelligibilityincochlearimplants AT parsiashkan experimentalinvestigationofacousticfeaturestooptimizeintelligibilityincochlearimplants AT glavinmartin experimentalinvestigationofacousticfeaturestooptimizeintelligibilityincochlearimplants AT jonesedward experimentalinvestigationofacousticfeaturestooptimizeintelligibilityincochlearimplants |