Cargando…

Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement

A signal processing approach combining beamforming with mask-informed speech enhancement was assessed by measuring sentence recognition in listeners with mild-to-moderate hearing impairment in adverse listening conditions that simulated the output of behind-the-ear hearing aids in a noisy classroom....

Descripción completa

Detalles Bibliográficos
Autores principales:	Green, Tim, Hilkhuysen, Gaston, Huckvale, Mark, Rosen, Stuart, Brookes, Mike, Moore, Alastair, Naylor, Patrick, Lightburn, Leo, Xue, Wei
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	SAGE Publications 2022
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8744079/ https://www.ncbi.nlm.nih.gov/pubmed/34985356 http://dx.doi.org/10.1177/23312165211068629

_version_	1784630044777250816
author	Green, Tim Hilkhuysen, Gaston Huckvale, Mark Rosen, Stuart Brookes, Mike Moore, Alastair Naylor, Patrick Lightburn, Leo Xue, Wei
author_facet	Green, Tim Hilkhuysen, Gaston Huckvale, Mark Rosen, Stuart Brookes, Mike Moore, Alastair Naylor, Patrick Lightburn, Leo Xue, Wei
author_sort	Green, Tim
collection	PubMed
description	A signal processing approach combining beamforming with mask-informed speech enhancement was assessed by measuring sentence recognition in listeners with mild-to-moderate hearing impairment in adverse listening conditions that simulated the output of behind-the-ear hearing aids in a noisy classroom. Two types of beamforming were compared: binaural, with the two microphones of each aid treated as a single array, and bilateral, where independent left and right beamformers were derived. Binaural beamforming produces a narrower beam, maximising improvement in signal-to-noise ratio (SNR), but eliminates the spatial diversity that is preserved in bilateral beamforming. Each beamformer type was optimised for the true target position and implemented with and without additional speech enhancement in which spectral features extracted from the beamformer output were passed to a deep neural network trained to identify time-frequency regions dominated by target speech. Additional conditions comprising binaural beamforming combined with speech enhancement implemented using Wiener filtering or modulation-domain Kalman filtering were tested in normally-hearing (NH) listeners. Both beamformer types gave substantial improvements relative to no processing, with significantly greater benefit for binaural beamforming. Performance with additional mask-informed enhancement was poorer than with beamforming alone, for both beamformer types and both listener groups. In NH listeners the addition of mask-informed enhancement produced significantly poorer performance than both other forms of enhancement, neither of which differed from the beamformer alone. In summary, the additional improvement in SNR provided by binaural beamforming appeared to outweigh loss of spatial information, while speech understanding was not further improved by the mask-informed enhancement method implemented here.
format	Online Article Text
id	pubmed-8744079
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	SAGE Publications
record_format	MEDLINE/PubMed
spelling	pubmed-87440792022-01-11 Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement Green, Tim Hilkhuysen, Gaston Huckvale, Mark Rosen, Stuart Brookes, Mike Moore, Alastair Naylor, Patrick Lightburn, Leo Xue, Wei Trends Hear Original Article A signal processing approach combining beamforming with mask-informed speech enhancement was assessed by measuring sentence recognition in listeners with mild-to-moderate hearing impairment in adverse listening conditions that simulated the output of behind-the-ear hearing aids in a noisy classroom. Two types of beamforming were compared: binaural, with the two microphones of each aid treated as a single array, and bilateral, where independent left and right beamformers were derived. Binaural beamforming produces a narrower beam, maximising improvement in signal-to-noise ratio (SNR), but eliminates the spatial diversity that is preserved in bilateral beamforming. Each beamformer type was optimised for the true target position and implemented with and without additional speech enhancement in which spectral features extracted from the beamformer output were passed to a deep neural network trained to identify time-frequency regions dominated by target speech. Additional conditions comprising binaural beamforming combined with speech enhancement implemented using Wiener filtering or modulation-domain Kalman filtering were tested in normally-hearing (NH) listeners. Both beamformer types gave substantial improvements relative to no processing, with significantly greater benefit for binaural beamforming. Performance with additional mask-informed enhancement was poorer than with beamforming alone, for both beamformer types and both listener groups. In NH listeners the addition of mask-informed enhancement produced significantly poorer performance than both other forms of enhancement, neither of which differed from the beamformer alone. In summary, the additional improvement in SNR provided by binaural beamforming appeared to outweigh loss of spatial information, while speech understanding was not further improved by the mask-informed enhancement method implemented here. SAGE Publications 2022-01-05 /pmc/articles/PMC8744079/ /pubmed/34985356 http://dx.doi.org/10.1177/23312165211068629 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle	Original Article Green, Tim Hilkhuysen, Gaston Huckvale, Mark Rosen, Stuart Brookes, Mike Moore, Alastair Naylor, Patrick Lightburn, Leo Xue, Wei Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement
title	Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement
title_full	Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement
title_fullStr	Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement
title_full_unstemmed	Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement
title_short	Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement
title_sort	speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8744079/ https://www.ncbi.nlm.nih.gov/pubmed/34985356 http://dx.doi.org/10.1177/23312165211068629
work_keys_str_mv	AT greentim speechrecognitionwithahearingaidprocessingschemecombiningbeamformingwithmaskinformedspeechenhancement AT hilkhuysengaston speechrecognitionwithahearingaidprocessingschemecombiningbeamformingwithmaskinformedspeechenhancement AT huckvalemark speechrecognitionwithahearingaidprocessingschemecombiningbeamformingwithmaskinformedspeechenhancement AT rosenstuart speechrecognitionwithahearingaidprocessingschemecombiningbeamformingwithmaskinformedspeechenhancement AT brookesmike speechrecognitionwithahearingaidprocessingschemecombiningbeamformingwithmaskinformedspeechenhancement AT moorealastair speechrecognitionwithahearingaidprocessingschemecombiningbeamformingwithmaskinformedspeechenhancement AT naylorpatrick speechrecognitionwithahearingaidprocessingschemecombiningbeamformingwithmaskinformedspeechenhancement AT lightburnleo speechrecognitionwithahearingaidprocessingschemecombiningbeamformingwithmaskinformedspeechenhancement AT xuewei speechrecognitionwithahearingaidprocessingschemecombiningbeamformingwithmaskinformedspeechenhancement

Speech recognition with a hearing-aid processing scheme combining beamforming with mask-informed speech enhancement

Ejemplares similares