Cargando…
A biologically oriented algorithm for spatial sound segregation
Listening in an acoustically cluttered scene remains a difficult task for both machines and hearing-impaired listeners. Normal-hearing listeners accomplish this task with relative ease by segregating the scene into its constituent sound sources, then selecting and attending to a target source. An as...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9614053/ https://www.ncbi.nlm.nih.gov/pubmed/36312015 http://dx.doi.org/10.3389/fnins.2022.1004071 |
_version_ | 1784820109542424576 |
---|---|
author | Chou, Kenny F. Boyd, Alexander D. Best, Virginia Colburn, H. Steven Sen, Kamal |
author_facet | Chou, Kenny F. Boyd, Alexander D. Best, Virginia Colburn, H. Steven Sen, Kamal |
author_sort | Chou, Kenny F. |
collection | PubMed |
description | Listening in an acoustically cluttered scene remains a difficult task for both machines and hearing-impaired listeners. Normal-hearing listeners accomplish this task with relative ease by segregating the scene into its constituent sound sources, then selecting and attending to a target source. An assistive listening device that mimics the biological mechanisms underlying this behavior may provide an effective solution for those with difficulty listening in acoustically cluttered environments (e.g., a cocktail party). Here, we present a binaural sound segregation algorithm based on a hierarchical network model of the auditory system. In the algorithm, binaural sound inputs first drive populations of neurons tuned to specific spatial locations and frequencies. The spiking response of neurons in the output layer are then reconstructed into audible waveforms via a novel reconstruction method. We evaluate the performance of the algorithm with a speech-on-speech intelligibility task in normal-hearing listeners. This two-microphone-input algorithm is shown to provide listeners with perceptual benefit similar to that of a 16-microphone acoustic beamformer. These results demonstrate the promise of this biologically inspired algorithm for enhancing selective listening in challenging multi-talker scenes. |
format | Online Article Text |
id | pubmed-9614053 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-96140532022-10-29 A biologically oriented algorithm for spatial sound segregation Chou, Kenny F. Boyd, Alexander D. Best, Virginia Colburn, H. Steven Sen, Kamal Front Neurosci Neuroscience Listening in an acoustically cluttered scene remains a difficult task for both machines and hearing-impaired listeners. Normal-hearing listeners accomplish this task with relative ease by segregating the scene into its constituent sound sources, then selecting and attending to a target source. An assistive listening device that mimics the biological mechanisms underlying this behavior may provide an effective solution for those with difficulty listening in acoustically cluttered environments (e.g., a cocktail party). Here, we present a binaural sound segregation algorithm based on a hierarchical network model of the auditory system. In the algorithm, binaural sound inputs first drive populations of neurons tuned to specific spatial locations and frequencies. The spiking response of neurons in the output layer are then reconstructed into audible waveforms via a novel reconstruction method. We evaluate the performance of the algorithm with a speech-on-speech intelligibility task in normal-hearing listeners. This two-microphone-input algorithm is shown to provide listeners with perceptual benefit similar to that of a 16-microphone acoustic beamformer. These results demonstrate the promise of this biologically inspired algorithm for enhancing selective listening in challenging multi-talker scenes. Frontiers Media S.A. 2022-10-14 /pmc/articles/PMC9614053/ /pubmed/36312015 http://dx.doi.org/10.3389/fnins.2022.1004071 Text en Copyright © 2022 Chou, Boyd, Best, Colburn and Sen. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Chou, Kenny F. Boyd, Alexander D. Best, Virginia Colburn, H. Steven Sen, Kamal A biologically oriented algorithm for spatial sound segregation |
title | A biologically oriented algorithm for spatial sound segregation |
title_full | A biologically oriented algorithm for spatial sound segregation |
title_fullStr | A biologically oriented algorithm for spatial sound segregation |
title_full_unstemmed | A biologically oriented algorithm for spatial sound segregation |
title_short | A biologically oriented algorithm for spatial sound segregation |
title_sort | biologically oriented algorithm for spatial sound segregation |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9614053/ https://www.ncbi.nlm.nih.gov/pubmed/36312015 http://dx.doi.org/10.3389/fnins.2022.1004071 |
work_keys_str_mv | AT choukennyf abiologicallyorientedalgorithmforspatialsoundsegregation AT boydalexanderd abiologicallyorientedalgorithmforspatialsoundsegregation AT bestvirginia abiologicallyorientedalgorithmforspatialsoundsegregation AT colburnhsteven abiologicallyorientedalgorithmforspatialsoundsegregation AT senkamal abiologicallyorientedalgorithmforspatialsoundsegregation AT choukennyf biologicallyorientedalgorithmforspatialsoundsegregation AT boydalexanderd biologicallyorientedalgorithmforspatialsoundsegregation AT bestvirginia biologicallyorientedalgorithmforspatialsoundsegregation AT colburnhsteven biologicallyorientedalgorithmforspatialsoundsegregation AT senkamal biologicallyorientedalgorithmforspatialsoundsegregation |