Cargando…

A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency Masking

Beamforming is a type of audio array processing techniques used for interference reduction, sound source localization, and as pre-processing stage for audio event classification and speaker identification. The auditory scene analysis community can benefit from a systemic evaluation and comparison be...

Descripción completa

Detalles Bibliográficos
Autor principal: Rascon, Caleb
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8347759/
https://www.ncbi.nlm.nih.gov/pubmed/34372248
http://dx.doi.org/10.3390/s21155005
_version_ 1783735171481075712
author Rascon, Caleb
author_facet Rascon, Caleb
author_sort Rascon, Caleb
collection PubMed
description Beamforming is a type of audio array processing techniques used for interference reduction, sound source localization, and as pre-processing stage for audio event classification and speaker identification. The auditory scene analysis community can benefit from a systemic evaluation and comparison between different beamforming techniques. In this paper, five popular beamforming techniques are evaluated in two different acoustic environments, while varying the number of microphones, the number of interferences, and the direction-of-arrival error, by using the Acoustic Interactions for Robot Audition (AIRA) corpus and a common software framework. Additionally, a highly efficient phase-based frequency masking beamformer is also evaluated, which is shown to outperform all five techniques. Both the evaluation corpus and the beamforming implementations are freely available and provided for experiment repeatability and transparency. Raw results are also provided as a complement to this work to the reader, to facilitate an informed decision of which technique to use. Finally, the insights and tendencies observed from the evaluation results are presented.
format Online
Article
Text
id pubmed-8347759
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83477592021-08-08 A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency Masking Rascon, Caleb Sensors (Basel) Article Beamforming is a type of audio array processing techniques used for interference reduction, sound source localization, and as pre-processing stage for audio event classification and speaker identification. The auditory scene analysis community can benefit from a systemic evaluation and comparison between different beamforming techniques. In this paper, five popular beamforming techniques are evaluated in two different acoustic environments, while varying the number of microphones, the number of interferences, and the direction-of-arrival error, by using the Acoustic Interactions for Robot Audition (AIRA) corpus and a common software framework. Additionally, a highly efficient phase-based frequency masking beamformer is also evaluated, which is shown to outperform all five techniques. Both the evaluation corpus and the beamforming implementations are freely available and provided for experiment repeatability and transparency. Raw results are also provided as a complement to this work to the reader, to facilitate an informed decision of which technique to use. Finally, the insights and tendencies observed from the evaluation results are presented. MDPI 2021-07-23 /pmc/articles/PMC8347759/ /pubmed/34372248 http://dx.doi.org/10.3390/s21155005 Text en © 2021 by the author. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Rascon, Caleb
A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency Masking
title A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency Masking
title_full A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency Masking
title_fullStr A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency Masking
title_full_unstemmed A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency Masking
title_short A Corpus-Based Evaluation of Beamforming Techniques and Phase-Based Frequency Masking
title_sort corpus-based evaluation of beamforming techniques and phase-based frequency masking
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8347759/
https://www.ncbi.nlm.nih.gov/pubmed/34372248
http://dx.doi.org/10.3390/s21155005
work_keys_str_mv AT rasconcaleb acorpusbasedevaluationofbeamformingtechniquesandphasebasedfrequencymasking
AT rasconcaleb corpusbasedevaluationofbeamformingtechniquesandphasebasedfrequencymasking