Cargando…

Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition

Direction-of-arrival (DOA) estimation is a fundamental technique in array signal processing due to its wide applications in beamforming, speech enhancement and many other assistive speech processing technologies. In this paper, we devise a novel DOA technique based on randomized singular value decom...

Descripción completa

Detalles Bibliográficos
Autores principales: TOKGOZ, SERKAN, PANAHI, ISSA M. S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8681871/
https://www.ncbi.nlm.nih.gov/pubmed/34926101
http://dx.doi.org/10.1109/access.2021.3130180
_version_ 1784617080573657088
author TOKGOZ, SERKAN
PANAHI, ISSA M. S.
author_facet TOKGOZ, SERKAN
PANAHI, ISSA M. S.
author_sort TOKGOZ, SERKAN
collection PubMed
description Direction-of-arrival (DOA) estimation is a fundamental technique in array signal processing due to its wide applications in beamforming, speech enhancement and many other assistive speech processing technologies. In this paper, we devise a novel DOA technique based on randomized singular value decomposition (RSVD) to improve the performance of non-uniform non-linear microphone arrays (NUNLA). The accurate and efficient singular value decomposition of large data matrices is computationally challenging, and randomization provides an effective tool for performing matrix approximation, therefore, the developed DOA estimation utilizes a modified dictionary-based RSVD method for localizing single speech sources under low signal-to-noise ratios (SNR). Unlike previous methods developed for uniform linear microphone arrays, the proposed approach with L-shaped three microphone setup has no ‘left-right’ ambiguity. We present the performance of our proposed method in comparison to other techniques. The demonstrated experiments shows at-least 20% performance improvement using simulated data and 25% performance improvement using real data when compared with similar DoA estimation techniques for NUNLA. The proposed method exploits frame-based online time delay of arrival (TDOA) measurements which facilitates the proposed algorithm to run on real-time devices. We also show an efficient real-time implementation of the proposed method on a Pixel 3 Android smartphone using its built-in three microphones for hearing aid applications.
format Online
Article
Text
id pubmed-8681871
institution National Center for Biotechnology Information
language English
publishDate 2021
record_format MEDLINE/PubMed
spelling pubmed-86818712021-12-17 Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition TOKGOZ, SERKAN PANAHI, ISSA M. S. IEEE Access Article Direction-of-arrival (DOA) estimation is a fundamental technique in array signal processing due to its wide applications in beamforming, speech enhancement and many other assistive speech processing technologies. In this paper, we devise a novel DOA technique based on randomized singular value decomposition (RSVD) to improve the performance of non-uniform non-linear microphone arrays (NUNLA). The accurate and efficient singular value decomposition of large data matrices is computationally challenging, and randomization provides an effective tool for performing matrix approximation, therefore, the developed DOA estimation utilizes a modified dictionary-based RSVD method for localizing single speech sources under low signal-to-noise ratios (SNR). Unlike previous methods developed for uniform linear microphone arrays, the proposed approach with L-shaped three microphone setup has no ‘left-right’ ambiguity. We present the performance of our proposed method in comparison to other techniques. The demonstrated experiments shows at-least 20% performance improvement using simulated data and 25% performance improvement using real data when compared with similar DoA estimation techniques for NUNLA. The proposed method exploits frame-based online time delay of arrival (TDOA) measurements which facilitates the proposed algorithm to run on real-time devices. We also show an efficient real-time implementation of the proposed method on a Pixel 3 Android smartphone using its built-in three microphones for hearing aid applications. 2021-11-23 2021 /pmc/articles/PMC8681871/ /pubmed/34926101 http://dx.doi.org/10.1109/access.2021.3130180 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
spellingShingle Article
TOKGOZ, SERKAN
PANAHI, ISSA M. S.
Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition
title Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition
title_full Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition
title_fullStr Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition
title_full_unstemmed Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition
title_short Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition
title_sort robust three-microphone speech source localization using randomized singular value decomposition
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8681871/
https://www.ncbi.nlm.nih.gov/pubmed/34926101
http://dx.doi.org/10.1109/access.2021.3130180
work_keys_str_mv AT tokgozserkan robustthreemicrophonespeechsourcelocalizationusingrandomizedsingularvaluedecomposition
AT panahiissams robustthreemicrophonespeechsourcelocalizationusingrandomizedsingularvaluedecomposition