
Dual-Channel Cosine Function Based ITD Estimation for Robust Speech Separation

In speech separation tasks, many separation methods require closely spaced microphones, which means that they break down when phase wrap-around occurs. In this paper, we present a novel two-microphone speech separation scheme that does not have this restrict...


Bibliographic Details
Main authors: Li, Xuliang; Ding, Zhaogui; Li, Weifeng; Liao, Qingmin
Format: Online Article Text
Language: English
Published: MDPI 2017
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5492097/
https://www.ncbi.nlm.nih.gov/pubmed/28632166
http://dx.doi.org/10.3390/s17061447
_version_ 1783247257801326592
author Li, Xuliang
Ding, Zhaogui
Li, Weifeng
Liao, Qingmin
author_facet Li, Xuliang
Ding, Zhaogui
Li, Weifeng
Liao, Qingmin
author_sort Li, Xuliang
collection PubMed
description In speech separation tasks, many separation methods require closely spaced microphones, which means that they break down when phase wrap-around occurs. In this paper, we present a novel two-microphone speech separation scheme that does not have this restriction. The technique uses estimated interaural time difference (ITD) statistics and a binary time-frequency mask to separate mixed speech sources. The novelties of the paper are: (1) the extended application of delay-and-sum beamforming (DSB) and the cosine function to ITD calculation; and (2) the clarification of the connection between the ideal binary mask and the DSB amplitude ratio. Our objective quality evaluation experiments demonstrate the effectiveness of the proposed method.
format Online
Article
Text
id pubmed-5492097
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-54920972017-07-03 Dual-Channel Cosine Function Based ITD Estimation for Robust Speech Separation Li, Xuliang Ding, Zhaogui Li, Weifeng Liao, Qingmin Sensors (Basel) Article In speech separation tasks, many separation methods require closely spaced microphones, which means that they break down when phase wrap-around occurs. In this paper, we present a novel two-microphone speech separation scheme that does not have this restriction. The technique uses estimated interaural time difference (ITD) statistics and a binary time-frequency mask to separate mixed speech sources. The novelties of the paper are: (1) the extended application of delay-and-sum beamforming (DSB) and the cosine function to ITD calculation; and (2) the clarification of the connection between the ideal binary mask and the DSB amplitude ratio. Our objective quality evaluation experiments demonstrate the effectiveness of the proposed method. MDPI 2017-06-20 /pmc/articles/PMC5492097/ /pubmed/28632166 http://dx.doi.org/10.3390/s17061447 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, Xuliang
Ding, Zhaogui
Li, Weifeng
Liao, Qingmin
Dual-Channel Cosine Function Based ITD Estimation for Robust Speech Separation
title Dual-Channel Cosine Function Based ITD Estimation for Robust Speech Separation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5492097/
https://www.ncbi.nlm.nih.gov/pubmed/28632166
http://dx.doi.org/10.3390/s17061447