Cargando…
Dual-Channel Cosine Function Based ITD Estimation for Robust Speech Separation
In speech separation tasks, many separation methods have the limitation that the microphones are closely spaced, which means that these methods are unprevailing for phase wrap-around. In this paper, we present a novel speech separation scheme by using two microphones that does not have this restrict...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5492097/ https://www.ncbi.nlm.nih.gov/pubmed/28632166 http://dx.doi.org/10.3390/s17061447 |
_version_ | 1783247257801326592 |
---|---|
author | Li, Xuliang Ding, Zhaogui Li, Weifeng Liao, Qingmin |
author_facet | Li, Xuliang Ding, Zhaogui Li, Weifeng Liao, Qingmin |
author_sort | Li, Xuliang |
collection | PubMed |
description | In speech separation tasks, many separation methods have the limitation that the microphones are closely spaced, which means that these methods are unprevailing for phase wrap-around. In this paper, we present a novel speech separation scheme by using two microphones that does not have this restriction. The technique utilizes the estimation of interaural time difference (ITD) statistics and binary time-frequency mask for the separation of mixed speech sources. The novelties of the paper consist in: (1) the extended application of delay-and-sum beamforming (DSB) and cosine function for ITD calculation; and (2) the clarification of the connection between ideal binary mask and DSB amplitude ratio. Our objective quality evaluation experiments demonstrate the effectiveness of the proposed method. |
format | Online Article Text |
id | pubmed-5492097 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-54920972017-07-03 Dual-Channel Cosine Function Based ITD Estimation for Robust Speech Separation Li, Xuliang Ding, Zhaogui Li, Weifeng Liao, Qingmin Sensors (Basel) Article In speech separation tasks, many separation methods have the limitation that the microphones are closely spaced, which means that these methods are unprevailing for phase wrap-around. In this paper, we present a novel speech separation scheme by using two microphones that does not have this restriction. The technique utilizes the estimation of interaural time difference (ITD) statistics and binary time-frequency mask for the separation of mixed speech sources. The novelties of the paper consist in: (1) the extended application of delay-and-sum beamforming (DSB) and cosine function for ITD calculation; and (2) the clarification of the connection between ideal binary mask and DSB amplitude ratio. Our objective quality evaluation experiments demonstrate the effectiveness of the proposed method. MDPI 2017-06-20 /pmc/articles/PMC5492097/ /pubmed/28632166 http://dx.doi.org/10.3390/s17061447 Text en © 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Li, Xuliang Ding, Zhaogui Li, Weifeng Liao, Qingmin Dual-Channel Cosine Function Based ITD Estimation for Robust Speech Separation |
title | Dual-Channel Cosine Function Based ITD Estimation for Robust Speech Separation |
title_full | Dual-Channel Cosine Function Based ITD Estimation for Robust Speech Separation |
title_fullStr | Dual-Channel Cosine Function Based ITD Estimation for Robust Speech Separation |
title_full_unstemmed | Dual-Channel Cosine Function Based ITD Estimation for Robust Speech Separation |
title_short | Dual-Channel Cosine Function Based ITD Estimation for Robust Speech Separation |
title_sort | dual-channel cosine function based itd estimation for robust speech separation |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5492097/ https://www.ncbi.nlm.nih.gov/pubmed/28632166 http://dx.doi.org/10.3390/s17061447 |
work_keys_str_mv | AT lixuliang dualchannelcosinefunctionbaseditdestimationforrobustspeechseparation AT dingzhaogui dualchannelcosinefunctionbaseditdestimationforrobustspeechseparation AT liweifeng dualchannelcosinefunctionbaseditdestimationforrobustspeechseparation AT liaoqingmin dualchannelcosinefunctionbaseditdestimationforrobustspeechseparation |