Cargando…

Speech Enhancement in the STFT Domain

This work addresses this problem in the short-time Fourier transform (STFT) domain. We divide the general problem into five basic categories depending on the number of microphones being used and whether the interframe or interband correlation is considered. The first category deals with the single-c...

Descripción completa

Detalles Bibliográficos
Autores principales:	Benesty, Jacob, Chen, Jingdong, Habets, Emanuël A P
Lenguaje:	eng
Publicado:	Springer 2012
Materias:	Engineering
Acceso en línea:	https://dx.doi.org/10.1007/978-3-642-23250-3 http://cds.cern.ch/record/1503846

_version_	1780927192816943104
author	Benesty, Jacob Chen, Jingdong Habets, Emanuël A P
author_facet	Benesty, Jacob Chen, Jingdong Habets, Emanuël A P
author_sort	Benesty, Jacob
collection	CERN
description	This work addresses this problem in the short-time Fourier transform (STFT) domain. We divide the general problem into five basic categories depending on the number of microphones being used and whether the interframe or interband correlation is considered. The first category deals with the single-channel problem where STFT coefficients at different frames and frequency bands are assumed to be independent. In this case, the noise reduction filter in each frequency band is basically a real gain. Since a gain does not improve the signal-to-noise ratio (SNR) for any given subband and frame, the noise reduction is basically achieved by liftering the subbands and frames that are less noisy while weighing down on those that are more noisy. The second category also concerns the single-channel problem. The difference is that now the interframe correlation is taken into account and a filter is applied in each subband instead of just a gain. The advantage of using the interframe correlation is that we can improve not only the long-time fullband SNR, but the frame-wise subband SNR as well. The third and fourth classes discuss the problem of multichannel noise reduction in the STFT domain with and without interframe correlation, respectively. In the last category, we consider the interband correlation in the design of the noise reduction filters. We illustrate the basic principle for the single-channel case as an example, while this concept can be generalized to other scenarios. In all categories, we propose different optimization cost functions from which we derive the optimal filters and we also define the performance measures that help analyzing them.
id	cern-1503846
institution	Organización Europea para la Investigación Nuclear
language	eng
publishDate	2012
publisher	Springer
record_format	invenio
spelling	cern-15038462021-04-21T23:52:55Zdoi:10.1007/978-3-642-23250-3http://cds.cern.ch/record/1503846engBenesty, JacobChen, JingdongHabets, Emanuël A PSpeech Enhancement in the STFT DomainEngineeringThis work addresses this problem in the short-time Fourier transform (STFT) domain. We divide the general problem into five basic categories depending on the number of microphones being used and whether the interframe or interband correlation is considered. The first category deals with the single-channel problem where STFT coefficients at different frames and frequency bands are assumed to be independent. In this case, the noise reduction filter in each frequency band is basically a real gain. Since a gain does not improve the signal-to-noise ratio (SNR) for any given subband and frame, the noise reduction is basically achieved by liftering the subbands and frames that are less noisy while weighing down on those that are more noisy. The second category also concerns the single-channel problem. The difference is that now the interframe correlation is taken into account and a filter is applied in each subband instead of just a gain. The advantage of using the interframe correlation is that we can improve not only the long-time fullband SNR, but the frame-wise subband SNR as well. The third and fourth classes discuss the problem of multichannel noise reduction in the STFT domain with and without interframe correlation, respectively. In the last category, we consider the interband correlation in the design of the noise reduction filters. We illustrate the basic principle for the single-channel case as an example, while this concept can be generalized to other scenarios. In all categories, we propose different optimization cost functions from which we derive the optimal filters and we also define the performance measures that help analyzing them.Springeroai:cds.cern.ch:15038462012
spellingShingle	Engineering Benesty, Jacob Chen, Jingdong Habets, Emanuël A P Speech Enhancement in the STFT Domain
title	Speech Enhancement in the STFT Domain
title_full	Speech Enhancement in the STFT Domain
title_fullStr	Speech Enhancement in the STFT Domain
title_full_unstemmed	Speech Enhancement in the STFT Domain
title_short	Speech Enhancement in the STFT Domain
title_sort	speech enhancement in the stft domain
topic	Engineering
url	https://dx.doi.org/10.1007/978-3-642-23250-3 http://cds.cern.ch/record/1503846
work_keys_str_mv	AT benestyjacob speechenhancementinthestftdomain AT chenjingdong speechenhancementinthestftdomain AT habetsemanuelap speechenhancementinthestftdomain

Speech Enhancement in the STFT Domain

Ejemplares similares