Cargando…

Unsupervised Learning for Monaural Source Separation Using Maximization–Minimization Algorithm with Time–Frequency Deconvolution †

This paper presents an unsupervised learning algorithm for sparse nonnegative matrix factor time–frequency deconvolution with optimized fractional [Formula: see text]-divergence. The [Formula: see text]-divergence is a group of cost functions parametrized by a single parameter [Formula: see text]. T...

Descripción completa

Detalles Bibliográficos
Autores principales:	Woo, Wai Lok, Gao, Bin, Bouridane, Ahmed, Ling, Bingo Wing-Kuen, Chin, Cheng Siong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2018
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5982401/ https://www.ncbi.nlm.nih.gov/pubmed/29702629 http://dx.doi.org/10.3390/s18051371

_version_	1783328233271328768
author	Woo, Wai Lok Gao, Bin Bouridane, Ahmed Ling, Bingo Wing-Kuen Chin, Cheng Siong
author_facet	Woo, Wai Lok Gao, Bin Bouridane, Ahmed Ling, Bingo Wing-Kuen Chin, Cheng Siong
author_sort	Woo, Wai Lok
collection	PubMed
description	This paper presents an unsupervised learning algorithm for sparse nonnegative matrix factor time–frequency deconvolution with optimized fractional [Formula: see text]-divergence. The [Formula: see text]-divergence is a group of cost functions parametrized by a single parameter [Formula: see text]. The Itakura–Saito divergence, Kullback–Leibler divergence and Least Square distance are special cases that correspond to [Formula: see text] , respectively. This paper presents a generalized algorithm that uses a flexible range of [Formula: see text] that includes fractional values. It describes a maximization–minimization (MM) algorithm leading to the development of a fast convergence multiplicative update algorithm with guaranteed convergence. The proposed model operates in the time–frequency domain and decomposes an information-bearing matrix into two-dimensional deconvolution of factor matrices that represent the spectral dictionary and temporal codes. The deconvolution process has been optimized to yield sparse temporal codes through maximizing the likelihood of the observations. The paper also presents a method to estimate the fractional [Formula: see text] value. The method is demonstrated on separating audio mixtures recorded from a single channel. The paper shows that the extraction of the spectral dictionary and temporal codes is significantly more efficient by using the proposed algorithm and subsequently leads to better source separation performance. Experimental tests and comparisons with other factorization methods have been conducted to verify its efficacy.
format	Online Article Text
id	pubmed-5982401
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-59824012018-06-05 Unsupervised Learning for Monaural Source Separation Using Maximization–Minimization Algorithm with Time–Frequency Deconvolution † Woo, Wai Lok Gao, Bin Bouridane, Ahmed Ling, Bingo Wing-Kuen Chin, Cheng Siong Sensors (Basel) Article This paper presents an unsupervised learning algorithm for sparse nonnegative matrix factor time–frequency deconvolution with optimized fractional [Formula: see text]-divergence. The [Formula: see text]-divergence is a group of cost functions parametrized by a single parameter [Formula: see text]. The Itakura–Saito divergence, Kullback–Leibler divergence and Least Square distance are special cases that correspond to [Formula: see text] , respectively. This paper presents a generalized algorithm that uses a flexible range of [Formula: see text] that includes fractional values. It describes a maximization–minimization (MM) algorithm leading to the development of a fast convergence multiplicative update algorithm with guaranteed convergence. The proposed model operates in the time–frequency domain and decomposes an information-bearing matrix into two-dimensional deconvolution of factor matrices that represent the spectral dictionary and temporal codes. The deconvolution process has been optimized to yield sparse temporal codes through maximizing the likelihood of the observations. The paper also presents a method to estimate the fractional [Formula: see text] value. The method is demonstrated on separating audio mixtures recorded from a single channel. The paper shows that the extraction of the spectral dictionary and temporal codes is significantly more efficient by using the proposed algorithm and subsequently leads to better source separation performance. Experimental tests and comparisons with other factorization methods have been conducted to verify its efficacy. MDPI 2018-04-27 /pmc/articles/PMC5982401/ /pubmed/29702629 http://dx.doi.org/10.3390/s18051371 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Woo, Wai Lok Gao, Bin Bouridane, Ahmed Ling, Bingo Wing-Kuen Chin, Cheng Siong Unsupervised Learning for Monaural Source Separation Using Maximization–Minimization Algorithm with Time–Frequency Deconvolution †
title	Unsupervised Learning for Monaural Source Separation Using Maximization–Minimization Algorithm with Time–Frequency Deconvolution †
title_full	Unsupervised Learning for Monaural Source Separation Using Maximization–Minimization Algorithm with Time–Frequency Deconvolution †
title_fullStr	Unsupervised Learning for Monaural Source Separation Using Maximization–Minimization Algorithm with Time–Frequency Deconvolution †
title_full_unstemmed	Unsupervised Learning for Monaural Source Separation Using Maximization–Minimization Algorithm with Time–Frequency Deconvolution †
title_short	Unsupervised Learning for Monaural Source Separation Using Maximization–Minimization Algorithm with Time–Frequency Deconvolution †
title_sort	unsupervised learning for monaural source separation using maximization–minimization algorithm with time–frequency deconvolution †
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5982401/ https://www.ncbi.nlm.nih.gov/pubmed/29702629 http://dx.doi.org/10.3390/s18051371
work_keys_str_mv	AT woowailok unsupervisedlearningformonauralsourceseparationusingmaximizationminimizationalgorithmwithtimefrequencydeconvolution AT gaobin unsupervisedlearningformonauralsourceseparationusingmaximizationminimizationalgorithmwithtimefrequencydeconvolution AT bouridaneahmed unsupervisedlearningformonauralsourceseparationusingmaximizationminimizationalgorithmwithtimefrequencydeconvolution AT lingbingowingkuen unsupervisedlearningformonauralsourceseparationusingmaximizationminimizationalgorithmwithtimefrequencydeconvolution AT chinchengsiong unsupervisedlearningformonauralsourceseparationusingmaximizationminimizationalgorithmwithtimefrequencydeconvolution

Unsupervised Learning for Monaural Source Separation Using Maximization–Minimization Algorithm with Time–Frequency Deconvolution †

Ejemplares similares