Cargando…

Unsupervised Single-Channel Singing Voice Separation with Weighted Robust Principal Component Analysis Based on Gammatone Auditory Filterbank and Vocal Activity Detection

Singing-voice separation is a separation task that involves a singing voice and musical accompaniment. In this paper, we propose a novel, unsupervised methodology for extracting a singing voice from the background in a musical mixture. This method is a modification of robust principal component anal...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Feng, Hu, Yujun, Wang, Lingling
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10056690/
https://www.ncbi.nlm.nih.gov/pubmed/36991724
http://dx.doi.org/10.3390/s23063015
_version_ 1785016185607159808
author Li, Feng
Hu, Yujun
Wang, Lingling
author_facet Li, Feng
Hu, Yujun
Wang, Lingling
author_sort Li, Feng
collection PubMed
description Singing-voice separation is a separation task that involves a singing voice and musical accompaniment. In this paper, we propose a novel, unsupervised methodology for extracting a singing voice from the background in a musical mixture. This method is a modification of robust principal component analysis (RPCA) that separates a singing voice by using weighting based on gammatone filterbank and vocal activity detection. Although RPCA is a helpful method for separating voices from the music mixture, it fails when one single value, such as drums, is much larger than others (e.g., the accompanying instruments). As a result, the proposed approach takes advantage of varying values between low-rank (background) and sparse matrices (singing voice). Additionally, we propose an expanded RPCA on the cochleagram by utilizing coalescent masking on the gammatone. Finally, we utilize vocal activity detection to enhance the separation outcomes by eliminating the lingering music signal. Evaluation results reveal that the proposed approach provides superior separation outcomes than RPCA on ccMixter and DSD100 datasets.
format Online
Article
Text
id pubmed-10056690
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100566902023-03-30 Unsupervised Single-Channel Singing Voice Separation with Weighted Robust Principal Component Analysis Based on Gammatone Auditory Filterbank and Vocal Activity Detection Li, Feng Hu, Yujun Wang, Lingling Sensors (Basel) Article Singing-voice separation is a separation task that involves a singing voice and musical accompaniment. In this paper, we propose a novel, unsupervised methodology for extracting a singing voice from the background in a musical mixture. This method is a modification of robust principal component analysis (RPCA) that separates a singing voice by using weighting based on gammatone filterbank and vocal activity detection. Although RPCA is a helpful method for separating voices from the music mixture, it fails when one single value, such as drums, is much larger than others (e.g., the accompanying instruments). As a result, the proposed approach takes advantage of varying values between low-rank (background) and sparse matrices (singing voice). Additionally, we propose an expanded RPCA on the cochleagram by utilizing coalescent masking on the gammatone. Finally, we utilize vocal activity detection to enhance the separation outcomes by eliminating the lingering music signal. Evaluation results reveal that the proposed approach provides superior separation outcomes than RPCA on ccMixter and DSD100 datasets. MDPI 2023-03-10 /pmc/articles/PMC10056690/ /pubmed/36991724 http://dx.doi.org/10.3390/s23063015 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, Feng
Hu, Yujun
Wang, Lingling
Unsupervised Single-Channel Singing Voice Separation with Weighted Robust Principal Component Analysis Based on Gammatone Auditory Filterbank and Vocal Activity Detection
title Unsupervised Single-Channel Singing Voice Separation with Weighted Robust Principal Component Analysis Based on Gammatone Auditory Filterbank and Vocal Activity Detection
title_full Unsupervised Single-Channel Singing Voice Separation with Weighted Robust Principal Component Analysis Based on Gammatone Auditory Filterbank and Vocal Activity Detection
title_fullStr Unsupervised Single-Channel Singing Voice Separation with Weighted Robust Principal Component Analysis Based on Gammatone Auditory Filterbank and Vocal Activity Detection
title_full_unstemmed Unsupervised Single-Channel Singing Voice Separation with Weighted Robust Principal Component Analysis Based on Gammatone Auditory Filterbank and Vocal Activity Detection
title_short Unsupervised Single-Channel Singing Voice Separation with Weighted Robust Principal Component Analysis Based on Gammatone Auditory Filterbank and Vocal Activity Detection
title_sort unsupervised single-channel singing voice separation with weighted robust principal component analysis based on gammatone auditory filterbank and vocal activity detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10056690/
https://www.ncbi.nlm.nih.gov/pubmed/36991724
http://dx.doi.org/10.3390/s23063015
work_keys_str_mv AT lifeng unsupervisedsinglechannelsingingvoiceseparationwithweightedrobustprincipalcomponentanalysisbasedongammatoneauditoryfilterbankandvocalactivitydetection
AT huyujun unsupervisedsinglechannelsingingvoiceseparationwithweightedrobustprincipalcomponentanalysisbasedongammatoneauditoryfilterbankandvocalactivitydetection
AT wanglingling unsupervisedsinglechannelsingingvoiceseparationwithweightedrobustprincipalcomponentanalysisbasedongammatoneauditoryfilterbankandvocalactivitydetection