Cargando…

A hybrid technique for speech segregation and classification using a sophisticated deep neural network

Recent research on speech segregation and music fingerprinting has led to improvements in speech segregation and music identification algorithms. Speech and music segregation generally involves the identification of music followed by speech segregation. However, music segregation becomes a challengi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Qazi, Khurram Ashfaq, Nawaz, Tabassam, Mehmood, Zahid, Rashid, Muhammad, Habib, Hafiz Adnan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2018
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860734/ https://www.ncbi.nlm.nih.gov/pubmed/29558485 http://dx.doi.org/10.1371/journal.pone.0194151

_version_	1783307990832513024
author	Qazi, Khurram Ashfaq Nawaz, Tabassam Mehmood, Zahid Rashid, Muhammad Habib, Hafiz Adnan
author_facet	Qazi, Khurram Ashfaq Nawaz, Tabassam Mehmood, Zahid Rashid, Muhammad Habib, Hafiz Adnan
author_sort	Qazi, Khurram Ashfaq
collection	PubMed
description	Recent research on speech segregation and music fingerprinting has led to improvements in speech segregation and music identification algorithms. Speech and music segregation generally involves the identification of music followed by speech segregation. However, music segregation becomes a challenging task in the presence of noise. This paper proposes a novel method of speech segregation for unlabelled stationary noisy audio signals using the deep belief network (DBN) model. The proposed method successfully segregates a music signal from noisy audio streams. A recurrent neural network (RNN)-based hidden layer segregation model is applied to remove stationary noise. Dictionary-based fisher algorithms are employed for speech classification. The proposed method is tested on three datasets (TIMIT, MIR-1K, and MusicBrainz), and the results indicate the robustness of proposed method for speech segregation. The qualitative and quantitative analysis carried out on three datasets demonstrate the efficiency of the proposed method compared to the state-of-the-art speech segregation and classification-based methods.
format	Online Article Text
id	pubmed-5860734
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-58607342018-03-28 A hybrid technique for speech segregation and classification using a sophisticated deep neural network Qazi, Khurram Ashfaq Nawaz, Tabassam Mehmood, Zahid Rashid, Muhammad Habib, Hafiz Adnan PLoS One Research Article Recent research on speech segregation and music fingerprinting has led to improvements in speech segregation and music identification algorithms. Speech and music segregation generally involves the identification of music followed by speech segregation. However, music segregation becomes a challenging task in the presence of noise. This paper proposes a novel method of speech segregation for unlabelled stationary noisy audio signals using the deep belief network (DBN) model. The proposed method successfully segregates a music signal from noisy audio streams. A recurrent neural network (RNN)-based hidden layer segregation model is applied to remove stationary noise. Dictionary-based fisher algorithms are employed for speech classification. The proposed method is tested on three datasets (TIMIT, MIR-1K, and MusicBrainz), and the results indicate the robustness of proposed method for speech segregation. The qualitative and quantitative analysis carried out on three datasets demonstrate the efficiency of the proposed method compared to the state-of-the-art speech segregation and classification-based methods. Public Library of Science 2018-03-20 /pmc/articles/PMC5860734/ /pubmed/29558485 http://dx.doi.org/10.1371/journal.pone.0194151 Text en © 2018 Qazi et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Qazi, Khurram Ashfaq Nawaz, Tabassam Mehmood, Zahid Rashid, Muhammad Habib, Hafiz Adnan A hybrid technique for speech segregation and classification using a sophisticated deep neural network
title	A hybrid technique for speech segregation and classification using a sophisticated deep neural network
title_full	A hybrid technique for speech segregation and classification using a sophisticated deep neural network
title_fullStr	A hybrid technique for speech segregation and classification using a sophisticated deep neural network
title_full_unstemmed	A hybrid technique for speech segregation and classification using a sophisticated deep neural network
title_short	A hybrid technique for speech segregation and classification using a sophisticated deep neural network
title_sort	hybrid technique for speech segregation and classification using a sophisticated deep neural network
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860734/ https://www.ncbi.nlm.nih.gov/pubmed/29558485 http://dx.doi.org/10.1371/journal.pone.0194151
work_keys_str_mv	AT qazikhurramashfaq ahybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT nawaztabassam ahybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT mehmoodzahid ahybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT rashidmuhammad ahybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT habibhafizadnan ahybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT qazikhurramashfaq hybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT nawaztabassam hybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT mehmoodzahid hybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT rashidmuhammad hybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT habibhafizadnan hybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork

A hybrid technique for speech segregation and classification using a sophisticated deep neural network

Ejemplares similares