Cargando…

A hybrid technique for speech segregation and classification using a sophisticated deep neural network

Recent research on speech segregation and music fingerprinting has led to improvements in speech segregation and music identification algorithms. Speech and music segregation generally involves the identification of music followed by speech segregation. However, music segregation becomes a challengi...

Descripción completa

Detalles Bibliográficos
Autores principales: Qazi, Khurram Ashfaq, Nawaz, Tabassam, Mehmood, Zahid, Rashid, Muhammad, Habib, Hafiz Adnan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860734/
https://www.ncbi.nlm.nih.gov/pubmed/29558485
http://dx.doi.org/10.1371/journal.pone.0194151
_version_ 1783307990832513024
author Qazi, Khurram Ashfaq
Nawaz, Tabassam
Mehmood, Zahid
Rashid, Muhammad
Habib, Hafiz Adnan
author_facet Qazi, Khurram Ashfaq
Nawaz, Tabassam
Mehmood, Zahid
Rashid, Muhammad
Habib, Hafiz Adnan
author_sort Qazi, Khurram Ashfaq
collection PubMed
description Recent research on speech segregation and music fingerprinting has led to improvements in speech segregation and music identification algorithms. Speech and music segregation generally involves the identification of music followed by speech segregation. However, music segregation becomes a challenging task in the presence of noise. This paper proposes a novel method of speech segregation for unlabelled stationary noisy audio signals using the deep belief network (DBN) model. The proposed method successfully segregates a music signal from noisy audio streams. A recurrent neural network (RNN)-based hidden layer segregation model is applied to remove stationary noise. Dictionary-based fisher algorithms are employed for speech classification. The proposed method is tested on three datasets (TIMIT, MIR-1K, and MusicBrainz), and the results indicate the robustness of proposed method for speech segregation. The qualitative and quantitative analysis carried out on three datasets demonstrate the efficiency of the proposed method compared to the state-of-the-art speech segregation and classification-based methods.
format Online
Article
Text
id pubmed-5860734
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-58607342018-03-28 A hybrid technique for speech segregation and classification using a sophisticated deep neural network Qazi, Khurram Ashfaq Nawaz, Tabassam Mehmood, Zahid Rashid, Muhammad Habib, Hafiz Adnan PLoS One Research Article Recent research on speech segregation and music fingerprinting has led to improvements in speech segregation and music identification algorithms. Speech and music segregation generally involves the identification of music followed by speech segregation. However, music segregation becomes a challenging task in the presence of noise. This paper proposes a novel method of speech segregation for unlabelled stationary noisy audio signals using the deep belief network (DBN) model. The proposed method successfully segregates a music signal from noisy audio streams. A recurrent neural network (RNN)-based hidden layer segregation model is applied to remove stationary noise. Dictionary-based fisher algorithms are employed for speech classification. The proposed method is tested on three datasets (TIMIT, MIR-1K, and MusicBrainz), and the results indicate the robustness of proposed method for speech segregation. The qualitative and quantitative analysis carried out on three datasets demonstrate the efficiency of the proposed method compared to the state-of-the-art speech segregation and classification-based methods. Public Library of Science 2018-03-20 /pmc/articles/PMC5860734/ /pubmed/29558485 http://dx.doi.org/10.1371/journal.pone.0194151 Text en © 2018 Qazi et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Qazi, Khurram Ashfaq
Nawaz, Tabassam
Mehmood, Zahid
Rashid, Muhammad
Habib, Hafiz Adnan
A hybrid technique for speech segregation and classification using a sophisticated deep neural network
title A hybrid technique for speech segregation and classification using a sophisticated deep neural network
title_full A hybrid technique for speech segregation and classification using a sophisticated deep neural network
title_fullStr A hybrid technique for speech segregation and classification using a sophisticated deep neural network
title_full_unstemmed A hybrid technique for speech segregation and classification using a sophisticated deep neural network
title_short A hybrid technique for speech segregation and classification using a sophisticated deep neural network
title_sort hybrid technique for speech segregation and classification using a sophisticated deep neural network
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860734/
https://www.ncbi.nlm.nih.gov/pubmed/29558485
http://dx.doi.org/10.1371/journal.pone.0194151
work_keys_str_mv AT qazikhurramashfaq ahybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork
AT nawaztabassam ahybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork
AT mehmoodzahid ahybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork
AT rashidmuhammad ahybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork
AT habibhafizadnan ahybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork
AT qazikhurramashfaq hybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork
AT nawaztabassam hybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork
AT mehmoodzahid hybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork
AT rashidmuhammad hybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork
AT habibhafizadnan hybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork