Cargando…
A hybrid technique for speech segregation and classification using a sophisticated deep neural network
Recent research on speech segregation and music fingerprinting has led to improvements in speech segregation and music identification algorithms. Speech and music segregation generally involves the identification of music followed by speech segregation. However, music segregation becomes a challengi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860734/ https://www.ncbi.nlm.nih.gov/pubmed/29558485 http://dx.doi.org/10.1371/journal.pone.0194151 |
_version_ | 1783307990832513024 |
---|---|
author | Qazi, Khurram Ashfaq Nawaz, Tabassam Mehmood, Zahid Rashid, Muhammad Habib, Hafiz Adnan |
author_facet | Qazi, Khurram Ashfaq Nawaz, Tabassam Mehmood, Zahid Rashid, Muhammad Habib, Hafiz Adnan |
author_sort | Qazi, Khurram Ashfaq |
collection | PubMed |
description | Recent research on speech segregation and music fingerprinting has led to improvements in speech segregation and music identification algorithms. Speech and music segregation generally involves the identification of music followed by speech segregation. However, music segregation becomes a challenging task in the presence of noise. This paper proposes a novel method of speech segregation for unlabelled stationary noisy audio signals using the deep belief network (DBN) model. The proposed method successfully segregates a music signal from noisy audio streams. A recurrent neural network (RNN)-based hidden layer segregation model is applied to remove stationary noise. Dictionary-based fisher algorithms are employed for speech classification. The proposed method is tested on three datasets (TIMIT, MIR-1K, and MusicBrainz), and the results indicate the robustness of proposed method for speech segregation. The qualitative and quantitative analysis carried out on three datasets demonstrate the efficiency of the proposed method compared to the state-of-the-art speech segregation and classification-based methods. |
format | Online Article Text |
id | pubmed-5860734 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-58607342018-03-28 A hybrid technique for speech segregation and classification using a sophisticated deep neural network Qazi, Khurram Ashfaq Nawaz, Tabassam Mehmood, Zahid Rashid, Muhammad Habib, Hafiz Adnan PLoS One Research Article Recent research on speech segregation and music fingerprinting has led to improvements in speech segregation and music identification algorithms. Speech and music segregation generally involves the identification of music followed by speech segregation. However, music segregation becomes a challenging task in the presence of noise. This paper proposes a novel method of speech segregation for unlabelled stationary noisy audio signals using the deep belief network (DBN) model. The proposed method successfully segregates a music signal from noisy audio streams. A recurrent neural network (RNN)-based hidden layer segregation model is applied to remove stationary noise. Dictionary-based fisher algorithms are employed for speech classification. The proposed method is tested on three datasets (TIMIT, MIR-1K, and MusicBrainz), and the results indicate the robustness of proposed method for speech segregation. The qualitative and quantitative analysis carried out on three datasets demonstrate the efficiency of the proposed method compared to the state-of-the-art speech segregation and classification-based methods. Public Library of Science 2018-03-20 /pmc/articles/PMC5860734/ /pubmed/29558485 http://dx.doi.org/10.1371/journal.pone.0194151 Text en © 2018 Qazi et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Qazi, Khurram Ashfaq Nawaz, Tabassam Mehmood, Zahid Rashid, Muhammad Habib, Hafiz Adnan A hybrid technique for speech segregation and classification using a sophisticated deep neural network |
title | A hybrid technique for speech segregation and classification using a sophisticated deep neural network |
title_full | A hybrid technique for speech segregation and classification using a sophisticated deep neural network |
title_fullStr | A hybrid technique for speech segregation and classification using a sophisticated deep neural network |
title_full_unstemmed | A hybrid technique for speech segregation and classification using a sophisticated deep neural network |
title_short | A hybrid technique for speech segregation and classification using a sophisticated deep neural network |
title_sort | hybrid technique for speech segregation and classification using a sophisticated deep neural network |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5860734/ https://www.ncbi.nlm.nih.gov/pubmed/29558485 http://dx.doi.org/10.1371/journal.pone.0194151 |
work_keys_str_mv | AT qazikhurramashfaq ahybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT nawaztabassam ahybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT mehmoodzahid ahybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT rashidmuhammad ahybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT habibhafizadnan ahybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT qazikhurramashfaq hybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT nawaztabassam hybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT mehmoodzahid hybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT rashidmuhammad hybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork AT habibhafizadnan hybridtechniqueforspeechsegregationandclassificationusingasophisticateddeepneuralnetwork |