Cargando…

Incorporating Noise Robustness in Speech Command Recognition by Noise Augmentation of Training Data

The advent of new devices, technology, machine learning techniques, and the availability of free large speech corpora results in rapid and accurate speech recognition. In the last two decades, extensive research has been initiated by researchers and different organizations to experiment with new tec...

Descripción completa

Detalles Bibliográficos
Autores principales: Pervaiz, Ayesha, Hussain, Fawad, Israr, Huma, Tahir, Muhammad Ali, Raja, Fawad Riasat, Baloch, Naveed Khan, Ishmanov, Farruh, Zikria, Yousaf Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7219662/
https://www.ncbi.nlm.nih.gov/pubmed/32325814
http://dx.doi.org/10.3390/s20082326
_version_ 1783533025251819520
author Pervaiz, Ayesha
Hussain, Fawad
Israr, Huma
Tahir, Muhammad Ali
Raja, Fawad Riasat
Baloch, Naveed Khan
Ishmanov, Farruh
Zikria, Yousaf Bin
author_facet Pervaiz, Ayesha
Hussain, Fawad
Israr, Huma
Tahir, Muhammad Ali
Raja, Fawad Riasat
Baloch, Naveed Khan
Ishmanov, Farruh
Zikria, Yousaf Bin
author_sort Pervaiz, Ayesha
collection PubMed
description The advent of new devices, technology, machine learning techniques, and the availability of free large speech corpora results in rapid and accurate speech recognition. In the last two decades, extensive research has been initiated by researchers and different organizations to experiment with new techniques and their applications in speech processing systems. There are several speech command based applications in the area of robotics, IoT, ubiquitous computing, and different human-computer interfaces. Various researchers have worked on enhancing the efficiency of speech command based systems and used the speech command dataset. However, none of them catered to noise in the same. Noise is one of the major challenges in any speech recognition system, as real-time noise is a very versatile and unavoidable factor that affects the performance of speech recognition systems, particularly those that have not learned the noise efficiently. We thoroughly analyse the latest trends in speech recognition and evaluate the speech command dataset on different machine learning based and deep learning based techniques. A novel technique is proposed for noise robustness by augmenting noise in training data. Our proposed technique is tested on clean and noisy data along with locally generated data and achieves much better results than existing state-of-the-art techniques, thus setting a new benchmark.
format Online
Article
Text
id pubmed-7219662
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-72196622020-05-22 Incorporating Noise Robustness in Speech Command Recognition by Noise Augmentation of Training Data Pervaiz, Ayesha Hussain, Fawad Israr, Huma Tahir, Muhammad Ali Raja, Fawad Riasat Baloch, Naveed Khan Ishmanov, Farruh Zikria, Yousaf Bin Sensors (Basel) Article The advent of new devices, technology, machine learning techniques, and the availability of free large speech corpora results in rapid and accurate speech recognition. In the last two decades, extensive research has been initiated by researchers and different organizations to experiment with new techniques and their applications in speech processing systems. There are several speech command based applications in the area of robotics, IoT, ubiquitous computing, and different human-computer interfaces. Various researchers have worked on enhancing the efficiency of speech command based systems and used the speech command dataset. However, none of them catered to noise in the same. Noise is one of the major challenges in any speech recognition system, as real-time noise is a very versatile and unavoidable factor that affects the performance of speech recognition systems, particularly those that have not learned the noise efficiently. We thoroughly analyse the latest trends in speech recognition and evaluate the speech command dataset on different machine learning based and deep learning based techniques. A novel technique is proposed for noise robustness by augmenting noise in training data. Our proposed technique is tested on clean and noisy data along with locally generated data and achieves much better results than existing state-of-the-art techniques, thus setting a new benchmark. MDPI 2020-04-19 /pmc/articles/PMC7219662/ /pubmed/32325814 http://dx.doi.org/10.3390/s20082326 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Pervaiz, Ayesha
Hussain, Fawad
Israr, Huma
Tahir, Muhammad Ali
Raja, Fawad Riasat
Baloch, Naveed Khan
Ishmanov, Farruh
Zikria, Yousaf Bin
Incorporating Noise Robustness in Speech Command Recognition by Noise Augmentation of Training Data
title Incorporating Noise Robustness in Speech Command Recognition by Noise Augmentation of Training Data
title_full Incorporating Noise Robustness in Speech Command Recognition by Noise Augmentation of Training Data
title_fullStr Incorporating Noise Robustness in Speech Command Recognition by Noise Augmentation of Training Data
title_full_unstemmed Incorporating Noise Robustness in Speech Command Recognition by Noise Augmentation of Training Data
title_short Incorporating Noise Robustness in Speech Command Recognition by Noise Augmentation of Training Data
title_sort incorporating noise robustness in speech command recognition by noise augmentation of training data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7219662/
https://www.ncbi.nlm.nih.gov/pubmed/32325814
http://dx.doi.org/10.3390/s20082326
work_keys_str_mv AT pervaizayesha incorporatingnoiserobustnessinspeechcommandrecognitionbynoiseaugmentationoftrainingdata
AT hussainfawad incorporatingnoiserobustnessinspeechcommandrecognitionbynoiseaugmentationoftrainingdata
AT israrhuma incorporatingnoiserobustnessinspeechcommandrecognitionbynoiseaugmentationoftrainingdata
AT tahirmuhammadali incorporatingnoiserobustnessinspeechcommandrecognitionbynoiseaugmentationoftrainingdata
AT rajafawadriasat incorporatingnoiserobustnessinspeechcommandrecognitionbynoiseaugmentationoftrainingdata
AT balochnaveedkhan incorporatingnoiserobustnessinspeechcommandrecognitionbynoiseaugmentationoftrainingdata
AT ishmanovfarruh incorporatingnoiserobustnessinspeechcommandrecognitionbynoiseaugmentationoftrainingdata
AT zikriayousafbin incorporatingnoiserobustnessinspeechcommandrecognitionbynoiseaugmentationoftrainingdata