Cargando…

Pashto isolated digits recognition using deep convolutional neural network

Speech recognition has become one of the most significant parts of human-computer interaction due to emergence of new technologies such as smartphone, smart watch and many modern technologies, therefore the need of an ASR for local languages is felt. The basic aim of this paper is to develop an isol...

Descripción completa

Detalles Bibliográficos
Autores principales: Zada, Bakht, Ullah, Rahim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7016387/
https://www.ncbi.nlm.nih.gov/pubmed/32083214
http://dx.doi.org/10.1016/j.heliyon.2020.e03372
_version_ 1783496977709793280
author Zada, Bakht
Ullah, Rahim
author_facet Zada, Bakht
Ullah, Rahim
author_sort Zada, Bakht
collection PubMed
description Speech recognition has become one of the most significant parts of human-computer interaction due to emergence of new technologies such as smartphone, smart watch and many modern technologies, therefore the need of an ASR for local languages is felt. The basic aim of this paper is to develop an isolated digits recognition for Pashto language, using deep CNN. The database of Pashto digits from 0 to 9 with 50 utterance for each digits is used. Twenty MFCC features extracted for each isolated digit and fed as input to CNN. The network has been used for the proposed system is deep up to 4 convolutional layers, followed by ReLU and max-pooling layers. The network has been trained on the 50% of data and the rest of the data was used for testing. The total average of 84.17% accuracy was achieved for testing which show 7.32% better performance as compared to existing similar works.
format Online
Article
Text
id pubmed-7016387
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-70163872020-02-20 Pashto isolated digits recognition using deep convolutional neural network Zada, Bakht Ullah, Rahim Heliyon Article Speech recognition has become one of the most significant parts of human-computer interaction due to emergence of new technologies such as smartphone, smart watch and many modern technologies, therefore the need of an ASR for local languages is felt. The basic aim of this paper is to develop an isolated digits recognition for Pashto language, using deep CNN. The database of Pashto digits from 0 to 9 with 50 utterance for each digits is used. Twenty MFCC features extracted for each isolated digit and fed as input to CNN. The network has been used for the proposed system is deep up to 4 convolutional layers, followed by ReLU and max-pooling layers. The network has been trained on the 50% of data and the rest of the data was used for testing. The total average of 84.17% accuracy was achieved for testing which show 7.32% better performance as compared to existing similar works. Elsevier 2020-02-12 /pmc/articles/PMC7016387/ /pubmed/32083214 http://dx.doi.org/10.1016/j.heliyon.2020.e03372 Text en © 2020 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Zada, Bakht
Ullah, Rahim
Pashto isolated digits recognition using deep convolutional neural network
title Pashto isolated digits recognition using deep convolutional neural network
title_full Pashto isolated digits recognition using deep convolutional neural network
title_fullStr Pashto isolated digits recognition using deep convolutional neural network
title_full_unstemmed Pashto isolated digits recognition using deep convolutional neural network
title_short Pashto isolated digits recognition using deep convolutional neural network
title_sort pashto isolated digits recognition using deep convolutional neural network
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7016387/
https://www.ncbi.nlm.nih.gov/pubmed/32083214
http://dx.doi.org/10.1016/j.heliyon.2020.e03372
work_keys_str_mv AT zadabakht pashtoisolateddigitsrecognitionusingdeepconvolutionalneuralnetwork
AT ullahrahim pashtoisolateddigitsrecognitionusingdeepconvolutionalneuralnetwork