Cargando…

Development of High Accuracy Classifier for the Speaker Recognition System

Speech signal is enriched with plenty of features used for biometrical recognition and other applications like gender and emotional recognition. Channel conditions manifested by background noise and reverberation are the main challenges causing feature shifts in the test and training data. In this p...

Descripción completa

Detalles Bibliográficos
Autores principales:	Al-Hassani, Raghad Tariq, Atilla, Dogu Cagdas, Aydin, Çağatay
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi 2021
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8159638/ https://www.ncbi.nlm.nih.gov/pubmed/34104204 http://dx.doi.org/10.1155/2021/5559616

_version_	1783700131691888640
author	Al-Hassani, Raghad Tariq Atilla, Dogu Cagdas Aydin, Çağatay
author_facet	Al-Hassani, Raghad Tariq Atilla, Dogu Cagdas Aydin, Çağatay
author_sort	Al-Hassani, Raghad Tariq
collection	PubMed
description	Speech signal is enriched with plenty of features used for biometrical recognition and other applications like gender and emotional recognition. Channel conditions manifested by background noise and reverberation are the main challenges causing feature shifts in the test and training data. In this paper, a hybrid speaker identification model for consistent speech features and high recognition accuracy is made. Features using Mel frequency spectrum coefficients (MFCC) have been improved by incorporating a pitch frequency coefficient from speech time domain analysis. In order to enhance noise immunity, we proposed a single hidden layer feed-forward neural network (FFNN) tuned by an optimized particle swarm optimization (OPSO) algorithm. The proposed model is tested using 10-fold cross-validation over different levels of Adaptive White Gaussian Noise (AWGN) (0-50 dB). A recognition accuracy of 97.83% was obtained from the proposed model in clean voice environments. However, a noisy channel is realized with lesser impact on the proposed model as compared with other baseline classifiers such as plain-FFNN, random forest (RF), K-nearest neighbour (KNN), and support vector machine (SVM).
format	Online Article Text
id	pubmed-8159638
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Hindawi
record_format	MEDLINE/PubMed
spelling	pubmed-81596382021-06-07 Development of High Accuracy Classifier for the Speaker Recognition System Al-Hassani, Raghad Tariq Atilla, Dogu Cagdas Aydin, Çağatay Appl Bionics Biomech Research Article Speech signal is enriched with plenty of features used for biometrical recognition and other applications like gender and emotional recognition. Channel conditions manifested by background noise and reverberation are the main challenges causing feature shifts in the test and training data. In this paper, a hybrid speaker identification model for consistent speech features and high recognition accuracy is made. Features using Mel frequency spectrum coefficients (MFCC) have been improved by incorporating a pitch frequency coefficient from speech time domain analysis. In order to enhance noise immunity, we proposed a single hidden layer feed-forward neural network (FFNN) tuned by an optimized particle swarm optimization (OPSO) algorithm. The proposed model is tested using 10-fold cross-validation over different levels of Adaptive White Gaussian Noise (AWGN) (0-50 dB). A recognition accuracy of 97.83% was obtained from the proposed model in clean voice environments. However, a noisy channel is realized with lesser impact on the proposed model as compared with other baseline classifiers such as plain-FFNN, random forest (RF), K-nearest neighbour (KNN), and support vector machine (SVM). Hindawi 2021-05-19 /pmc/articles/PMC8159638/ /pubmed/34104204 http://dx.doi.org/10.1155/2021/5559616 Text en Copyright © 2021 Raghad Tariq Al-Hassani et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Al-Hassani, Raghad Tariq Atilla, Dogu Cagdas Aydin, Çağatay Development of High Accuracy Classifier for the Speaker Recognition System
title	Development of High Accuracy Classifier for the Speaker Recognition System
title_full	Development of High Accuracy Classifier for the Speaker Recognition System
title_fullStr	Development of High Accuracy Classifier for the Speaker Recognition System
title_full_unstemmed	Development of High Accuracy Classifier for the Speaker Recognition System
title_short	Development of High Accuracy Classifier for the Speaker Recognition System
title_sort	development of high accuracy classifier for the speaker recognition system
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8159638/ https://www.ncbi.nlm.nih.gov/pubmed/34104204 http://dx.doi.org/10.1155/2021/5559616
work_keys_str_mv	AT alhassaniraghadtariq developmentofhighaccuracyclassifierforthespeakerrecognitionsystem AT atilladogucagdas developmentofhighaccuracyclassifierforthespeakerrecognitionsystem AT aydincagatay developmentofhighaccuracyclassifierforthespeakerrecognitionsystem

Development of High Accuracy Classifier for the Speaker Recognition System

Ejemplares similares