Cargando…

Artificial Neural Networks Combined with the Principal Component Analysis for Non-Fluent Speech Recognition

The presented paper introduces principal component analysis application for dimensionality reduction of variables describing speech signal and applicability of obtained results for the disturbed and fluent speech recognition process. A set of fluent speech signals and three speech disturbances—block...

Descripción completa

Detalles Bibliográficos
Autores principales: Świetlicka, Izabela, Kuniszyk-Jóźkowiak, Wiesława, Świetlicki, Michał
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8749906/
https://www.ncbi.nlm.nih.gov/pubmed/35009863
http://dx.doi.org/10.3390/s22010321
_version_ 1784631340897927168
author Świetlicka, Izabela
Kuniszyk-Jóźkowiak, Wiesława
Świetlicki, Michał
author_facet Świetlicka, Izabela
Kuniszyk-Jóźkowiak, Wiesława
Świetlicki, Michał
author_sort Świetlicka, Izabela
collection PubMed
description The presented paper introduces principal component analysis application for dimensionality reduction of variables describing speech signal and applicability of obtained results for the disturbed and fluent speech recognition process. A set of fluent speech signals and three speech disturbances—blocks before words starting with plosives, syllable repetitions, and sound-initial prolongations—was transformed using principal component analysis. The result was a model containing four principal components describing analysed utterances. Distances between standardised original variables and elements of the observation matrix in a new system of coordinates were calculated and then applied in the recognition process. As a classifying algorithm, the multilayer perceptron network was used. Achieved results were compared with outcomes from previous experiments where speech samples were parameterised with the Kohonen network application. The classifying network achieved overall accuracy at 76% (from 50% to 91%, depending on the dysfluency type).
format Online
Article
Text
id pubmed-8749906
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-87499062022-01-12 Artificial Neural Networks Combined with the Principal Component Analysis for Non-Fluent Speech Recognition Świetlicka, Izabela Kuniszyk-Jóźkowiak, Wiesława Świetlicki, Michał Sensors (Basel) Article The presented paper introduces principal component analysis application for dimensionality reduction of variables describing speech signal and applicability of obtained results for the disturbed and fluent speech recognition process. A set of fluent speech signals and three speech disturbances—blocks before words starting with plosives, syllable repetitions, and sound-initial prolongations—was transformed using principal component analysis. The result was a model containing four principal components describing analysed utterances. Distances between standardised original variables and elements of the observation matrix in a new system of coordinates were calculated and then applied in the recognition process. As a classifying algorithm, the multilayer perceptron network was used. Achieved results were compared with outcomes from previous experiments where speech samples were parameterised with the Kohonen network application. The classifying network achieved overall accuracy at 76% (from 50% to 91%, depending on the dysfluency type). MDPI 2022-01-01 /pmc/articles/PMC8749906/ /pubmed/35009863 http://dx.doi.org/10.3390/s22010321 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Świetlicka, Izabela
Kuniszyk-Jóźkowiak, Wiesława
Świetlicki, Michał
Artificial Neural Networks Combined with the Principal Component Analysis for Non-Fluent Speech Recognition
title Artificial Neural Networks Combined with the Principal Component Analysis for Non-Fluent Speech Recognition
title_full Artificial Neural Networks Combined with the Principal Component Analysis for Non-Fluent Speech Recognition
title_fullStr Artificial Neural Networks Combined with the Principal Component Analysis for Non-Fluent Speech Recognition
title_full_unstemmed Artificial Neural Networks Combined with the Principal Component Analysis for Non-Fluent Speech Recognition
title_short Artificial Neural Networks Combined with the Principal Component Analysis for Non-Fluent Speech Recognition
title_sort artificial neural networks combined with the principal component analysis for non-fluent speech recognition
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8749906/
https://www.ncbi.nlm.nih.gov/pubmed/35009863
http://dx.doi.org/10.3390/s22010321
work_keys_str_mv AT swietlickaizabela artificialneuralnetworkscombinedwiththeprincipalcomponentanalysisfornonfluentspeechrecognition
AT kuniszykjozkowiakwiesława artificialneuralnetworkscombinedwiththeprincipalcomponentanalysisfornonfluentspeechrecognition
AT swietlickimichał artificialneuralnetworkscombinedwiththeprincipalcomponentanalysisfornonfluentspeechrecognition