Cargando…

Speech emotion classification using attention based network and regularized feature selection

Speech emotion classification (SEC) has gained the utmost height and occupied a conspicuous position within the research community in recent times. Its vital role in Human–Computer Interaction (HCI) and affective computing cannot be overemphasized. Many primitive algorithmic solutions and deep neura...

Descripción completa

Detalles Bibliográficos
Autores principales: Akinpelu, Samson, Viriri, Serestina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368662/
https://www.ncbi.nlm.nih.gov/pubmed/37491423
http://dx.doi.org/10.1038/s41598-023-38868-2
_version_ 1785077551651094528
author Akinpelu, Samson
Viriri, Serestina
author_facet Akinpelu, Samson
Viriri, Serestina
author_sort Akinpelu, Samson
collection PubMed
description Speech emotion classification (SEC) has gained the utmost height and occupied a conspicuous position within the research community in recent times. Its vital role in Human–Computer Interaction (HCI) and affective computing cannot be overemphasized. Many primitive algorithmic solutions and deep neural network (DNN) models have been proposed for efficient recognition of emotion from speech however, the suitability of these methods to accurately classify emotion from speech with multi-lingual background and other factors that impede efficient classification of emotion is still demanding critical consideration. This study proposed an attention-based network with a pre-trained convolutional neural network and regularized neighbourhood component analysis (RNCA) feature selection techniques for improved classification of speech emotion. The attention model has proven to be successful in many sequence-based and time-series tasks. An extensive experiment was carried out using three major classifiers (SVM, MLP and Random Forest) on a publicly available TESS (Toronto English Speech Sentence) dataset. The result of our proposed model (Attention-based DCNN+RNCA+RF) achieved 97.8% classification accuracy and yielded a 3.27% improved performance, which outperforms state-of-the-art SEC approaches. Our model evaluation revealed the consistency of attention mechanism and feature selection with human behavioural patterns in classifying emotion from auditory speech.
format Online
Article
Text
id pubmed-10368662
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-103686622023-07-27 Speech emotion classification using attention based network and regularized feature selection Akinpelu, Samson Viriri, Serestina Sci Rep Article Speech emotion classification (SEC) has gained the utmost height and occupied a conspicuous position within the research community in recent times. Its vital role in Human–Computer Interaction (HCI) and affective computing cannot be overemphasized. Many primitive algorithmic solutions and deep neural network (DNN) models have been proposed for efficient recognition of emotion from speech however, the suitability of these methods to accurately classify emotion from speech with multi-lingual background and other factors that impede efficient classification of emotion is still demanding critical consideration. This study proposed an attention-based network with a pre-trained convolutional neural network and regularized neighbourhood component analysis (RNCA) feature selection techniques for improved classification of speech emotion. The attention model has proven to be successful in many sequence-based and time-series tasks. An extensive experiment was carried out using three major classifiers (SVM, MLP and Random Forest) on a publicly available TESS (Toronto English Speech Sentence) dataset. The result of our proposed model (Attention-based DCNN+RNCA+RF) achieved 97.8% classification accuracy and yielded a 3.27% improved performance, which outperforms state-of-the-art SEC approaches. Our model evaluation revealed the consistency of attention mechanism and feature selection with human behavioural patterns in classifying emotion from auditory speech. Nature Publishing Group UK 2023-07-25 /pmc/articles/PMC10368662/ /pubmed/37491423 http://dx.doi.org/10.1038/s41598-023-38868-2 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Akinpelu, Samson
Viriri, Serestina
Speech emotion classification using attention based network and regularized feature selection
title Speech emotion classification using attention based network and regularized feature selection
title_full Speech emotion classification using attention based network and regularized feature selection
title_fullStr Speech emotion classification using attention based network and regularized feature selection
title_full_unstemmed Speech emotion classification using attention based network and regularized feature selection
title_short Speech emotion classification using attention based network and regularized feature selection
title_sort speech emotion classification using attention based network and regularized feature selection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368662/
https://www.ncbi.nlm.nih.gov/pubmed/37491423
http://dx.doi.org/10.1038/s41598-023-38868-2
work_keys_str_mv AT akinpelusamson speechemotionclassificationusingattentionbasednetworkandregularizedfeatureselection
AT viririserestina speechemotionclassificationusingattentionbasednetworkandregularizedfeatureselection