Cargando…
Speech emotion classification using attention based network and regularized feature selection
Speech emotion classification (SEC) has gained the utmost height and occupied a conspicuous position within the research community in recent times. Its vital role in Human–Computer Interaction (HCI) and affective computing cannot be overemphasized. Many primitive algorithmic solutions and deep neura...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368662/ https://www.ncbi.nlm.nih.gov/pubmed/37491423 http://dx.doi.org/10.1038/s41598-023-38868-2 |
_version_ | 1785077551651094528 |
---|---|
author | Akinpelu, Samson Viriri, Serestina |
author_facet | Akinpelu, Samson Viriri, Serestina |
author_sort | Akinpelu, Samson |
collection | PubMed |
description | Speech emotion classification (SEC) has gained the utmost height and occupied a conspicuous position within the research community in recent times. Its vital role in Human–Computer Interaction (HCI) and affective computing cannot be overemphasized. Many primitive algorithmic solutions and deep neural network (DNN) models have been proposed for efficient recognition of emotion from speech however, the suitability of these methods to accurately classify emotion from speech with multi-lingual background and other factors that impede efficient classification of emotion is still demanding critical consideration. This study proposed an attention-based network with a pre-trained convolutional neural network and regularized neighbourhood component analysis (RNCA) feature selection techniques for improved classification of speech emotion. The attention model has proven to be successful in many sequence-based and time-series tasks. An extensive experiment was carried out using three major classifiers (SVM, MLP and Random Forest) on a publicly available TESS (Toronto English Speech Sentence) dataset. The result of our proposed model (Attention-based DCNN+RNCA+RF) achieved 97.8% classification accuracy and yielded a 3.27% improved performance, which outperforms state-of-the-art SEC approaches. Our model evaluation revealed the consistency of attention mechanism and feature selection with human behavioural patterns in classifying emotion from auditory speech. |
format | Online Article Text |
id | pubmed-10368662 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-103686622023-07-27 Speech emotion classification using attention based network and regularized feature selection Akinpelu, Samson Viriri, Serestina Sci Rep Article Speech emotion classification (SEC) has gained the utmost height and occupied a conspicuous position within the research community in recent times. Its vital role in Human–Computer Interaction (HCI) and affective computing cannot be overemphasized. Many primitive algorithmic solutions and deep neural network (DNN) models have been proposed for efficient recognition of emotion from speech however, the suitability of these methods to accurately classify emotion from speech with multi-lingual background and other factors that impede efficient classification of emotion is still demanding critical consideration. This study proposed an attention-based network with a pre-trained convolutional neural network and regularized neighbourhood component analysis (RNCA) feature selection techniques for improved classification of speech emotion. The attention model has proven to be successful in many sequence-based and time-series tasks. An extensive experiment was carried out using three major classifiers (SVM, MLP and Random Forest) on a publicly available TESS (Toronto English Speech Sentence) dataset. The result of our proposed model (Attention-based DCNN+RNCA+RF) achieved 97.8% classification accuracy and yielded a 3.27% improved performance, which outperforms state-of-the-art SEC approaches. Our model evaluation revealed the consistency of attention mechanism and feature selection with human behavioural patterns in classifying emotion from auditory speech. Nature Publishing Group UK 2023-07-25 /pmc/articles/PMC10368662/ /pubmed/37491423 http://dx.doi.org/10.1038/s41598-023-38868-2 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Akinpelu, Samson Viriri, Serestina Speech emotion classification using attention based network and regularized feature selection |
title | Speech emotion classification using attention based network and regularized feature selection |
title_full | Speech emotion classification using attention based network and regularized feature selection |
title_fullStr | Speech emotion classification using attention based network and regularized feature selection |
title_full_unstemmed | Speech emotion classification using attention based network and regularized feature selection |
title_short | Speech emotion classification using attention based network and regularized feature selection |
title_sort | speech emotion classification using attention based network and regularized feature selection |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10368662/ https://www.ncbi.nlm.nih.gov/pubmed/37491423 http://dx.doi.org/10.1038/s41598-023-38868-2 |
work_keys_str_mv | AT akinpelusamson speechemotionclassificationusingattentionbasednetworkandregularizedfeatureselection AT viririserestina speechemotionclassificationusingattentionbasednetworkandregularizedfeatureselection |