Cargando…

Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations

The study of understanding sentiment and emotion in speech is a challenging task in human multimodal language. However, in certain cases, such as telephone calls, only audio data can be obtained. In this study, we independently evaluated sentiment analysis and emotion recognition from speech using r...

Descripción completa

Detalles Bibliográficos
Autores principales: Atmaja, Bagus Tris, Sasou, Akira
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9460459/
https://www.ncbi.nlm.nih.gov/pubmed/36080828
http://dx.doi.org/10.3390/s22176369
_version_ 1784786752616005632
author Atmaja, Bagus Tris
Sasou, Akira
author_facet Atmaja, Bagus Tris
Sasou, Akira
author_sort Atmaja, Bagus Tris
collection PubMed
description The study of understanding sentiment and emotion in speech is a challenging task in human multimodal language. However, in certain cases, such as telephone calls, only audio data can be obtained. In this study, we independently evaluated sentiment analysis and emotion recognition from speech using recent self-supervised learning models—specifically, universal speech representations with speaker-aware pre-training models. Three different sizes of universal models were evaluated for three sentiment tasks and an emotion task. The evaluation revealed that the best results were obtained with two classes of sentiment analysis, based on both weighted and unweighted accuracy scores (81% and 73%). This binary classification with unimodal acoustic analysis also performed competitively compared to previous methods which used multimodal fusion. The models failed to make accurate predictionsin an emotion recognition task and in sentiment analysis tasks with higher numbers of classes. The unbalanced property of the datasets may also have contributed to the performance degradations observed in the six-class emotion, three-class sentiment, and seven-class sentiment tasks.
format Online
Article
Text
id pubmed-9460459
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-94604592022-09-10 Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations Atmaja, Bagus Tris Sasou, Akira Sensors (Basel) Communication The study of understanding sentiment and emotion in speech is a challenging task in human multimodal language. However, in certain cases, such as telephone calls, only audio data can be obtained. In this study, we independently evaluated sentiment analysis and emotion recognition from speech using recent self-supervised learning models—specifically, universal speech representations with speaker-aware pre-training models. Three different sizes of universal models were evaluated for three sentiment tasks and an emotion task. The evaluation revealed that the best results were obtained with two classes of sentiment analysis, based on both weighted and unweighted accuracy scores (81% and 73%). This binary classification with unimodal acoustic analysis also performed competitively compared to previous methods which used multimodal fusion. The models failed to make accurate predictionsin an emotion recognition task and in sentiment analysis tasks with higher numbers of classes. The unbalanced property of the datasets may also have contributed to the performance degradations observed in the six-class emotion, three-class sentiment, and seven-class sentiment tasks. MDPI 2022-08-24 /pmc/articles/PMC9460459/ /pubmed/36080828 http://dx.doi.org/10.3390/s22176369 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Communication
Atmaja, Bagus Tris
Sasou, Akira
Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations
title Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations
title_full Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations
title_fullStr Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations
title_full_unstemmed Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations
title_short Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations
title_sort sentiment analysis and emotion recognition from speech using universal speech representations
topic Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9460459/
https://www.ncbi.nlm.nih.gov/pubmed/36080828
http://dx.doi.org/10.3390/s22176369
work_keys_str_mv AT atmajabagustris sentimentanalysisandemotionrecognitionfromspeechusinguniversalspeechrepresentations
AT sasouakira sentimentanalysisandemotionrecognitionfromspeechusinguniversalspeechrepresentations