Cargando…
Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations
The study of understanding sentiment and emotion in speech is a challenging task in human multimodal language. However, in certain cases, such as telephone calls, only audio data can be obtained. In this study, we independently evaluated sentiment analysis and emotion recognition from speech using r...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9460459/ https://www.ncbi.nlm.nih.gov/pubmed/36080828 http://dx.doi.org/10.3390/s22176369 |
_version_ | 1784786752616005632 |
---|---|
author | Atmaja, Bagus Tris Sasou, Akira |
author_facet | Atmaja, Bagus Tris Sasou, Akira |
author_sort | Atmaja, Bagus Tris |
collection | PubMed |
description | The study of understanding sentiment and emotion in speech is a challenging task in human multimodal language. However, in certain cases, such as telephone calls, only audio data can be obtained. In this study, we independently evaluated sentiment analysis and emotion recognition from speech using recent self-supervised learning models—specifically, universal speech representations with speaker-aware pre-training models. Three different sizes of universal models were evaluated for three sentiment tasks and an emotion task. The evaluation revealed that the best results were obtained with two classes of sentiment analysis, based on both weighted and unweighted accuracy scores (81% and 73%). This binary classification with unimodal acoustic analysis also performed competitively compared to previous methods which used multimodal fusion. The models failed to make accurate predictionsin an emotion recognition task and in sentiment analysis tasks with higher numbers of classes. The unbalanced property of the datasets may also have contributed to the performance degradations observed in the six-class emotion, three-class sentiment, and seven-class sentiment tasks. |
format | Online Article Text |
id | pubmed-9460459 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-94604592022-09-10 Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations Atmaja, Bagus Tris Sasou, Akira Sensors (Basel) Communication The study of understanding sentiment and emotion in speech is a challenging task in human multimodal language. However, in certain cases, such as telephone calls, only audio data can be obtained. In this study, we independently evaluated sentiment analysis and emotion recognition from speech using recent self-supervised learning models—specifically, universal speech representations with speaker-aware pre-training models. Three different sizes of universal models were evaluated for three sentiment tasks and an emotion task. The evaluation revealed that the best results were obtained with two classes of sentiment analysis, based on both weighted and unweighted accuracy scores (81% and 73%). This binary classification with unimodal acoustic analysis also performed competitively compared to previous methods which used multimodal fusion. The models failed to make accurate predictionsin an emotion recognition task and in sentiment analysis tasks with higher numbers of classes. The unbalanced property of the datasets may also have contributed to the performance degradations observed in the six-class emotion, three-class sentiment, and seven-class sentiment tasks. MDPI 2022-08-24 /pmc/articles/PMC9460459/ /pubmed/36080828 http://dx.doi.org/10.3390/s22176369 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Communication Atmaja, Bagus Tris Sasou, Akira Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations |
title | Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations |
title_full | Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations |
title_fullStr | Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations |
title_full_unstemmed | Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations |
title_short | Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations |
title_sort | sentiment analysis and emotion recognition from speech using universal speech representations |
topic | Communication |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9460459/ https://www.ncbi.nlm.nih.gov/pubmed/36080828 http://dx.doi.org/10.3390/s22176369 |
work_keys_str_mv | AT atmajabagustris sentimentanalysisandemotionrecognitionfromspeechusinguniversalspeechrepresentations AT sasouakira sentimentanalysisandemotionrecognitionfromspeechusinguniversalspeechrepresentations |