Cargando…

Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations

The study of understanding sentiment and emotion in speech is a challenging task in human multimodal language. However, in certain cases, such as telephone calls, only audio data can be obtained. In this study, we independently evaluated sentiment analysis and emotion recognition from speech using r...

Descripción completa

Detalles Bibliográficos
Autores principales:	Atmaja, Bagus Tris, Sasou, Akira
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Communication
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9460459/ https://www.ncbi.nlm.nih.gov/pubmed/36080828 http://dx.doi.org/10.3390/s22176369

_version_	1784786752616005632
author	Atmaja, Bagus Tris Sasou, Akira
author_facet	Atmaja, Bagus Tris Sasou, Akira
author_sort	Atmaja, Bagus Tris
collection	PubMed
description	The study of understanding sentiment and emotion in speech is a challenging task in human multimodal language. However, in certain cases, such as telephone calls, only audio data can be obtained. In this study, we independently evaluated sentiment analysis and emotion recognition from speech using recent self-supervised learning models—specifically, universal speech representations with speaker-aware pre-training models. Three different sizes of universal models were evaluated for three sentiment tasks and an emotion task. The evaluation revealed that the best results were obtained with two classes of sentiment analysis, based on both weighted and unweighted accuracy scores (81% and 73%). This binary classification with unimodal acoustic analysis also performed competitively compared to previous methods which used multimodal fusion. The models failed to make accurate predictionsin an emotion recognition task and in sentiment analysis tasks with higher numbers of classes. The unbalanced property of the datasets may also have contributed to the performance degradations observed in the six-class emotion, three-class sentiment, and seven-class sentiment tasks.
format	Online Article Text
id	pubmed-9460459
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-94604592022-09-10 Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations Atmaja, Bagus Tris Sasou, Akira Sensors (Basel) Communication The study of understanding sentiment and emotion in speech is a challenging task in human multimodal language. However, in certain cases, such as telephone calls, only audio data can be obtained. In this study, we independently evaluated sentiment analysis and emotion recognition from speech using recent self-supervised learning models—specifically, universal speech representations with speaker-aware pre-training models. Three different sizes of universal models were evaluated for three sentiment tasks and an emotion task. The evaluation revealed that the best results were obtained with two classes of sentiment analysis, based on both weighted and unweighted accuracy scores (81% and 73%). This binary classification with unimodal acoustic analysis also performed competitively compared to previous methods which used multimodal fusion. The models failed to make accurate predictionsin an emotion recognition task and in sentiment analysis tasks with higher numbers of classes. The unbalanced property of the datasets may also have contributed to the performance degradations observed in the six-class emotion, three-class sentiment, and seven-class sentiment tasks. MDPI 2022-08-24 /pmc/articles/PMC9460459/ /pubmed/36080828 http://dx.doi.org/10.3390/s22176369 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Communication Atmaja, Bagus Tris Sasou, Akira Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations
title	Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations
title_full	Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations
title_fullStr	Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations
title_full_unstemmed	Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations
title_short	Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations
title_sort	sentiment analysis and emotion recognition from speech using universal speech representations
topic	Communication
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9460459/ https://www.ncbi.nlm.nih.gov/pubmed/36080828 http://dx.doi.org/10.3390/s22176369
work_keys_str_mv	AT atmajabagustris sentimentanalysisandemotionrecognitionfromspeechusinguniversalspeechrepresentations AT sasouakira sentimentanalysisandemotionrecognitionfromspeechusinguniversalspeechrepresentations

Sentiment Analysis and Emotion Recognition from Speech Using Universal Speech Representations

Ejemplares similares