Cargando…

Deep Learning Techniques for Speech Emotion Recognition, from Databases to Models

The advancements in neural networks and the on-demand need for accurate and near real-time Speech Emotion Recognition (SER) in human–computer interactions make it mandatory to compare available methods and databases in SER to achieve feasible solutions and a firmer understanding of this open-ended p...

Descripción completa

Detalles Bibliográficos
Autores principales: Abbaschian, Babak Joze, Sierra-Sosa, Daniel, Elmaghraby, Adel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7916477/
https://www.ncbi.nlm.nih.gov/pubmed/33578714
http://dx.doi.org/10.3390/s21041249
_version_ 1783657485900447744
author Abbaschian, Babak Joze
Sierra-Sosa, Daniel
Elmaghraby, Adel
author_facet Abbaschian, Babak Joze
Sierra-Sosa, Daniel
Elmaghraby, Adel
author_sort Abbaschian, Babak Joze
collection PubMed
description The advancements in neural networks and the on-demand need for accurate and near real-time Speech Emotion Recognition (SER) in human–computer interactions make it mandatory to compare available methods and databases in SER to achieve feasible solutions and a firmer understanding of this open-ended problem. The current study reviews deep learning approaches for SER with available datasets, followed by conventional machine learning techniques for speech emotion recognition. Ultimately, we present a multi-aspect comparison between practical neural network approaches in speech emotion recognition. The goal of this study is to provide a survey of the field of discrete speech emotion recognition.
format Online
Article
Text
id pubmed-7916477
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-79164772021-03-01 Deep Learning Techniques for Speech Emotion Recognition, from Databases to Models Abbaschian, Babak Joze Sierra-Sosa, Daniel Elmaghraby, Adel Sensors (Basel) Review The advancements in neural networks and the on-demand need for accurate and near real-time Speech Emotion Recognition (SER) in human–computer interactions make it mandatory to compare available methods and databases in SER to achieve feasible solutions and a firmer understanding of this open-ended problem. The current study reviews deep learning approaches for SER with available datasets, followed by conventional machine learning techniques for speech emotion recognition. Ultimately, we present a multi-aspect comparison between practical neural network approaches in speech emotion recognition. The goal of this study is to provide a survey of the field of discrete speech emotion recognition. MDPI 2021-02-10 /pmc/articles/PMC7916477/ /pubmed/33578714 http://dx.doi.org/10.3390/s21041249 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Review
Abbaschian, Babak Joze
Sierra-Sosa, Daniel
Elmaghraby, Adel
Deep Learning Techniques for Speech Emotion Recognition, from Databases to Models
title Deep Learning Techniques for Speech Emotion Recognition, from Databases to Models
title_full Deep Learning Techniques for Speech Emotion Recognition, from Databases to Models
title_fullStr Deep Learning Techniques for Speech Emotion Recognition, from Databases to Models
title_full_unstemmed Deep Learning Techniques for Speech Emotion Recognition, from Databases to Models
title_short Deep Learning Techniques for Speech Emotion Recognition, from Databases to Models
title_sort deep learning techniques for speech emotion recognition, from databases to models
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7916477/
https://www.ncbi.nlm.nih.gov/pubmed/33578714
http://dx.doi.org/10.3390/s21041249
work_keys_str_mv AT abbaschianbabakjoze deeplearningtechniquesforspeechemotionrecognitionfromdatabasestomodels
AT sierrasosadaniel deeplearningtechniquesforspeechemotionrecognitionfromdatabasestomodels
AT elmaghrabyadel deeplearningtechniquesforspeechemotionrecognitionfromdatabasestomodels