Cargando…

Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments

Personalized emotion recognition provides an individual training model for each target user in order to mitigate the accuracy problem when using general training models collected from multiple users. Existing personalized speech emotion recognition research has a cold-start problem that requires a l...

Descripción completa

Detalles Bibliográficos
Autores principales: Bang, Jaehun, Hur, Taeho, Kim, Dohyeong, Huynh-The, Thien, Lee, Jongwon, Han, Yongkoo, Banos, Oresti, Kim, Jee-In, Lee, Sungyoung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6264012/
https://www.ncbi.nlm.nih.gov/pubmed/30400224
http://dx.doi.org/10.3390/s18113744
_version_ 1783375397944033280
author Bang, Jaehun
Hur, Taeho
Kim, Dohyeong
Huynh-The, Thien
Lee, Jongwon
Han, Yongkoo
Banos, Oresti
Kim, Jee-In
Lee, Sungyoung
author_facet Bang, Jaehun
Hur, Taeho
Kim, Dohyeong
Huynh-The, Thien
Lee, Jongwon
Han, Yongkoo
Banos, Oresti
Kim, Jee-In
Lee, Sungyoung
author_sort Bang, Jaehun
collection PubMed
description Personalized emotion recognition provides an individual training model for each target user in order to mitigate the accuracy problem when using general training models collected from multiple users. Existing personalized speech emotion recognition research has a cold-start problem that requires a large amount of emotionally-balanced data samples from the target user when creating the personalized training model. Such research is difficult to apply in real environments due to the difficulty of collecting numerous target user speech data with emotionally-balanced label samples. Therefore, we propose the Robust Personalized Emotion Recognition Framework with the Adaptive Data Boosting Algorithm to solve the cold-start problem. The proposed framework incrementally provides a customized training model for the target user by reinforcing the dataset by combining the acquired target user speech with speech from other users, followed by applying SMOTE (Synthetic Minority Over-sampling Technique)-based data augmentation. The proposed method proved to be adaptive across a small number of target user datasets and emotionally-imbalanced data environments through iterative experiments using the IEMOCAP (Interactive Emotional Dyadic Motion Capture) database.
format Online
Article
Text
id pubmed-6264012
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-62640122018-12-12 Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments Bang, Jaehun Hur, Taeho Kim, Dohyeong Huynh-The, Thien Lee, Jongwon Han, Yongkoo Banos, Oresti Kim, Jee-In Lee, Sungyoung Sensors (Basel) Article Personalized emotion recognition provides an individual training model for each target user in order to mitigate the accuracy problem when using general training models collected from multiple users. Existing personalized speech emotion recognition research has a cold-start problem that requires a large amount of emotionally-balanced data samples from the target user when creating the personalized training model. Such research is difficult to apply in real environments due to the difficulty of collecting numerous target user speech data with emotionally-balanced label samples. Therefore, we propose the Robust Personalized Emotion Recognition Framework with the Adaptive Data Boosting Algorithm to solve the cold-start problem. The proposed framework incrementally provides a customized training model for the target user by reinforcing the dataset by combining the acquired target user speech with speech from other users, followed by applying SMOTE (Synthetic Minority Over-sampling Technique)-based data augmentation. The proposed method proved to be adaptive across a small number of target user datasets and emotionally-imbalanced data environments through iterative experiments using the IEMOCAP (Interactive Emotional Dyadic Motion Capture) database. MDPI 2018-11-02 /pmc/articles/PMC6264012/ /pubmed/30400224 http://dx.doi.org/10.3390/s18113744 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Bang, Jaehun
Hur, Taeho
Kim, Dohyeong
Huynh-The, Thien
Lee, Jongwon
Han, Yongkoo
Banos, Oresti
Kim, Jee-In
Lee, Sungyoung
Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments
title Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments
title_full Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments
title_fullStr Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments
title_full_unstemmed Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments
title_short Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments
title_sort adaptive data boosting technique for robust personalized speech emotion in emotionally-imbalanced small-sample environments
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6264012/
https://www.ncbi.nlm.nih.gov/pubmed/30400224
http://dx.doi.org/10.3390/s18113744
work_keys_str_mv AT bangjaehun adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments
AT hurtaeho adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments
AT kimdohyeong adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments
AT huynhthethien adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments
AT leejongwon adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments
AT hanyongkoo adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments
AT banosoresti adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments
AT kimjeein adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments
AT leesungyoung adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments