Cargando…

Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments

Personalized emotion recognition provides an individual training model for each target user in order to mitigate the accuracy problem when using general training models collected from multiple users. Existing personalized speech emotion recognition research has a cold-start problem that requires a l...

Descripción completa

Detalles Bibliográficos
Autores principales:	Bang, Jaehun, Hur, Taeho, Kim, Dohyeong, Huynh-The, Thien, Lee, Jongwon, Han, Yongkoo, Banos, Oresti, Kim, Jee-In, Lee, Sungyoung
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2018
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6264012/ https://www.ncbi.nlm.nih.gov/pubmed/30400224 http://dx.doi.org/10.3390/s18113744

_version_	1783375397944033280
author	Bang, Jaehun Hur, Taeho Kim, Dohyeong Huynh-The, Thien Lee, Jongwon Han, Yongkoo Banos, Oresti Kim, Jee-In Lee, Sungyoung
author_facet	Bang, Jaehun Hur, Taeho Kim, Dohyeong Huynh-The, Thien Lee, Jongwon Han, Yongkoo Banos, Oresti Kim, Jee-In Lee, Sungyoung
author_sort	Bang, Jaehun
collection	PubMed
description	Personalized emotion recognition provides an individual training model for each target user in order to mitigate the accuracy problem when using general training models collected from multiple users. Existing personalized speech emotion recognition research has a cold-start problem that requires a large amount of emotionally-balanced data samples from the target user when creating the personalized training model. Such research is difficult to apply in real environments due to the difficulty of collecting numerous target user speech data with emotionally-balanced label samples. Therefore, we propose the Robust Personalized Emotion Recognition Framework with the Adaptive Data Boosting Algorithm to solve the cold-start problem. The proposed framework incrementally provides a customized training model for the target user by reinforcing the dataset by combining the acquired target user speech with speech from other users, followed by applying SMOTE (Synthetic Minority Over-sampling Technique)-based data augmentation. The proposed method proved to be adaptive across a small number of target user datasets and emotionally-imbalanced data environments through iterative experiments using the IEMOCAP (Interactive Emotional Dyadic Motion Capture) database.
format	Online Article Text
id	pubmed-6264012
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-62640122018-12-12 Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments Bang, Jaehun Hur, Taeho Kim, Dohyeong Huynh-The, Thien Lee, Jongwon Han, Yongkoo Banos, Oresti Kim, Jee-In Lee, Sungyoung Sensors (Basel) Article Personalized emotion recognition provides an individual training model for each target user in order to mitigate the accuracy problem when using general training models collected from multiple users. Existing personalized speech emotion recognition research has a cold-start problem that requires a large amount of emotionally-balanced data samples from the target user when creating the personalized training model. Such research is difficult to apply in real environments due to the difficulty of collecting numerous target user speech data with emotionally-balanced label samples. Therefore, we propose the Robust Personalized Emotion Recognition Framework with the Adaptive Data Boosting Algorithm to solve the cold-start problem. The proposed framework incrementally provides a customized training model for the target user by reinforcing the dataset by combining the acquired target user speech with speech from other users, followed by applying SMOTE (Synthetic Minority Over-sampling Technique)-based data augmentation. The proposed method proved to be adaptive across a small number of target user datasets and emotionally-imbalanced data environments through iterative experiments using the IEMOCAP (Interactive Emotional Dyadic Motion Capture) database. MDPI 2018-11-02 /pmc/articles/PMC6264012/ /pubmed/30400224 http://dx.doi.org/10.3390/s18113744 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Bang, Jaehun Hur, Taeho Kim, Dohyeong Huynh-The, Thien Lee, Jongwon Han, Yongkoo Banos, Oresti Kim, Jee-In Lee, Sungyoung Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments
title	Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments
title_full	Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments
title_fullStr	Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments
title_full_unstemmed	Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments
title_short	Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments
title_sort	adaptive data boosting technique for robust personalized speech emotion in emotionally-imbalanced small-sample environments
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6264012/ https://www.ncbi.nlm.nih.gov/pubmed/30400224 http://dx.doi.org/10.3390/s18113744
work_keys_str_mv	AT bangjaehun adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments AT hurtaeho adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments AT kimdohyeong adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments AT huynhthethien adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments AT leejongwon adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments AT hanyongkoo adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments AT banosoresti adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments AT kimjeein adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments AT leesungyoung adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments

Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments

Ejemplares similares