Cargando…
Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments
Personalized emotion recognition provides an individual training model for each target user in order to mitigate the accuracy problem when using general training models collected from multiple users. Existing personalized speech emotion recognition research has a cold-start problem that requires a l...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6264012/ https://www.ncbi.nlm.nih.gov/pubmed/30400224 http://dx.doi.org/10.3390/s18113744 |
_version_ | 1783375397944033280 |
---|---|
author | Bang, Jaehun Hur, Taeho Kim, Dohyeong Huynh-The, Thien Lee, Jongwon Han, Yongkoo Banos, Oresti Kim, Jee-In Lee, Sungyoung |
author_facet | Bang, Jaehun Hur, Taeho Kim, Dohyeong Huynh-The, Thien Lee, Jongwon Han, Yongkoo Banos, Oresti Kim, Jee-In Lee, Sungyoung |
author_sort | Bang, Jaehun |
collection | PubMed |
description | Personalized emotion recognition provides an individual training model for each target user in order to mitigate the accuracy problem when using general training models collected from multiple users. Existing personalized speech emotion recognition research has a cold-start problem that requires a large amount of emotionally-balanced data samples from the target user when creating the personalized training model. Such research is difficult to apply in real environments due to the difficulty of collecting numerous target user speech data with emotionally-balanced label samples. Therefore, we propose the Robust Personalized Emotion Recognition Framework with the Adaptive Data Boosting Algorithm to solve the cold-start problem. The proposed framework incrementally provides a customized training model for the target user by reinforcing the dataset by combining the acquired target user speech with speech from other users, followed by applying SMOTE (Synthetic Minority Over-sampling Technique)-based data augmentation. The proposed method proved to be adaptive across a small number of target user datasets and emotionally-imbalanced data environments through iterative experiments using the IEMOCAP (Interactive Emotional Dyadic Motion Capture) database. |
format | Online Article Text |
id | pubmed-6264012 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-62640122018-12-12 Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments Bang, Jaehun Hur, Taeho Kim, Dohyeong Huynh-The, Thien Lee, Jongwon Han, Yongkoo Banos, Oresti Kim, Jee-In Lee, Sungyoung Sensors (Basel) Article Personalized emotion recognition provides an individual training model for each target user in order to mitigate the accuracy problem when using general training models collected from multiple users. Existing personalized speech emotion recognition research has a cold-start problem that requires a large amount of emotionally-balanced data samples from the target user when creating the personalized training model. Such research is difficult to apply in real environments due to the difficulty of collecting numerous target user speech data with emotionally-balanced label samples. Therefore, we propose the Robust Personalized Emotion Recognition Framework with the Adaptive Data Boosting Algorithm to solve the cold-start problem. The proposed framework incrementally provides a customized training model for the target user by reinforcing the dataset by combining the acquired target user speech with speech from other users, followed by applying SMOTE (Synthetic Minority Over-sampling Technique)-based data augmentation. The proposed method proved to be adaptive across a small number of target user datasets and emotionally-imbalanced data environments through iterative experiments using the IEMOCAP (Interactive Emotional Dyadic Motion Capture) database. MDPI 2018-11-02 /pmc/articles/PMC6264012/ /pubmed/30400224 http://dx.doi.org/10.3390/s18113744 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Bang, Jaehun Hur, Taeho Kim, Dohyeong Huynh-The, Thien Lee, Jongwon Han, Yongkoo Banos, Oresti Kim, Jee-In Lee, Sungyoung Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments |
title | Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments |
title_full | Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments |
title_fullStr | Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments |
title_full_unstemmed | Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments |
title_short | Adaptive Data Boosting Technique for Robust Personalized Speech Emotion in Emotionally-Imbalanced Small-Sample Environments |
title_sort | adaptive data boosting technique for robust personalized speech emotion in emotionally-imbalanced small-sample environments |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6264012/ https://www.ncbi.nlm.nih.gov/pubmed/30400224 http://dx.doi.org/10.3390/s18113744 |
work_keys_str_mv | AT bangjaehun adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments AT hurtaeho adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments AT kimdohyeong adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments AT huynhthethien adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments AT leejongwon adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments AT hanyongkoo adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments AT banosoresti adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments AT kimjeein adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments AT leesungyoung adaptivedataboostingtechniqueforrobustpersonalizedspeechemotioninemotionallyimbalancedsmallsampleenvironments |