Cargando…

Classifier Subset Selection for the Stacked Generalization Method Applied to Emotion Recognition in Speech

In this paper, a new supervised classification paradigm, called classifier subset selection for stacked generalization (CSS stacking), is presented to deal with speech emotion recognition. The new approach consists of an improvement of a bi-level multi-classifier system known as stacking generalizat...

Descripción completa

Detalles Bibliográficos
Autores principales:	Álvarez, Aitor, Sierra, Basilio, Arruti, Andoni, López-Gil, Juan-Miguel, Garay-Vitoria, Nestor
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2015
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4732054/ https://www.ncbi.nlm.nih.gov/pubmed/26712757 http://dx.doi.org/10.3390/s16010021

_version_	1782412642955034624
author	Álvarez, Aitor Sierra, Basilio Arruti, Andoni López-Gil, Juan-Miguel Garay-Vitoria, Nestor
author_facet	Álvarez, Aitor Sierra, Basilio Arruti, Andoni López-Gil, Juan-Miguel Garay-Vitoria, Nestor
author_sort	Álvarez, Aitor
collection	PubMed
description	In this paper, a new supervised classification paradigm, called classifier subset selection for stacked generalization (CSS stacking), is presented to deal with speech emotion recognition. The new approach consists of an improvement of a bi-level multi-classifier system known as stacking generalization by means of an integration of an estimation of distribution algorithm (EDA) in the first layer to select the optimal subset from the standard base classifiers. The good performance of the proposed new paradigm was demonstrated over different configurations and datasets. First, several CSS stacking classifiers were constructed on the RekEmozio dataset, using some specific standard base classifiers and a total of 123 spectral, quality and prosodic features computed using in-house feature extraction algorithms. These initial CSS stacking classifiers were compared to other multi-classifier systems and the employed standard classifiers built on the same set of speech features. Then, new CSS stacking classifiers were built on RekEmozio using a different set of both acoustic parameters (extended version of the Geneva Minimalistic Acoustic Parameter Set (eGeMAPS)) and standard classifiers and employing the best meta-classifier of the initial experiments. The performance of these two CSS stacking classifiers was evaluated and compared. Finally, the new paradigm was tested on the well-known Berlin Emotional Speech database. We compared the performance of single, standard stacking and CSS stacking systems using the same parametrization of the second phase. All of the classifications were performed at the categorical level, including the six primary emotions plus the neutral one.
format	Online Article Text
id	pubmed-4732054
institution	National Center for Biotechnology Information
language	English
publishDate	2015
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-47320542016-02-12 Classifier Subset Selection for the Stacked Generalization Method Applied to Emotion Recognition in Speech Álvarez, Aitor Sierra, Basilio Arruti, Andoni López-Gil, Juan-Miguel Garay-Vitoria, Nestor Sensors (Basel) Article In this paper, a new supervised classification paradigm, called classifier subset selection for stacked generalization (CSS stacking), is presented to deal with speech emotion recognition. The new approach consists of an improvement of a bi-level multi-classifier system known as stacking generalization by means of an integration of an estimation of distribution algorithm (EDA) in the first layer to select the optimal subset from the standard base classifiers. The good performance of the proposed new paradigm was demonstrated over different configurations and datasets. First, several CSS stacking classifiers were constructed on the RekEmozio dataset, using some specific standard base classifiers and a total of 123 spectral, quality and prosodic features computed using in-house feature extraction algorithms. These initial CSS stacking classifiers were compared to other multi-classifier systems and the employed standard classifiers built on the same set of speech features. Then, new CSS stacking classifiers were built on RekEmozio using a different set of both acoustic parameters (extended version of the Geneva Minimalistic Acoustic Parameter Set (eGeMAPS)) and standard classifiers and employing the best meta-classifier of the initial experiments. The performance of these two CSS stacking classifiers was evaluated and compared. Finally, the new paradigm was tested on the well-known Berlin Emotional Speech database. We compared the performance of single, standard stacking and CSS stacking systems using the same parametrization of the second phase. All of the classifications were performed at the categorical level, including the six primary emotions plus the neutral one. MDPI 2015-12-25 /pmc/articles/PMC4732054/ /pubmed/26712757 http://dx.doi.org/10.3390/s16010021 Text en © 2015 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons by Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Álvarez, Aitor Sierra, Basilio Arruti, Andoni López-Gil, Juan-Miguel Garay-Vitoria, Nestor Classifier Subset Selection for the Stacked Generalization Method Applied to Emotion Recognition in Speech
title	Classifier Subset Selection for the Stacked Generalization Method Applied to Emotion Recognition in Speech
title_full	Classifier Subset Selection for the Stacked Generalization Method Applied to Emotion Recognition in Speech
title_fullStr	Classifier Subset Selection for the Stacked Generalization Method Applied to Emotion Recognition in Speech
title_full_unstemmed	Classifier Subset Selection for the Stacked Generalization Method Applied to Emotion Recognition in Speech
title_short	Classifier Subset Selection for the Stacked Generalization Method Applied to Emotion Recognition in Speech
title_sort	classifier subset selection for the stacked generalization method applied to emotion recognition in speech
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4732054/ https://www.ncbi.nlm.nih.gov/pubmed/26712757 http://dx.doi.org/10.3390/s16010021
work_keys_str_mv	AT alvarezaitor classifiersubsetselectionforthestackedgeneralizationmethodappliedtoemotionrecognitioninspeech AT sierrabasilio classifiersubsetselectionforthestackedgeneralizationmethodappliedtoemotionrecognitioninspeech AT arrutiandoni classifiersubsetselectionforthestackedgeneralizationmethodappliedtoemotionrecognitioninspeech AT lopezgiljuanmiguel classifiersubsetselectionforthestackedgeneralizationmethodappliedtoemotionrecognitioninspeech AT garayvitorianestor classifiersubsetselectionforthestackedgeneralizationmethodappliedtoemotionrecognitioninspeech

Classifier Subset Selection for the Stacked Generalization Method Applied to Emotion Recognition in Speech

Ejemplares similares