Cargando…

Application of Poincare-Mapping of Voiced-Speech Segments for Emotion Sensing

The following paper introduces a group of novel speech-signal descriptors that reflect phoneme-pronunciation variability and that can be considered as potentially useful features for emotion sensing. The proposed group includes a set of statistical parameters of Poincare maps, derived for formant-fr...

Descripción completa

Detalles Bibliográficos
Autores principales: Ślot, Krzysztof, Bronakowski, Łukasz, Cichosz, Jaroslaw, Kim, Hyongsuk
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Molecular Diversity Preservation International (MDPI) 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3267199/
https://www.ncbi.nlm.nih.gov/pubmed/22303151
http://dx.doi.org/10.3390/s91209858
_version_ 1782222258471698432
author Ślot, Krzysztof
Bronakowski, Łukasz
Cichosz, Jaroslaw
Kim, Hyongsuk
author_facet Ślot, Krzysztof
Bronakowski, Łukasz
Cichosz, Jaroslaw
Kim, Hyongsuk
author_sort Ślot, Krzysztof
collection PubMed
description The following paper introduces a group of novel speech-signal descriptors that reflect phoneme-pronunciation variability and that can be considered as potentially useful features for emotion sensing. The proposed group includes a set of statistical parameters of Poincare maps, derived for formant-frequency evolution and energy evolution of voiced-speech segments. Two groups of Poincare-map characteristics were considered in the research: descriptors of sample-scatter, which reflect magnitudes of phone-uttering variations and descriptors of cross-correlations that exist among samples and that evaluate consistency of variations. It has been shown that inclusion of the proposed characteristics into the pool of commonly used speech descriptors, results in a noticeable increase—at the level of 10%—in emotion sensing performance. Standard pattern recognition methodology has been adopted for evaluation of the proposed descriptors, with the assumption that three- or four-dimensional feature spaces can provide sufficient emotion sensing. Binary decision trees have been selected for data classification, as they provide with detailed information on emotion-specific discriminative power of various speech descriptors.
format Online
Article
Text
id pubmed-3267199
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Molecular Diversity Preservation International (MDPI)
record_format MEDLINE/PubMed
spelling pubmed-32671992012-02-02 Application of Poincare-Mapping of Voiced-Speech Segments for Emotion Sensing Ślot, Krzysztof Bronakowski, Łukasz Cichosz, Jaroslaw Kim, Hyongsuk Sensors (Basel) Article The following paper introduces a group of novel speech-signal descriptors that reflect phoneme-pronunciation variability and that can be considered as potentially useful features for emotion sensing. The proposed group includes a set of statistical parameters of Poincare maps, derived for formant-frequency evolution and energy evolution of voiced-speech segments. Two groups of Poincare-map characteristics were considered in the research: descriptors of sample-scatter, which reflect magnitudes of phone-uttering variations and descriptors of cross-correlations that exist among samples and that evaluate consistency of variations. It has been shown that inclusion of the proposed characteristics into the pool of commonly used speech descriptors, results in a noticeable increase—at the level of 10%—in emotion sensing performance. Standard pattern recognition methodology has been adopted for evaluation of the proposed descriptors, with the assumption that three- or four-dimensional feature spaces can provide sufficient emotion sensing. Binary decision trees have been selected for data classification, as they provide with detailed information on emotion-specific discriminative power of various speech descriptors. Molecular Diversity Preservation International (MDPI) 2009-12-03 /pmc/articles/PMC3267199/ /pubmed/22303151 http://dx.doi.org/10.3390/s91209858 Text en © 2009 by the authors; licensee Molecular Diversity Preservation International, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Article
Ślot, Krzysztof
Bronakowski, Łukasz
Cichosz, Jaroslaw
Kim, Hyongsuk
Application of Poincare-Mapping of Voiced-Speech Segments for Emotion Sensing
title Application of Poincare-Mapping of Voiced-Speech Segments for Emotion Sensing
title_full Application of Poincare-Mapping of Voiced-Speech Segments for Emotion Sensing
title_fullStr Application of Poincare-Mapping of Voiced-Speech Segments for Emotion Sensing
title_full_unstemmed Application of Poincare-Mapping of Voiced-Speech Segments for Emotion Sensing
title_short Application of Poincare-Mapping of Voiced-Speech Segments for Emotion Sensing
title_sort application of poincare-mapping of voiced-speech segments for emotion sensing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3267199/
https://www.ncbi.nlm.nih.gov/pubmed/22303151
http://dx.doi.org/10.3390/s91209858
work_keys_str_mv AT slotkrzysztof applicationofpoincaremappingofvoicedspeechsegmentsforemotionsensing
AT bronakowskiłukasz applicationofpoincaremappingofvoicedspeechsegmentsforemotionsensing
AT cichoszjaroslaw applicationofpoincaremappingofvoicedspeechsegmentsforemotionsensing
AT kimhyongsuk applicationofpoincaremappingofvoicedspeechsegmentsforemotionsensing