Cargando…

Automatic Evaluation of Voice Quality Using Text-Based Laryngograph Measurements and Prosodic Analysis

Due to low intra- and interrater reliability, perceptual voice evaluation should be supported by objective, automatic methods. In this study, text-based, computer-aided prosodic analysis and measurements of connected speech were combined in order to model perceptual evaluation of the German Roughnes...

Descripción completa

Detalles Bibliográficos
Autores principales: Haderlein, Tino, Schwemmle, Cornelia, Döllinger, Michael, Matoušek, Václav, Ptok, Martin, Nöth, Elmar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4468283/
https://www.ncbi.nlm.nih.gov/pubmed/26136813
http://dx.doi.org/10.1155/2015/316325
_version_ 1782376482392244224
author Haderlein, Tino
Schwemmle, Cornelia
Döllinger, Michael
Matoušek, Václav
Ptok, Martin
Nöth, Elmar
author_facet Haderlein, Tino
Schwemmle, Cornelia
Döllinger, Michael
Matoušek, Václav
Ptok, Martin
Nöth, Elmar
author_sort Haderlein, Tino
collection PubMed
description Due to low intra- and interrater reliability, perceptual voice evaluation should be supported by objective, automatic methods. In this study, text-based, computer-aided prosodic analysis and measurements of connected speech were combined in order to model perceptual evaluation of the German Roughness-Breathiness-Hoarseness (RBH) scheme. 58 connected speech samples (43 women and 15 men; 48.7 ± 17.8 years) containing the German version of the text “The North Wind and the Sun” were evaluated perceptually by 19 speech and voice therapy students according to the RBH scale. For the human-machine correlation, Support Vector Regression with measurements of the vocal fold cycle irregularities (CFx) and the closed phases of vocal fold vibration (CQx) of the Laryngograph and 33 features from a prosodic analysis module were used to model the listeners' ratings. The best human-machine results for roughness were obtained from a combination of six prosodic features and CFx (r = 0.71, ρ = 0.57). These correlations were approximately the same as the interrater agreement among human raters (r = 0.65, ρ = 0.61). CQx was one of the substantial features of the hoarseness model. For hoarseness and breathiness, the human-machine agreement was substantially lower. Nevertheless, the automatic analysis method can serve as the basis for a meaningful objective support for perceptual analysis.
format Online
Article
Text
id pubmed-4468283
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-44682832015-07-01 Automatic Evaluation of Voice Quality Using Text-Based Laryngograph Measurements and Prosodic Analysis Haderlein, Tino Schwemmle, Cornelia Döllinger, Michael Matoušek, Václav Ptok, Martin Nöth, Elmar Comput Math Methods Med Research Article Due to low intra- and interrater reliability, perceptual voice evaluation should be supported by objective, automatic methods. In this study, text-based, computer-aided prosodic analysis and measurements of connected speech were combined in order to model perceptual evaluation of the German Roughness-Breathiness-Hoarseness (RBH) scheme. 58 connected speech samples (43 women and 15 men; 48.7 ± 17.8 years) containing the German version of the text “The North Wind and the Sun” were evaluated perceptually by 19 speech and voice therapy students according to the RBH scale. For the human-machine correlation, Support Vector Regression with measurements of the vocal fold cycle irregularities (CFx) and the closed phases of vocal fold vibration (CQx) of the Laryngograph and 33 features from a prosodic analysis module were used to model the listeners' ratings. The best human-machine results for roughness were obtained from a combination of six prosodic features and CFx (r = 0.71, ρ = 0.57). These correlations were approximately the same as the interrater agreement among human raters (r = 0.65, ρ = 0.61). CQx was one of the substantial features of the hoarseness model. For hoarseness and breathiness, the human-machine agreement was substantially lower. Nevertheless, the automatic analysis method can serve as the basis for a meaningful objective support for perceptual analysis. Hindawi Publishing Corporation 2015 2015-06-02 /pmc/articles/PMC4468283/ /pubmed/26136813 http://dx.doi.org/10.1155/2015/316325 Text en Copyright © 2015 Tino Haderlein et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Haderlein, Tino
Schwemmle, Cornelia
Döllinger, Michael
Matoušek, Václav
Ptok, Martin
Nöth, Elmar
Automatic Evaluation of Voice Quality Using Text-Based Laryngograph Measurements and Prosodic Analysis
title Automatic Evaluation of Voice Quality Using Text-Based Laryngograph Measurements and Prosodic Analysis
title_full Automatic Evaluation of Voice Quality Using Text-Based Laryngograph Measurements and Prosodic Analysis
title_fullStr Automatic Evaluation of Voice Quality Using Text-Based Laryngograph Measurements and Prosodic Analysis
title_full_unstemmed Automatic Evaluation of Voice Quality Using Text-Based Laryngograph Measurements and Prosodic Analysis
title_short Automatic Evaluation of Voice Quality Using Text-Based Laryngograph Measurements and Prosodic Analysis
title_sort automatic evaluation of voice quality using text-based laryngograph measurements and prosodic analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4468283/
https://www.ncbi.nlm.nih.gov/pubmed/26136813
http://dx.doi.org/10.1155/2015/316325
work_keys_str_mv AT haderleintino automaticevaluationofvoicequalityusingtextbasedlaryngographmeasurementsandprosodicanalysis
AT schwemmlecornelia automaticevaluationofvoicequalityusingtextbasedlaryngographmeasurementsandprosodicanalysis
AT dollingermichael automaticevaluationofvoicequalityusingtextbasedlaryngographmeasurementsandprosodicanalysis
AT matousekvaclav automaticevaluationofvoicequalityusingtextbasedlaryngographmeasurementsandprosodicanalysis
AT ptokmartin automaticevaluationofvoicequalityusingtextbasedlaryngographmeasurementsandprosodicanalysis
AT nothelmar automaticevaluationofvoicequalityusingtextbasedlaryngographmeasurementsandprosodicanalysis