Cargando…
Measuring Speech Recognition With a Matrix Test Using Synthetic Speech
Speech audiometry is an essential part of audiological diagnostics and clinical measurements. Development times of speech recognition tests are rather long, depending on the size of speech corpus and optimization necessity. The aim of this study was to examine whether this development effort could b...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6643172/ https://www.ncbi.nlm.nih.gov/pubmed/31322032 http://dx.doi.org/10.1177/2331216519862982 |
_version_ | 1783437087765168128 |
---|---|
author | Nuesse, Theresa Wiercinski, Bianca Brand, Thomas Holube, Inga |
author_facet | Nuesse, Theresa Wiercinski, Bianca Brand, Thomas Holube, Inga |
author_sort | Nuesse, Theresa |
collection | PubMed |
description | Speech audiometry is an essential part of audiological diagnostics and clinical measurements. Development times of speech recognition tests are rather long, depending on the size of speech corpus and optimization necessity. The aim of this study was to examine whether this development effort could be reduced by using synthetic speech in speech audiometry, especially in a matrix test for speech recognition. For this purpose, the speech material of the German matrix test was replicated using a preselected commercial system to generate the synthetic speech files. In contrast to the conventional matrix test, no level adjustments or optimization tests were performed while producing the synthetic speech material. Evaluation measurements were conducted by presenting both versions of the German matrix test (with natural or synthetic speech), alternately and at three different signal-to-noise ratios, to 48 young, normal-hearing participants. Psychometric functions were fitted to the empirical data. Speech recognition thresholds were 0.5 dB signal-to-noise ratio higher (worse) for the synthetic speech, while slopes were equal for both speech types. Nevertheless, speech recognition scores were comparable with the literature and the threshold difference lay within the same range as recordings of two different natural speakers. Although no optimization was applied, the synthetic-speech signals led to equivalent recognition of the different test lists and word categories. The outcomes of this study indicate that the application of synthetic speech in speech recognition tests could considerably reduce the development costs and evaluation time. This offers the opportunity to increase the speech corpus for speech recognition tests with acceptable effort. |
format | Online Article Text |
id | pubmed-6643172 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-66431722019-07-31 Measuring Speech Recognition With a Matrix Test Using Synthetic Speech Nuesse, Theresa Wiercinski, Bianca Brand, Thomas Holube, Inga Trends Hear Original Article Speech audiometry is an essential part of audiological diagnostics and clinical measurements. Development times of speech recognition tests are rather long, depending on the size of speech corpus and optimization necessity. The aim of this study was to examine whether this development effort could be reduced by using synthetic speech in speech audiometry, especially in a matrix test for speech recognition. For this purpose, the speech material of the German matrix test was replicated using a preselected commercial system to generate the synthetic speech files. In contrast to the conventional matrix test, no level adjustments or optimization tests were performed while producing the synthetic speech material. Evaluation measurements were conducted by presenting both versions of the German matrix test (with natural or synthetic speech), alternately and at three different signal-to-noise ratios, to 48 young, normal-hearing participants. Psychometric functions were fitted to the empirical data. Speech recognition thresholds were 0.5 dB signal-to-noise ratio higher (worse) for the synthetic speech, while slopes were equal for both speech types. Nevertheless, speech recognition scores were comparable with the literature and the threshold difference lay within the same range as recordings of two different natural speakers. Although no optimization was applied, the synthetic-speech signals led to equivalent recognition of the different test lists and word categories. The outcomes of this study indicate that the application of synthetic speech in speech recognition tests could considerably reduce the development costs and evaluation time. This offers the opportunity to increase the speech corpus for speech recognition tests with acceptable effort. SAGE Publications 2019-07-19 /pmc/articles/PMC6643172/ /pubmed/31322032 http://dx.doi.org/10.1177/2331216519862982 Text en © The Author(s) 2019 http://creativecommons.org/licenses/by-nc/4.0/ Creative Commons Non Commercial CC BY-NC: This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (http://www.creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Original Article Nuesse, Theresa Wiercinski, Bianca Brand, Thomas Holube, Inga Measuring Speech Recognition With a Matrix Test Using Synthetic Speech |
title | Measuring Speech Recognition With a Matrix Test Using Synthetic
Speech |
title_full | Measuring Speech Recognition With a Matrix Test Using Synthetic
Speech |
title_fullStr | Measuring Speech Recognition With a Matrix Test Using Synthetic
Speech |
title_full_unstemmed | Measuring Speech Recognition With a Matrix Test Using Synthetic
Speech |
title_short | Measuring Speech Recognition With a Matrix Test Using Synthetic
Speech |
title_sort | measuring speech recognition with a matrix test using synthetic
speech |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6643172/ https://www.ncbi.nlm.nih.gov/pubmed/31322032 http://dx.doi.org/10.1177/2331216519862982 |
work_keys_str_mv | AT nuessetheresa measuringspeechrecognitionwithamatrixtestusingsyntheticspeech AT wiercinskibianca measuringspeechrecognitionwithamatrixtestusingsyntheticspeech AT brandthomas measuringspeechrecognitionwithamatrixtestusingsyntheticspeech AT holubeinga measuringspeechrecognitionwithamatrixtestusingsyntheticspeech |