Cargando…

Preliminary Evaluation of Automated Speech Recognition Apps for the Hearing Impaired and Deaf

OBJECTIVE: Automated speech recognition (ASR) systems have become increasingly sophisticated, accurate, and deployable on many digital devices, including on a smartphone. This pilot study aims to examine the speech recognition performance of ASR apps using audiological speech tests. In addition, we...

Descripción completa

Detalles Bibliográficos
Autores principales: Pragt, Leontien, van Hengel, Peter, Grob, Dagmar, Wasmann, Jan-Willem A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8889114/
https://www.ncbi.nlm.nih.gov/pubmed/35252959
http://dx.doi.org/10.3389/fdgth.2022.806076
_version_ 1784661326860124160
author Pragt, Leontien
van Hengel, Peter
Grob, Dagmar
Wasmann, Jan-Willem A.
author_facet Pragt, Leontien
van Hengel, Peter
Grob, Dagmar
Wasmann, Jan-Willem A.
author_sort Pragt, Leontien
collection PubMed
description OBJECTIVE: Automated speech recognition (ASR) systems have become increasingly sophisticated, accurate, and deployable on many digital devices, including on a smartphone. This pilot study aims to examine the speech recognition performance of ASR apps using audiological speech tests. In addition, we compare ASR speech recognition performance to normal hearing and hearing impaired listeners and evaluate if standard clinical audiological tests are a meaningful and quick measure of the performance of ASR apps. METHODS: Four apps have been tested on a smartphone, respectively AVA, Earfy, Live Transcribe, and Speechy. The Dutch audiological speech tests performed were speech audiometry in quiet (Dutch CNC-test), Digits-in-Noise (DIN)-test with steady-state speech-shaped noise, sentences in quiet and in averaged long-term speech-shaped spectrum noise (Plomp-test). For comparison, the app's ability to transcribe a spoken dialogue (Dutch and English) was tested. RESULTS: All apps scored at least 50% phonemes correct on the Dutch CNC-test for a conversational speech intensity level (65 dB SPL) and achieved 90–100% phoneme recognition at higher intensity levels. On the DIN-test, AVA and Live Transcribe had the lowest (best) signal-to-noise ratio +8 dB. The lowest signal-to-noise measured with the Plomp-test was +8 to 9 dB for Earfy (Android) and Live Transcribe (Android). Overall, the word error rate for the dialogue in English (19–34%) was lower (better) than for the Dutch dialogue (25–66%). CONCLUSION: The performance of the apps was limited on audiological tests that provide little linguistic context or use low signal to noise levels. For Dutch audiological speech tests in quiet, ASR apps performed similarly to a person with a moderate hearing loss. In noise, the ASR apps performed more poorly than most profoundly deaf people using a hearing aid or cochlear implant. Adding new performance metrics including the semantic difference as a function of SNR and reverberation time could help to monitor and further improve ASR performance.
format Online
Article
Text
id pubmed-8889114
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-88891142022-03-03 Preliminary Evaluation of Automated Speech Recognition Apps for the Hearing Impaired and Deaf Pragt, Leontien van Hengel, Peter Grob, Dagmar Wasmann, Jan-Willem A. Front Digit Health Digital Health OBJECTIVE: Automated speech recognition (ASR) systems have become increasingly sophisticated, accurate, and deployable on many digital devices, including on a smartphone. This pilot study aims to examine the speech recognition performance of ASR apps using audiological speech tests. In addition, we compare ASR speech recognition performance to normal hearing and hearing impaired listeners and evaluate if standard clinical audiological tests are a meaningful and quick measure of the performance of ASR apps. METHODS: Four apps have been tested on a smartphone, respectively AVA, Earfy, Live Transcribe, and Speechy. The Dutch audiological speech tests performed were speech audiometry in quiet (Dutch CNC-test), Digits-in-Noise (DIN)-test with steady-state speech-shaped noise, sentences in quiet and in averaged long-term speech-shaped spectrum noise (Plomp-test). For comparison, the app's ability to transcribe a spoken dialogue (Dutch and English) was tested. RESULTS: All apps scored at least 50% phonemes correct on the Dutch CNC-test for a conversational speech intensity level (65 dB SPL) and achieved 90–100% phoneme recognition at higher intensity levels. On the DIN-test, AVA and Live Transcribe had the lowest (best) signal-to-noise ratio +8 dB. The lowest signal-to-noise measured with the Plomp-test was +8 to 9 dB for Earfy (Android) and Live Transcribe (Android). Overall, the word error rate for the dialogue in English (19–34%) was lower (better) than for the Dutch dialogue (25–66%). CONCLUSION: The performance of the apps was limited on audiological tests that provide little linguistic context or use low signal to noise levels. For Dutch audiological speech tests in quiet, ASR apps performed similarly to a person with a moderate hearing loss. In noise, the ASR apps performed more poorly than most profoundly deaf people using a hearing aid or cochlear implant. Adding new performance metrics including the semantic difference as a function of SNR and reverberation time could help to monitor and further improve ASR performance. Frontiers Media S.A. 2022-02-16 /pmc/articles/PMC8889114/ /pubmed/35252959 http://dx.doi.org/10.3389/fdgth.2022.806076 Text en Copyright © 2022 Pragt, van Hengel, Grob and Wasmann. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Digital Health
Pragt, Leontien
van Hengel, Peter
Grob, Dagmar
Wasmann, Jan-Willem A.
Preliminary Evaluation of Automated Speech Recognition Apps for the Hearing Impaired and Deaf
title Preliminary Evaluation of Automated Speech Recognition Apps for the Hearing Impaired and Deaf
title_full Preliminary Evaluation of Automated Speech Recognition Apps for the Hearing Impaired and Deaf
title_fullStr Preliminary Evaluation of Automated Speech Recognition Apps for the Hearing Impaired and Deaf
title_full_unstemmed Preliminary Evaluation of Automated Speech Recognition Apps for the Hearing Impaired and Deaf
title_short Preliminary Evaluation of Automated Speech Recognition Apps for the Hearing Impaired and Deaf
title_sort preliminary evaluation of automated speech recognition apps for the hearing impaired and deaf
topic Digital Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8889114/
https://www.ncbi.nlm.nih.gov/pubmed/35252959
http://dx.doi.org/10.3389/fdgth.2022.806076
work_keys_str_mv AT pragtleontien preliminaryevaluationofautomatedspeechrecognitionappsforthehearingimpairedanddeaf
AT vanhengelpeter preliminaryevaluationofautomatedspeechrecognitionappsforthehearingimpairedanddeaf
AT grobdagmar preliminaryevaluationofautomatedspeechrecognitionappsforthehearingimpairedanddeaf
AT wasmannjanwillema preliminaryevaluationofautomatedspeechrecognitionappsforthehearingimpairedanddeaf