Cargando…

A New Acoustic-Based Pronunciation Distance Measure

We present an acoustic distance measure for comparing pronunciations, and apply the measure to assess foreign accent strength in American-English by comparing speech of non-native American-English speakers to a collection of native American-English speakers. An acoustic-only measure is valuable as i...

Descripción completa

Detalles Bibliográficos
Autores principales: Bartelds, Martijn, Richter, Caitlin, Liberman, Mark, Wieling, Martijn
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7861290/
https://www.ncbi.nlm.nih.gov/pubmed/33733156
http://dx.doi.org/10.3389/frai.2020.00039
_version_ 1783647054267940864
author Bartelds, Martijn
Richter, Caitlin
Liberman, Mark
Wieling, Martijn
author_facet Bartelds, Martijn
Richter, Caitlin
Liberman, Mark
Wieling, Martijn
author_sort Bartelds, Martijn
collection PubMed
description We present an acoustic distance measure for comparing pronunciations, and apply the measure to assess foreign accent strength in American-English by comparing speech of non-native American-English speakers to a collection of native American-English speakers. An acoustic-only measure is valuable as it does not require the time-consuming and error-prone process of phonetically transcribing speech samples which is necessary for current edit distance-based approaches. We minimize speaker variability in the data set by employing speaker-based cepstral mean and variance normalization, and compute word-based acoustic distances using the dynamic time warping algorithm. Our results indicate a strong correlation of r = −0.71 (p < 0.0001) between the acoustic distances and human judgments of native-likeness provided by more than 1,100 native American-English raters. Therefore, the convenient acoustic measure performs only slightly lower than the state-of-the-art transcription-based performance of r = −0.77. We also report the results of several small experiments which show that the acoustic measure is not only sensitive to segmental differences, but also to intonational differences and durational differences. However, it is not immune to unwanted differences caused by using a different recording device.
format Online
Article
Text
id pubmed-7861290
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-78612902021-03-16 A New Acoustic-Based Pronunciation Distance Measure Bartelds, Martijn Richter, Caitlin Liberman, Mark Wieling, Martijn Front Artif Intell Artificial Intelligence We present an acoustic distance measure for comparing pronunciations, and apply the measure to assess foreign accent strength in American-English by comparing speech of non-native American-English speakers to a collection of native American-English speakers. An acoustic-only measure is valuable as it does not require the time-consuming and error-prone process of phonetically transcribing speech samples which is necessary for current edit distance-based approaches. We minimize speaker variability in the data set by employing speaker-based cepstral mean and variance normalization, and compute word-based acoustic distances using the dynamic time warping algorithm. Our results indicate a strong correlation of r = −0.71 (p < 0.0001) between the acoustic distances and human judgments of native-likeness provided by more than 1,100 native American-English raters. Therefore, the convenient acoustic measure performs only slightly lower than the state-of-the-art transcription-based performance of r = −0.77. We also report the results of several small experiments which show that the acoustic measure is not only sensitive to segmental differences, but also to intonational differences and durational differences. However, it is not immune to unwanted differences caused by using a different recording device. Frontiers Media S.A. 2020-05-29 /pmc/articles/PMC7861290/ /pubmed/33733156 http://dx.doi.org/10.3389/frai.2020.00039 Text en Copyright © 2020 Bartelds, Richter, Liberman and Wieling. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Artificial Intelligence
Bartelds, Martijn
Richter, Caitlin
Liberman, Mark
Wieling, Martijn
A New Acoustic-Based Pronunciation Distance Measure
title A New Acoustic-Based Pronunciation Distance Measure
title_full A New Acoustic-Based Pronunciation Distance Measure
title_fullStr A New Acoustic-Based Pronunciation Distance Measure
title_full_unstemmed A New Acoustic-Based Pronunciation Distance Measure
title_short A New Acoustic-Based Pronunciation Distance Measure
title_sort new acoustic-based pronunciation distance measure
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7861290/
https://www.ncbi.nlm.nih.gov/pubmed/33733156
http://dx.doi.org/10.3389/frai.2020.00039
work_keys_str_mv AT barteldsmartijn anewacousticbasedpronunciationdistancemeasure
AT richtercaitlin anewacousticbasedpronunciationdistancemeasure
AT libermanmark anewacousticbasedpronunciationdistancemeasure
AT wielingmartijn anewacousticbasedpronunciationdistancemeasure
AT barteldsmartijn newacousticbasedpronunciationdistancemeasure
AT richtercaitlin newacousticbasedpronunciationdistancemeasure
AT libermanmark newacousticbasedpronunciationdistancemeasure
AT wielingmartijn newacousticbasedpronunciationdistancemeasure