Cargando…
A New Acoustic-Based Pronunciation Distance Measure
We present an acoustic distance measure for comparing pronunciations, and apply the measure to assess foreign accent strength in American-English by comparing speech of non-native American-English speakers to a collection of native American-English speakers. An acoustic-only measure is valuable as i...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7861290/ https://www.ncbi.nlm.nih.gov/pubmed/33733156 http://dx.doi.org/10.3389/frai.2020.00039 |
_version_ | 1783647054267940864 |
---|---|
author | Bartelds, Martijn Richter, Caitlin Liberman, Mark Wieling, Martijn |
author_facet | Bartelds, Martijn Richter, Caitlin Liberman, Mark Wieling, Martijn |
author_sort | Bartelds, Martijn |
collection | PubMed |
description | We present an acoustic distance measure for comparing pronunciations, and apply the measure to assess foreign accent strength in American-English by comparing speech of non-native American-English speakers to a collection of native American-English speakers. An acoustic-only measure is valuable as it does not require the time-consuming and error-prone process of phonetically transcribing speech samples which is necessary for current edit distance-based approaches. We minimize speaker variability in the data set by employing speaker-based cepstral mean and variance normalization, and compute word-based acoustic distances using the dynamic time warping algorithm. Our results indicate a strong correlation of r = −0.71 (p < 0.0001) between the acoustic distances and human judgments of native-likeness provided by more than 1,100 native American-English raters. Therefore, the convenient acoustic measure performs only slightly lower than the state-of-the-art transcription-based performance of r = −0.77. We also report the results of several small experiments which show that the acoustic measure is not only sensitive to segmental differences, but also to intonational differences and durational differences. However, it is not immune to unwanted differences caused by using a different recording device. |
format | Online Article Text |
id | pubmed-7861290 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-78612902021-03-16 A New Acoustic-Based Pronunciation Distance Measure Bartelds, Martijn Richter, Caitlin Liberman, Mark Wieling, Martijn Front Artif Intell Artificial Intelligence We present an acoustic distance measure for comparing pronunciations, and apply the measure to assess foreign accent strength in American-English by comparing speech of non-native American-English speakers to a collection of native American-English speakers. An acoustic-only measure is valuable as it does not require the time-consuming and error-prone process of phonetically transcribing speech samples which is necessary for current edit distance-based approaches. We minimize speaker variability in the data set by employing speaker-based cepstral mean and variance normalization, and compute word-based acoustic distances using the dynamic time warping algorithm. Our results indicate a strong correlation of r = −0.71 (p < 0.0001) between the acoustic distances and human judgments of native-likeness provided by more than 1,100 native American-English raters. Therefore, the convenient acoustic measure performs only slightly lower than the state-of-the-art transcription-based performance of r = −0.77. We also report the results of several small experiments which show that the acoustic measure is not only sensitive to segmental differences, but also to intonational differences and durational differences. However, it is not immune to unwanted differences caused by using a different recording device. Frontiers Media S.A. 2020-05-29 /pmc/articles/PMC7861290/ /pubmed/33733156 http://dx.doi.org/10.3389/frai.2020.00039 Text en Copyright © 2020 Bartelds, Richter, Liberman and Wieling. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Artificial Intelligence Bartelds, Martijn Richter, Caitlin Liberman, Mark Wieling, Martijn A New Acoustic-Based Pronunciation Distance Measure |
title | A New Acoustic-Based Pronunciation Distance Measure |
title_full | A New Acoustic-Based Pronunciation Distance Measure |
title_fullStr | A New Acoustic-Based Pronunciation Distance Measure |
title_full_unstemmed | A New Acoustic-Based Pronunciation Distance Measure |
title_short | A New Acoustic-Based Pronunciation Distance Measure |
title_sort | new acoustic-based pronunciation distance measure |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7861290/ https://www.ncbi.nlm.nih.gov/pubmed/33733156 http://dx.doi.org/10.3389/frai.2020.00039 |
work_keys_str_mv | AT barteldsmartijn anewacousticbasedpronunciationdistancemeasure AT richtercaitlin anewacousticbasedpronunciationdistancemeasure AT libermanmark anewacousticbasedpronunciationdistancemeasure AT wielingmartijn anewacousticbasedpronunciationdistancemeasure AT barteldsmartijn newacousticbasedpronunciationdistancemeasure AT richtercaitlin newacousticbasedpronunciationdistancemeasure AT libermanmark newacousticbasedpronunciationdistancemeasure AT wielingmartijn newacousticbasedpronunciationdistancemeasure |