Cargando…

A Visual Speech Intelligibility Benefit Based on Speech Rhythm

This study examined whether visual speech provides speech-rhythm information that perceivers can use in speech perception. This was tested by using speech that naturally varied in the familiarity of its rhythm. Thirty Australian English L1 listeners performed a speech perception in noise task with E...

Descripción completa

Detalles Bibliográficos
Autores principales: Kawase, Saya, Davis, Chris, Kim, Jeesun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10296142/
https://www.ncbi.nlm.nih.gov/pubmed/37371410
http://dx.doi.org/10.3390/brainsci13060932
_version_ 1785063587831611392
author Kawase, Saya
Davis, Chris
Kim, Jeesun
author_facet Kawase, Saya
Davis, Chris
Kim, Jeesun
author_sort Kawase, Saya
collection PubMed
description This study examined whether visual speech provides speech-rhythm information that perceivers can use in speech perception. This was tested by using speech that naturally varied in the familiarity of its rhythm. Thirty Australian English L1 listeners performed a speech perception in noise task with English sentences produced by three speakers: an English L1 speaker (familiar rhythm); an experienced English L2 speaker who had a weak foreign accent (familiar rhythm), and an inexperienced English L2 speaker who had a strong foreign accent (unfamiliar speech rhythm). The spoken sentences were presented in three conditions: Audio-Only (AO), Audio-Visual with mouth covered (AVm), and Audio-Visual (AV). Speech was best recognized in the AV condition regardless of the degree of foreign accent. However, speech recognition in AVm was better than AO for the speech with no foreign accent and with a weak accent, but not for the speech with a strong accent. A follow-up experiment was conducted that only used the speech with a strong foreign accent, under more audible conditions. The results also showed no difference between the AVm and AO conditions, indicating the null effect was not due to a floor effect. We propose that speech rhythm is conveyed by the motion of the jaw opening and closing, and perceivers use this information to better perceive speech in noise.
format Online
Article
Text
id pubmed-10296142
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102961422023-06-28 A Visual Speech Intelligibility Benefit Based on Speech Rhythm Kawase, Saya Davis, Chris Kim, Jeesun Brain Sci Article This study examined whether visual speech provides speech-rhythm information that perceivers can use in speech perception. This was tested by using speech that naturally varied in the familiarity of its rhythm. Thirty Australian English L1 listeners performed a speech perception in noise task with English sentences produced by three speakers: an English L1 speaker (familiar rhythm); an experienced English L2 speaker who had a weak foreign accent (familiar rhythm), and an inexperienced English L2 speaker who had a strong foreign accent (unfamiliar speech rhythm). The spoken sentences were presented in three conditions: Audio-Only (AO), Audio-Visual with mouth covered (AVm), and Audio-Visual (AV). Speech was best recognized in the AV condition regardless of the degree of foreign accent. However, speech recognition in AVm was better than AO for the speech with no foreign accent and with a weak accent, but not for the speech with a strong accent. A follow-up experiment was conducted that only used the speech with a strong foreign accent, under more audible conditions. The results also showed no difference between the AVm and AO conditions, indicating the null effect was not due to a floor effect. We propose that speech rhythm is conveyed by the motion of the jaw opening and closing, and perceivers use this information to better perceive speech in noise. MDPI 2023-06-08 /pmc/articles/PMC10296142/ /pubmed/37371410 http://dx.doi.org/10.3390/brainsci13060932 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kawase, Saya
Davis, Chris
Kim, Jeesun
A Visual Speech Intelligibility Benefit Based on Speech Rhythm
title A Visual Speech Intelligibility Benefit Based on Speech Rhythm
title_full A Visual Speech Intelligibility Benefit Based on Speech Rhythm
title_fullStr A Visual Speech Intelligibility Benefit Based on Speech Rhythm
title_full_unstemmed A Visual Speech Intelligibility Benefit Based on Speech Rhythm
title_short A Visual Speech Intelligibility Benefit Based on Speech Rhythm
title_sort visual speech intelligibility benefit based on speech rhythm
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10296142/
https://www.ncbi.nlm.nih.gov/pubmed/37371410
http://dx.doi.org/10.3390/brainsci13060932
work_keys_str_mv AT kawasesaya avisualspeechintelligibilitybenefitbasedonspeechrhythm
AT davischris avisualspeechintelligibilitybenefitbasedonspeechrhythm
AT kimjeesun avisualspeechintelligibilitybenefitbasedonspeechrhythm
AT kawasesaya visualspeechintelligibilitybenefitbasedonspeechrhythm
AT davischris visualspeechintelligibilitybenefitbasedonspeechrhythm
AT kimjeesun visualspeechintelligibilitybenefitbasedonspeechrhythm