Cargando…

Animated virtual characters to explore audio-visual speech in controlled and naturalistic environments

Natural speech is processed in the brain as a mixture of auditory and visual features. An example of the importance of visual speech is the McGurk effect and related perceptual illusions that result from mismatching auditory and visual syllables. Although the McGurk effect has widely been applied to...

Descripción completa

Detalles Bibliográficos
Autores principales:	Thézé, Raphaël, Gadiri, Mehdi Ali, Albert, Louis, Provost, Antoine, Giraud, Anne-Lise, Mégevand, Pierre
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7511320/ https://www.ncbi.nlm.nih.gov/pubmed/32968127 http://dx.doi.org/10.1038/s41598-020-72375-y

_version_	1783585942115713024
author	Thézé, Raphaël Gadiri, Mehdi Ali Albert, Louis Provost, Antoine Giraud, Anne-Lise Mégevand, Pierre
author_facet	Thézé, Raphaël Gadiri, Mehdi Ali Albert, Louis Provost, Antoine Giraud, Anne-Lise Mégevand, Pierre
author_sort	Thézé, Raphaël
collection	PubMed
description	Natural speech is processed in the brain as a mixture of auditory and visual features. An example of the importance of visual speech is the McGurk effect and related perceptual illusions that result from mismatching auditory and visual syllables. Although the McGurk effect has widely been applied to the exploration of audio-visual speech processing, it relies on isolated syllables, which severely limits the conclusions that can be drawn from the paradigm. In addition, the extreme variability and the quality of the stimuli usually employed prevents comparability across studies. To overcome these limitations, we present an innovative methodology using 3D virtual characters with realistic lip movements synchronized on computer-synthesized speech. We used commercially accessible and affordable tools to facilitate reproducibility and comparability, and the set-up was validated on 24 participants performing a perception task. Within complete and meaningful French sentences, we paired a labiodental fricative viseme (i.e. /v/) with a bilabial occlusive phoneme (i.e. /b/). This audiovisual mismatch is known to induce the illusion of hearing /v/ in a proportion of trials. We tested the rate of the illusion while varying the magnitude of background noise and audiovisual lag. Overall, the effect was observed in 40% of trials. The proportion rose to about 50% with added background noise and up to 66% when controlling for phonetic features. Our results conclusively demonstrate that computer-generated speech stimuli are judicious, and that they can supplement natural speech with higher control over stimulus timing and content.
format	Online Article Text
id	pubmed-7511320
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-75113202020-09-24 Animated virtual characters to explore audio-visual speech in controlled and naturalistic environments Thézé, Raphaël Gadiri, Mehdi Ali Albert, Louis Provost, Antoine Giraud, Anne-Lise Mégevand, Pierre Sci Rep Article Natural speech is processed in the brain as a mixture of auditory and visual features. An example of the importance of visual speech is the McGurk effect and related perceptual illusions that result from mismatching auditory and visual syllables. Although the McGurk effect has widely been applied to the exploration of audio-visual speech processing, it relies on isolated syllables, which severely limits the conclusions that can be drawn from the paradigm. In addition, the extreme variability and the quality of the stimuli usually employed prevents comparability across studies. To overcome these limitations, we present an innovative methodology using 3D virtual characters with realistic lip movements synchronized on computer-synthesized speech. We used commercially accessible and affordable tools to facilitate reproducibility and comparability, and the set-up was validated on 24 participants performing a perception task. Within complete and meaningful French sentences, we paired a labiodental fricative viseme (i.e. /v/) with a bilabial occlusive phoneme (i.e. /b/). This audiovisual mismatch is known to induce the illusion of hearing /v/ in a proportion of trials. We tested the rate of the illusion while varying the magnitude of background noise and audiovisual lag. Overall, the effect was observed in 40% of trials. The proportion rose to about 50% with added background noise and up to 66% when controlling for phonetic features. Our results conclusively demonstrate that computer-generated speech stimuli are judicious, and that they can supplement natural speech with higher control over stimulus timing and content. Nature Publishing Group UK 2020-09-23 /pmc/articles/PMC7511320/ /pubmed/32968127 http://dx.doi.org/10.1038/s41598-020-72375-y Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle	Article Thézé, Raphaël Gadiri, Mehdi Ali Albert, Louis Provost, Antoine Giraud, Anne-Lise Mégevand, Pierre Animated virtual characters to explore audio-visual speech in controlled and naturalistic environments
title	Animated virtual characters to explore audio-visual speech in controlled and naturalistic environments
title_full	Animated virtual characters to explore audio-visual speech in controlled and naturalistic environments
title_fullStr	Animated virtual characters to explore audio-visual speech in controlled and naturalistic environments
title_full_unstemmed	Animated virtual characters to explore audio-visual speech in controlled and naturalistic environments
title_short	Animated virtual characters to explore audio-visual speech in controlled and naturalistic environments
title_sort	animated virtual characters to explore audio-visual speech in controlled and naturalistic environments
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7511320/ https://www.ncbi.nlm.nih.gov/pubmed/32968127 http://dx.doi.org/10.1038/s41598-020-72375-y
work_keys_str_mv	AT thezeraphael animatedvirtualcharacterstoexploreaudiovisualspeechincontrolledandnaturalisticenvironments AT gadirimehdiali animatedvirtualcharacterstoexploreaudiovisualspeechincontrolledandnaturalisticenvironments AT albertlouis animatedvirtualcharacterstoexploreaudiovisualspeechincontrolledandnaturalisticenvironments AT provostantoine animatedvirtualcharacterstoexploreaudiovisualspeechincontrolledandnaturalisticenvironments AT giraudannelise animatedvirtualcharacterstoexploreaudiovisualspeechincontrolledandnaturalisticenvironments AT megevandpierre animatedvirtualcharacterstoexploreaudiovisualspeechincontrolledandnaturalisticenvironments

Animated virtual characters to explore audio-visual speech in controlled and naturalistic environments

Ejemplares similares