Cargando…

Effects of Spatial Speech Presentation on Listener Response Strategy for Talker-Identification

This study investigates effects of spatial auditory cues on human listeners' response strategy for identifying two alternately active talkers (“turn-taking” listening scenario). Previous research has demonstrated subjective benefits of audio spatialization with regard to speech intelligibility...

Descripción completa

Detalles Bibliográficos
Autores principales:	Uhrig, Stefan, Perkis, Andrew, Möller, Sebastian, Svensson, U. Peter, Behne, Dawn M.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2022
Materias:	Neuroscience
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8831717/ https://www.ncbi.nlm.nih.gov/pubmed/35153653 http://dx.doi.org/10.3389/fnins.2021.730744

_version_	1784648563670646784
author	Uhrig, Stefan Perkis, Andrew Möller, Sebastian Svensson, U. Peter Behne, Dawn M.
author_facet	Uhrig, Stefan Perkis, Andrew Möller, Sebastian Svensson, U. Peter Behne, Dawn M.
author_sort	Uhrig, Stefan
collection	PubMed
description	This study investigates effects of spatial auditory cues on human listeners' response strategy for identifying two alternately active talkers (“turn-taking” listening scenario). Previous research has demonstrated subjective benefits of audio spatialization with regard to speech intelligibility and talker-identification effort. So far, the deliberate activation of specific perceptual and cognitive processes by listeners to optimize their task performance remained largely unexamined. Spoken sentences selected as stimuli were either clean or degraded due to background noise or bandpass filtering. Stimuli were presented via three horizontally positioned loudspeakers: In a non-spatial mode, both talkers were presented through a central loudspeaker; in a spatial mode, each talker was presented through the central or a talker-specific lateral loudspeaker. Participants identified talkers via speeded keypresses and afterwards provided subjective ratings (speech quality, speech intelligibility, voice similarity, talker-identification effort). In the spatial mode, presentations at lateral loudspeaker locations entailed quicker behavioral responses, which were significantly slower in comparison to a talker-localization task. Under clean speech, response times globally increased in the spatial vs. non-spatial mode (across all locations); these “response time switch costs,” presumably being caused by repeated switching of spatial auditory attention between different locations, diminished under degraded speech. No significant effects of spatialization on subjective ratings were found. The results suggested that when listeners could utilize task-relevant auditory cues about talker location, they continued to rely on voice recognition instead of localization of talker sound sources as primary response strategy. Besides, the presence of speech degradations may have led to increased cognitive control, which in turn compensated for incurring response time switch costs.
format	Online Article Text
id	pubmed-8831717
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-88317172022-02-12 Effects of Spatial Speech Presentation on Listener Response Strategy for Talker-Identification Uhrig, Stefan Perkis, Andrew Möller, Sebastian Svensson, U. Peter Behne, Dawn M. Front Neurosci Neuroscience This study investigates effects of spatial auditory cues on human listeners' response strategy for identifying two alternately active talkers (“turn-taking” listening scenario). Previous research has demonstrated subjective benefits of audio spatialization with regard to speech intelligibility and talker-identification effort. So far, the deliberate activation of specific perceptual and cognitive processes by listeners to optimize their task performance remained largely unexamined. Spoken sentences selected as stimuli were either clean or degraded due to background noise or bandpass filtering. Stimuli were presented via three horizontally positioned loudspeakers: In a non-spatial mode, both talkers were presented through a central loudspeaker; in a spatial mode, each talker was presented through the central or a talker-specific lateral loudspeaker. Participants identified talkers via speeded keypresses and afterwards provided subjective ratings (speech quality, speech intelligibility, voice similarity, talker-identification effort). In the spatial mode, presentations at lateral loudspeaker locations entailed quicker behavioral responses, which were significantly slower in comparison to a talker-localization task. Under clean speech, response times globally increased in the spatial vs. non-spatial mode (across all locations); these “response time switch costs,” presumably being caused by repeated switching of spatial auditory attention between different locations, diminished under degraded speech. No significant effects of spatialization on subjective ratings were found. The results suggested that when listeners could utilize task-relevant auditory cues about talker location, they continued to rely on voice recognition instead of localization of talker sound sources as primary response strategy. Besides, the presence of speech degradations may have led to increased cognitive control, which in turn compensated for incurring response time switch costs. Frontiers Media S.A. 2022-01-28 /pmc/articles/PMC8831717/ /pubmed/35153653 http://dx.doi.org/10.3389/fnins.2021.730744 Text en Copyright © 2022 Uhrig, Perkis, Möller, Svensson and Behne. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Neuroscience Uhrig, Stefan Perkis, Andrew Möller, Sebastian Svensson, U. Peter Behne, Dawn M. Effects of Spatial Speech Presentation on Listener Response Strategy for Talker-Identification
title	Effects of Spatial Speech Presentation on Listener Response Strategy for Talker-Identification
title_full	Effects of Spatial Speech Presentation on Listener Response Strategy for Talker-Identification
title_fullStr	Effects of Spatial Speech Presentation on Listener Response Strategy for Talker-Identification
title_full_unstemmed	Effects of Spatial Speech Presentation on Listener Response Strategy for Talker-Identification
title_short	Effects of Spatial Speech Presentation on Listener Response Strategy for Talker-Identification
title_sort	effects of spatial speech presentation on listener response strategy for talker-identification
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8831717/ https://www.ncbi.nlm.nih.gov/pubmed/35153653 http://dx.doi.org/10.3389/fnins.2021.730744
work_keys_str_mv	AT uhrigstefan effectsofspatialspeechpresentationonlistenerresponsestrategyfortalkeridentification AT perkisandrew effectsofspatialspeechpresentationonlistenerresponsestrategyfortalkeridentification AT mollersebastian effectsofspatialspeechpresentationonlistenerresponsestrategyfortalkeridentification AT svenssonupeter effectsofspatialspeechpresentationonlistenerresponsestrategyfortalkeridentification AT behnedawnm effectsofspatialspeechpresentationonlistenerresponsestrategyfortalkeridentification

Effects of Spatial Speech Presentation on Listener Response Strategy for Talker-Identification

Ejemplares similares