The cocktail-party problem revisited: early processing and selection of multi-talker speech

How do we recognize what one person is saying when others are speaking at the same time? This review summarizes widespread research in psychoacoustics, auditory scene analysis, and attention, all dealing with early processing and selection of speech, which has been stimulated by this question. Impor...

Bibliographic Details
Main author:	Bronkhorst, Adelbert W.
Format:	Online Article Text
Language:	English
Published:	Springer US 2015
Subjects:	Article
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4469089/ https://www.ncbi.nlm.nih.gov/pubmed/25828463 http://dx.doi.org/10.3758/s13414-015-0882-9

collection	PubMed
description	How do we recognize what one person is saying when others are speaking at the same time? This review summarizes widespread research in psychoacoustics, auditory scene analysis, and attention, all dealing with early processing and selection of speech, which has been stimulated by this question. Important effects occurring at the peripheral and brainstem levels are mutual masking of sounds and “unmasking” resulting from binaural listening. Psychoacoustic models have been developed that can predict these effects accurately, albeit using computational approaches rather than approximations of neural processing. Grouping—the segregation and streaming of sounds—represents a subsequent processing stage that interacts closely with attention. Sounds can be easily grouped—and subsequently selected—using primitive features such as spatial location and fundamental frequency. More complex processing is required when lexical, syntactic, or semantic information is used. Whereas it is now clear that such processing can take place preattentively, there also is evidence that the processing depth depends on the task-relevancy of the sound. This is consistent with the presence of a feedback loop in attentional control, triggering enhancement of to-be-selected input. Despite recent progress, there are still many unresolved issues: there is a need for integrative models that are neurophysiologically plausible, for research into grouping based on other than spatial or voice-related cues, for studies explicitly addressing endogenous and exogenous attention, for an explanation of the remarkable sluggishness of attention focused on dynamically changing sounds, and for research elucidating the distinction between binaural speech perception and sound localization.
id	pubmed-4469089
institution	National Center for Biotechnology Information
record_format	MEDLINE/PubMed
spelling	pubmed-4469089 2015-06-17 The cocktail-party problem revisited: early processing and selection of multi-talker speech Bronkhorst, Adelbert W. Atten Percept Psychophys Article Springer US 2015-04-01 2015 /pmc/articles/PMC4469089/ /pubmed/25828463 http://dx.doi.org/10.3758/s13414-015-0882-9 Text en © The Author(s) 2015 https://creativecommons.org/licenses/by/4.0/ Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Similar Items