Cargando…

Auditory Time-Frequency Masking for Spectrally and Temporally Maximally-Compact Stimuli

Many audio applications perform perception-based time-frequency (TF) analysis by decomposing sounds into a set of functions with good TF localization (i.e. with a small essential support in the TF domain) using TF transforms and applying psychoacoustic models of auditory masking to the transform coe...

Descripción completa

Detalles Bibliográficos
Autores principales:	Necciari, Thibaud, Laback, Bernhard, Savel, Sophie, Ystad, Sølvi, Balazs, Peter, Meunier, Sabine, Kronland-Martinet, Richard
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2016
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5119819/ https://www.ncbi.nlm.nih.gov/pubmed/27875575 http://dx.doi.org/10.1371/journal.pone.0166937

_version_	1782469126244007936
author	Necciari, Thibaud Laback, Bernhard Savel, Sophie Ystad, Sølvi Balazs, Peter Meunier, Sabine Kronland-Martinet, Richard
author_facet	Necciari, Thibaud Laback, Bernhard Savel, Sophie Ystad, Sølvi Balazs, Peter Meunier, Sabine Kronland-Martinet, Richard
author_sort	Necciari, Thibaud
collection	PubMed
description	Many audio applications perform perception-based time-frequency (TF) analysis by decomposing sounds into a set of functions with good TF localization (i.e. with a small essential support in the TF domain) using TF transforms and applying psychoacoustic models of auditory masking to the transform coefficients. To accurately predict masking interactions between coefficients, the TF properties of the model should match those of the transform. This involves having masking data for stimuli with good TF localization. However, little is known about TF masking for mathematically well-localized signals. Most existing masking studies used stimuli that are broad in time and/or frequency and few studies involved TF conditions. Consequently, the present study had two goals. The first was to collect TF masking data for well-localized stimuli in humans. Masker and target were 10-ms Gaussian-shaped sinusoids with a bandwidth of approximately one critical band. The overall pattern of results is qualitatively similar to existing data for long maskers. To facilitate implementation in audio processing algorithms, a dataset provides the measured TF masking function. The second goal was to assess the potential effect of auditory efferents on TF masking using a modeling approach. The temporal window model of masking was used to predict present and existing data in two configurations: (1) with standard model parameters (i.e. without efferents), (2) with cochlear gain reduction to simulate the activation of efferents. The ability of the model to predict the present data was quite good with the standard configuration but highly degraded with gain reduction. Conversely, the ability of the model to predict existing data for long maskers was better with than without gain reduction. Overall, the model predictions suggest that TF masking can be affected by efferent (or other) effects that reduce cochlear gain. Such effects were avoided in the experiment of this study by using maximally-compact stimuli.
format	Online Article Text
id	pubmed-5119819
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-51198192016-12-15 Auditory Time-Frequency Masking for Spectrally and Temporally Maximally-Compact Stimuli Necciari, Thibaud Laback, Bernhard Savel, Sophie Ystad, Sølvi Balazs, Peter Meunier, Sabine Kronland-Martinet, Richard PLoS One Research Article Many audio applications perform perception-based time-frequency (TF) analysis by decomposing sounds into a set of functions with good TF localization (i.e. with a small essential support in the TF domain) using TF transforms and applying psychoacoustic models of auditory masking to the transform coefficients. To accurately predict masking interactions between coefficients, the TF properties of the model should match those of the transform. This involves having masking data for stimuli with good TF localization. However, little is known about TF masking for mathematically well-localized signals. Most existing masking studies used stimuli that are broad in time and/or frequency and few studies involved TF conditions. Consequently, the present study had two goals. The first was to collect TF masking data for well-localized stimuli in humans. Masker and target were 10-ms Gaussian-shaped sinusoids with a bandwidth of approximately one critical band. The overall pattern of results is qualitatively similar to existing data for long maskers. To facilitate implementation in audio processing algorithms, a dataset provides the measured TF masking function. The second goal was to assess the potential effect of auditory efferents on TF masking using a modeling approach. The temporal window model of masking was used to predict present and existing data in two configurations: (1) with standard model parameters (i.e. without efferents), (2) with cochlear gain reduction to simulate the activation of efferents. The ability of the model to predict the present data was quite good with the standard configuration but highly degraded with gain reduction. Conversely, the ability of the model to predict existing data for long maskers was better with than without gain reduction. Overall, the model predictions suggest that TF masking can be affected by efferent (or other) effects that reduce cochlear gain. Such effects were avoided in the experiment of this study by using maximally-compact stimuli. Public Library of Science 2016-11-22 /pmc/articles/PMC5119819/ /pubmed/27875575 http://dx.doi.org/10.1371/journal.pone.0166937 Text en © 2016 Necciari et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Necciari, Thibaud Laback, Bernhard Savel, Sophie Ystad, Sølvi Balazs, Peter Meunier, Sabine Kronland-Martinet, Richard Auditory Time-Frequency Masking for Spectrally and Temporally Maximally-Compact Stimuli
title	Auditory Time-Frequency Masking for Spectrally and Temporally Maximally-Compact Stimuli
title_full	Auditory Time-Frequency Masking for Spectrally and Temporally Maximally-Compact Stimuli
title_fullStr	Auditory Time-Frequency Masking for Spectrally and Temporally Maximally-Compact Stimuli
title_full_unstemmed	Auditory Time-Frequency Masking for Spectrally and Temporally Maximally-Compact Stimuli
title_short	Auditory Time-Frequency Masking for Spectrally and Temporally Maximally-Compact Stimuli
title_sort	auditory time-frequency masking for spectrally and temporally maximally-compact stimuli
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5119819/ https://www.ncbi.nlm.nih.gov/pubmed/27875575 http://dx.doi.org/10.1371/journal.pone.0166937
work_keys_str_mv	AT necciarithibaud auditorytimefrequencymaskingforspectrallyandtemporallymaximallycompactstimuli AT labackbernhard auditorytimefrequencymaskingforspectrallyandtemporallymaximallycompactstimuli AT savelsophie auditorytimefrequencymaskingforspectrallyandtemporallymaximallycompactstimuli AT ystadsølvi auditorytimefrequencymaskingforspectrallyandtemporallymaximallycompactstimuli AT balazspeter auditorytimefrequencymaskingforspectrallyandtemporallymaximallycompactstimuli AT meuniersabine auditorytimefrequencymaskingforspectrallyandtemporallymaximallycompactstimuli AT kronlandmartinetrichard auditorytimefrequencymaskingforspectrallyandtemporallymaximallycompactstimuli

Auditory Time-Frequency Masking for Spectrally and Temporally Maximally-Compact Stimuli

Ejemplares similares