Cargando…

One hundred ways to process time, frequency, rate and scale in the central auditory system: a pattern-recognition meta-analysis

The mammalian auditory system extracts features from the acoustic environment based on the responses of spatially distributed sets of neurons in the subcortical and cortical auditory structures. The characteristic responses of these neurons (linearly approximated by their spectro-temporal receptive...

Descripción completa

Detalles Bibliográficos
Autores principales: Hemery, Edgar, Aucouturier, Jean-Julien
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4490656/
https://www.ncbi.nlm.nih.gov/pubmed/26190996
http://dx.doi.org/10.3389/fncom.2015.00080
_version_ 1782379547132428288
author Hemery, Edgar
Aucouturier, Jean-Julien
author_facet Hemery, Edgar
Aucouturier, Jean-Julien
author_sort Hemery, Edgar
collection PubMed
description The mammalian auditory system extracts features from the acoustic environment based on the responses of spatially distributed sets of neurons in the subcortical and cortical auditory structures. The characteristic responses of these neurons (linearly approximated by their spectro-temporal receptive fields, or STRFs) suggest that auditory representations are formed, as early as in the inferior colliculi, on the basis of a time, frequency, rate (temporal modulations) and scale (spectral modulations) analysis of sound. However, how these four dimensions are integrated and processed in subsequent neural networks remains unclear. In this work, we present a new methodology to generate computational insights into the functional organization of such processes. We first propose a systematic framework to explore more than a hundred different computational strategies proposed in the literature to process the output of a generic STRF model. We then evaluate these strategies on their ability to compute perceptual distances between pairs of environmental sounds. Finally, we conduct a meta-analysis of the dataset of all these algorithms' accuracies to examine whether certain combinations of dimensions and certain ways to treat such dimensions are, on the whole, more computationally effective than others. We present an application of this methodology to a dataset of ten environmental sound categories, in which the analysis reveals that (1) models are most effective when they organize STRF data into frequency groupings—which is consistent with the known tonotopic organization of receptive fields in auditory structures -, and that (2) models that treat STRF data as time series are no more effective than models that rely only on summary statistics along time—which corroborates recent experimental evidence on texture discrimination by summary statistics.
format Online
Article
Text
id pubmed-4490656
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-44906562015-07-17 One hundred ways to process time, frequency, rate and scale in the central auditory system: a pattern-recognition meta-analysis Hemery, Edgar Aucouturier, Jean-Julien Front Comput Neurosci Neuroscience The mammalian auditory system extracts features from the acoustic environment based on the responses of spatially distributed sets of neurons in the subcortical and cortical auditory structures. The characteristic responses of these neurons (linearly approximated by their spectro-temporal receptive fields, or STRFs) suggest that auditory representations are formed, as early as in the inferior colliculi, on the basis of a time, frequency, rate (temporal modulations) and scale (spectral modulations) analysis of sound. However, how these four dimensions are integrated and processed in subsequent neural networks remains unclear. In this work, we present a new methodology to generate computational insights into the functional organization of such processes. We first propose a systematic framework to explore more than a hundred different computational strategies proposed in the literature to process the output of a generic STRF model. We then evaluate these strategies on their ability to compute perceptual distances between pairs of environmental sounds. Finally, we conduct a meta-analysis of the dataset of all these algorithms' accuracies to examine whether certain combinations of dimensions and certain ways to treat such dimensions are, on the whole, more computationally effective than others. We present an application of this methodology to a dataset of ten environmental sound categories, in which the analysis reveals that (1) models are most effective when they organize STRF data into frequency groupings—which is consistent with the known tonotopic organization of receptive fields in auditory structures -, and that (2) models that treat STRF data as time series are no more effective than models that rely only on summary statistics along time—which corroborates recent experimental evidence on texture discrimination by summary statistics. Frontiers Media S.A. 2015-07-03 /pmc/articles/PMC4490656/ /pubmed/26190996 http://dx.doi.org/10.3389/fncom.2015.00080 Text en Copyright © 2015 Hemery and Aucouturier. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Hemery, Edgar
Aucouturier, Jean-Julien
One hundred ways to process time, frequency, rate and scale in the central auditory system: a pattern-recognition meta-analysis
title One hundred ways to process time, frequency, rate and scale in the central auditory system: a pattern-recognition meta-analysis
title_full One hundred ways to process time, frequency, rate and scale in the central auditory system: a pattern-recognition meta-analysis
title_fullStr One hundred ways to process time, frequency, rate and scale in the central auditory system: a pattern-recognition meta-analysis
title_full_unstemmed One hundred ways to process time, frequency, rate and scale in the central auditory system: a pattern-recognition meta-analysis
title_short One hundred ways to process time, frequency, rate and scale in the central auditory system: a pattern-recognition meta-analysis
title_sort one hundred ways to process time, frequency, rate and scale in the central auditory system: a pattern-recognition meta-analysis
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4490656/
https://www.ncbi.nlm.nih.gov/pubmed/26190996
http://dx.doi.org/10.3389/fncom.2015.00080
work_keys_str_mv AT hemeryedgar onehundredwaystoprocesstimefrequencyrateandscaleinthecentralauditorysystemapatternrecognitionmetaanalysis
AT aucouturierjeanjulien onehundredwaystoprocesstimefrequencyrateandscaleinthecentralauditorysystemapatternrecognitionmetaanalysis