Cargando…

Auditory Sketches: Very Sparse Representations of Sounds Are Still Recognizable

Sounds in our environment like voices, animal calls or musical instruments are easily recognized by human listeners. Understanding the key features underlying this robust sound recognition is an important question in auditory science. Here, we studied the recognition by human listeners of new classe...

Descripción completa

Detalles Bibliográficos
Autores principales: Isnard, Vincent, Taffou, Marine, Viaud-Delmon, Isabelle, Suied, Clara
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4780819/
https://www.ncbi.nlm.nih.gov/pubmed/26950589
http://dx.doi.org/10.1371/journal.pone.0150313
_version_ 1782419815799980032
author Isnard, Vincent
Taffou, Marine
Viaud-Delmon, Isabelle
Suied, Clara
author_facet Isnard, Vincent
Taffou, Marine
Viaud-Delmon, Isabelle
Suied, Clara
author_sort Isnard, Vincent
collection PubMed
description Sounds in our environment like voices, animal calls or musical instruments are easily recognized by human listeners. Understanding the key features underlying this robust sound recognition is an important question in auditory science. Here, we studied the recognition by human listeners of new classes of sounds: acoustic and auditory sketches, sounds that are severely impoverished but still recognizable. Starting from a time-frequency representation, a sketch is obtained by keeping only sparse elements of the original signal, here, by means of a simple peak-picking algorithm. Two time-frequency representations were compared: a biologically grounded one, the auditory spectrogram, which simulates peripheral auditory filtering, and a simple acoustic spectrogram, based on a Fourier transform. Three degrees of sparsity were also investigated. Listeners were asked to recognize the category to which a sketch sound belongs: singing voices, bird calls, musical instruments, and vehicle engine noises. Results showed that, with the exception of voice sounds, very sparse representations of sounds (10 features, or energy peaks, per second) could be recognized above chance. No clear differences could be observed between the acoustic and the auditory sketches. For the voice sounds, however, a completely different pattern of results emerged, with at-chance or even below-chance recognition performances, suggesting that the important features of the voice, whatever they are, were removed by the sketch process. Overall, these perceptual results were well correlated with a model of auditory distances, based on spectro-temporal excitation patterns (STEPs). This study confirms the potential of these new classes of sounds, acoustic and auditory sketches, to study sound recognition.
format Online
Article
Text
id pubmed-4780819
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-47808192016-03-23 Auditory Sketches: Very Sparse Representations of Sounds Are Still Recognizable Isnard, Vincent Taffou, Marine Viaud-Delmon, Isabelle Suied, Clara PLoS One Research Article Sounds in our environment like voices, animal calls or musical instruments are easily recognized by human listeners. Understanding the key features underlying this robust sound recognition is an important question in auditory science. Here, we studied the recognition by human listeners of new classes of sounds: acoustic and auditory sketches, sounds that are severely impoverished but still recognizable. Starting from a time-frequency representation, a sketch is obtained by keeping only sparse elements of the original signal, here, by means of a simple peak-picking algorithm. Two time-frequency representations were compared: a biologically grounded one, the auditory spectrogram, which simulates peripheral auditory filtering, and a simple acoustic spectrogram, based on a Fourier transform. Three degrees of sparsity were also investigated. Listeners were asked to recognize the category to which a sketch sound belongs: singing voices, bird calls, musical instruments, and vehicle engine noises. Results showed that, with the exception of voice sounds, very sparse representations of sounds (10 features, or energy peaks, per second) could be recognized above chance. No clear differences could be observed between the acoustic and the auditory sketches. For the voice sounds, however, a completely different pattern of results emerged, with at-chance or even below-chance recognition performances, suggesting that the important features of the voice, whatever they are, were removed by the sketch process. Overall, these perceptual results were well correlated with a model of auditory distances, based on spectro-temporal excitation patterns (STEPs). This study confirms the potential of these new classes of sounds, acoustic and auditory sketches, to study sound recognition. Public Library of Science 2016-03-07 /pmc/articles/PMC4780819/ /pubmed/26950589 http://dx.doi.org/10.1371/journal.pone.0150313 Text en © 2016 Isnard et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Isnard, Vincent
Taffou, Marine
Viaud-Delmon, Isabelle
Suied, Clara
Auditory Sketches: Very Sparse Representations of Sounds Are Still Recognizable
title Auditory Sketches: Very Sparse Representations of Sounds Are Still Recognizable
title_full Auditory Sketches: Very Sparse Representations of Sounds Are Still Recognizable
title_fullStr Auditory Sketches: Very Sparse Representations of Sounds Are Still Recognizable
title_full_unstemmed Auditory Sketches: Very Sparse Representations of Sounds Are Still Recognizable
title_short Auditory Sketches: Very Sparse Representations of Sounds Are Still Recognizable
title_sort auditory sketches: very sparse representations of sounds are still recognizable
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4780819/
https://www.ncbi.nlm.nih.gov/pubmed/26950589
http://dx.doi.org/10.1371/journal.pone.0150313
work_keys_str_mv AT isnardvincent auditorysketchesverysparserepresentationsofsoundsarestillrecognizable
AT taffoumarine auditorysketchesverysparserepresentationsofsoundsarestillrecognizable
AT viauddelmonisabelle auditorysketchesverysparserepresentationsofsoundsarestillrecognizable
AT suiedclara auditorysketchesverysparserepresentationsofsoundsarestillrecognizable