Cargando…

A New Approach to Model Pitch Perception Using Sparse Coding

Our acoustical environment abounds with repetitive sounds, some of which are related to pitch perception. It is still unknown how the auditory system, in processing these sounds, relates a physical stimulus and its percept. Since, in mammals, all auditory stimuli are conveyed into the nervous system...

Descripción completa

Detalles Bibliográficos
Autores principales: Barzelay, Oded, Furst, Miriam, Barak, Omri
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5308863/
https://www.ncbi.nlm.nih.gov/pubmed/28099436
http://dx.doi.org/10.1371/journal.pcbi.1005338
_version_ 1782507615721357312
author Barzelay, Oded
Furst, Miriam
Barak, Omri
author_facet Barzelay, Oded
Furst, Miriam
Barak, Omri
author_sort Barzelay, Oded
collection PubMed
description Our acoustical environment abounds with repetitive sounds, some of which are related to pitch perception. It is still unknown how the auditory system, in processing these sounds, relates a physical stimulus and its percept. Since, in mammals, all auditory stimuli are conveyed into the nervous system through the auditory nerve (AN) fibers, a model should explain the perception of pitch as a function of this particular input. However, pitch perception is invariant to certain features of the physical stimulus. For example, a missing fundamental stimulus with resolved or unresolved harmonics, or a low and high-level amplitude stimulus with the same spectral content–these all give rise to the same percept of pitch. In contrast, the AN representations for these different stimuli are not invariant to these effects. In fact, due to saturation and non-linearity of both cochlear and inner hair cells responses, these differences are enhanced by the AN fibers. Thus there is a difficulty in explaining how pitch percept arises from the activity of the AN fibers. We introduce a novel approach for extracting pitch cues from the AN population activity for a given arbitrary stimulus. The method is based on a technique known as sparse coding (SC). It is the representation of pitch cues by a few spatiotemporal atoms (templates) from among a large set of possible ones (a dictionary). The amount of activity of each atom is represented by a non-zero coefficient, analogous to an active neuron. Such a technique has been successfully applied to other modalities, particularly vision. The model is composed of a cochlear model, an SC processing unit, and a harmonic sieve. We show that the model copes with different pitch phenomena: extracting resolved and non-resolved harmonics, missing fundamental pitches, stimuli with both high and low amplitudes, iterated rippled noises, and recorded musical instruments.
format Online
Article
Text
id pubmed-5308863
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-53088632017-03-03 A New Approach to Model Pitch Perception Using Sparse Coding Barzelay, Oded Furst, Miriam Barak, Omri PLoS Comput Biol Research Article Our acoustical environment abounds with repetitive sounds, some of which are related to pitch perception. It is still unknown how the auditory system, in processing these sounds, relates a physical stimulus and its percept. Since, in mammals, all auditory stimuli are conveyed into the nervous system through the auditory nerve (AN) fibers, a model should explain the perception of pitch as a function of this particular input. However, pitch perception is invariant to certain features of the physical stimulus. For example, a missing fundamental stimulus with resolved or unresolved harmonics, or a low and high-level amplitude stimulus with the same spectral content–these all give rise to the same percept of pitch. In contrast, the AN representations for these different stimuli are not invariant to these effects. In fact, due to saturation and non-linearity of both cochlear and inner hair cells responses, these differences are enhanced by the AN fibers. Thus there is a difficulty in explaining how pitch percept arises from the activity of the AN fibers. We introduce a novel approach for extracting pitch cues from the AN population activity for a given arbitrary stimulus. The method is based on a technique known as sparse coding (SC). It is the representation of pitch cues by a few spatiotemporal atoms (templates) from among a large set of possible ones (a dictionary). The amount of activity of each atom is represented by a non-zero coefficient, analogous to an active neuron. Such a technique has been successfully applied to other modalities, particularly vision. The model is composed of a cochlear model, an SC processing unit, and a harmonic sieve. We show that the model copes with different pitch phenomena: extracting resolved and non-resolved harmonics, missing fundamental pitches, stimuli with both high and low amplitudes, iterated rippled noises, and recorded musical instruments. Public Library of Science 2017-01-18 /pmc/articles/PMC5308863/ /pubmed/28099436 http://dx.doi.org/10.1371/journal.pcbi.1005338 Text en © 2017 Barzelay et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Barzelay, Oded
Furst, Miriam
Barak, Omri
A New Approach to Model Pitch Perception Using Sparse Coding
title A New Approach to Model Pitch Perception Using Sparse Coding
title_full A New Approach to Model Pitch Perception Using Sparse Coding
title_fullStr A New Approach to Model Pitch Perception Using Sparse Coding
title_full_unstemmed A New Approach to Model Pitch Perception Using Sparse Coding
title_short A New Approach to Model Pitch Perception Using Sparse Coding
title_sort new approach to model pitch perception using sparse coding
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5308863/
https://www.ncbi.nlm.nih.gov/pubmed/28099436
http://dx.doi.org/10.1371/journal.pcbi.1005338
work_keys_str_mv AT barzelayoded anewapproachtomodelpitchperceptionusingsparsecoding
AT furstmiriam anewapproachtomodelpitchperceptionusingsparsecoding
AT barakomri anewapproachtomodelpitchperceptionusingsparsecoding
AT barzelayoded newapproachtomodelpitchperceptionusingsparsecoding
AT furstmiriam newapproachtomodelpitchperceptionusingsparsecoding
AT barakomri newapproachtomodelpitchperceptionusingsparsecoding