Cargando…
Perceptually Salient Regions of the Modulation Power Spectrum for Musical Instrument Identification
The ability of a listener to recognize sound sources, and in particular musical instruments from the sounds they produce, raises the question of determining the acoustical information used to achieve such a task. It is now well known that the shapes of the temporal and spectral envelopes are crucial...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5390014/ https://www.ncbi.nlm.nih.gov/pubmed/28450846 http://dx.doi.org/10.3389/fpsyg.2017.00587 |
_version_ | 1782521371850440704 |
---|---|
author | Thoret, Etienne Depalle, Philippe McAdams, Stephen |
author_facet | Thoret, Etienne Depalle, Philippe McAdams, Stephen |
author_sort | Thoret, Etienne |
collection | PubMed |
description | The ability of a listener to recognize sound sources, and in particular musical instruments from the sounds they produce, raises the question of determining the acoustical information used to achieve such a task. It is now well known that the shapes of the temporal and spectral envelopes are crucial to the recognition of a musical instrument. More recently, Modulation Power Spectra (MPS) have been shown to be a representation that potentially explains the perception of musical instrument sounds. Nevertheless, the question of which specific regions of this representation characterize a musical instrument is still open. An identification task was applied to two subsets of musical instruments: tuba, trombone, cello, saxophone, and clarinet on the one hand, and marimba, vibraphone, guitar, harp, and viola pizzicato on the other. The sounds were processed with filtered spectrotemporal modulations with 2D Gaussian windows. The most relevant regions of this representation for instrument identification were determined for each instrument and reveal the regions essential for their identification. The method used here is based on a “molecular approach,” the so-called bubbles method. Globally, the instruments were correctly identified and the lower values of spectrotemporal modulations are the most important regions of the MPS for recognizing instruments. Interestingly, instruments that were confused with each other led to non-overlapping regions and were confused when they were filtered in the most salient region of the other instrument. These results suggest that musical instrument timbres are characterized by specific spectrotemporal modulations, information which could contribute to music information retrieval tasks such as automatic source recognition. |
format | Online Article Text |
id | pubmed-5390014 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-53900142017-04-27 Perceptually Salient Regions of the Modulation Power Spectrum for Musical Instrument Identification Thoret, Etienne Depalle, Philippe McAdams, Stephen Front Psychol Psychology The ability of a listener to recognize sound sources, and in particular musical instruments from the sounds they produce, raises the question of determining the acoustical information used to achieve such a task. It is now well known that the shapes of the temporal and spectral envelopes are crucial to the recognition of a musical instrument. More recently, Modulation Power Spectra (MPS) have been shown to be a representation that potentially explains the perception of musical instrument sounds. Nevertheless, the question of which specific regions of this representation characterize a musical instrument is still open. An identification task was applied to two subsets of musical instruments: tuba, trombone, cello, saxophone, and clarinet on the one hand, and marimba, vibraphone, guitar, harp, and viola pizzicato on the other. The sounds were processed with filtered spectrotemporal modulations with 2D Gaussian windows. The most relevant regions of this representation for instrument identification were determined for each instrument and reveal the regions essential for their identification. The method used here is based on a “molecular approach,” the so-called bubbles method. Globally, the instruments were correctly identified and the lower values of spectrotemporal modulations are the most important regions of the MPS for recognizing instruments. Interestingly, instruments that were confused with each other led to non-overlapping regions and were confused when they were filtered in the most salient region of the other instrument. These results suggest that musical instrument timbres are characterized by specific spectrotemporal modulations, information which could contribute to music information retrieval tasks such as automatic source recognition. Frontiers Media S.A. 2017-04-13 /pmc/articles/PMC5390014/ /pubmed/28450846 http://dx.doi.org/10.3389/fpsyg.2017.00587 Text en Copyright © 2017 Thoret, Depalle and McAdams. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Psychology Thoret, Etienne Depalle, Philippe McAdams, Stephen Perceptually Salient Regions of the Modulation Power Spectrum for Musical Instrument Identification |
title | Perceptually Salient Regions of the Modulation Power Spectrum for Musical Instrument Identification |
title_full | Perceptually Salient Regions of the Modulation Power Spectrum for Musical Instrument Identification |
title_fullStr | Perceptually Salient Regions of the Modulation Power Spectrum for Musical Instrument Identification |
title_full_unstemmed | Perceptually Salient Regions of the Modulation Power Spectrum for Musical Instrument Identification |
title_short | Perceptually Salient Regions of the Modulation Power Spectrum for Musical Instrument Identification |
title_sort | perceptually salient regions of the modulation power spectrum for musical instrument identification |
topic | Psychology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5390014/ https://www.ncbi.nlm.nih.gov/pubmed/28450846 http://dx.doi.org/10.3389/fpsyg.2017.00587 |
work_keys_str_mv | AT thoretetienne perceptuallysalientregionsofthemodulationpowerspectrumformusicalinstrumentidentification AT depallephilippe perceptuallysalientregionsofthemodulationpowerspectrumformusicalinstrumentidentification AT mcadamsstephen perceptuallysalientregionsofthemodulationpowerspectrumformusicalinstrumentidentification |