Cargando…

Robust Real-Time Music Transcription with a Compositional Hierarchical Model

The paper presents a new compositional hierarchical model for robust music transcription. Its main features are unsupervised learning of a hierarchical representation of input data, transparency, which enables insights into the learned representation, as well as robustness and speed which make it su...

Descripción completa

Detalles Bibliográficos
Autores principales: Pesek, Matevž, Leonardis, Aleš, Marolt, Matija
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5207709/
https://www.ncbi.nlm.nih.gov/pubmed/28046074
http://dx.doi.org/10.1371/journal.pone.0169411
_version_ 1782490418394431488
author Pesek, Matevž
Leonardis, Aleš
Marolt, Matija
author_facet Pesek, Matevž
Leonardis, Aleš
Marolt, Matija
author_sort Pesek, Matevž
collection PubMed
description The paper presents a new compositional hierarchical model for robust music transcription. Its main features are unsupervised learning of a hierarchical representation of input data, transparency, which enables insights into the learned representation, as well as robustness and speed which make it suitable for real-world and real-time use. The model consists of multiple layers, each composed of a number of parts. The hierarchical nature of the model corresponds well to hierarchical structures in music. The parts in lower layers correspond to low-level concepts (e.g. tone partials), while the parts in higher layers combine lower-level representations into more complex concepts (tones, chords). The layers are learned in an unsupervised manner from music signals. Parts in each layer are compositions of parts from previous layers based on statistical co-occurrences as the driving force of the learning process. In the paper, we present the model’s structure and compare it to other hierarchical approaches in the field of music information retrieval. We evaluate the model’s performance for the multiple fundamental frequency estimation. Finally, we elaborate on extensions of the model towards other music information retrieval tasks.
format Online
Article
Text
id pubmed-5207709
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-52077092017-01-19 Robust Real-Time Music Transcription with a Compositional Hierarchical Model Pesek, Matevž Leonardis, Aleš Marolt, Matija PLoS One Research Article The paper presents a new compositional hierarchical model for robust music transcription. Its main features are unsupervised learning of a hierarchical representation of input data, transparency, which enables insights into the learned representation, as well as robustness and speed which make it suitable for real-world and real-time use. The model consists of multiple layers, each composed of a number of parts. The hierarchical nature of the model corresponds well to hierarchical structures in music. The parts in lower layers correspond to low-level concepts (e.g. tone partials), while the parts in higher layers combine lower-level representations into more complex concepts (tones, chords). The layers are learned in an unsupervised manner from music signals. Parts in each layer are compositions of parts from previous layers based on statistical co-occurrences as the driving force of the learning process. In the paper, we present the model’s structure and compare it to other hierarchical approaches in the field of music information retrieval. We evaluate the model’s performance for the multiple fundamental frequency estimation. Finally, we elaborate on extensions of the model towards other music information retrieval tasks. Public Library of Science 2017-01-03 /pmc/articles/PMC5207709/ /pubmed/28046074 http://dx.doi.org/10.1371/journal.pone.0169411 Text en © 2017 Pesek et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Pesek, Matevž
Leonardis, Aleš
Marolt, Matija
Robust Real-Time Music Transcription with a Compositional Hierarchical Model
title Robust Real-Time Music Transcription with a Compositional Hierarchical Model
title_full Robust Real-Time Music Transcription with a Compositional Hierarchical Model
title_fullStr Robust Real-Time Music Transcription with a Compositional Hierarchical Model
title_full_unstemmed Robust Real-Time Music Transcription with a Compositional Hierarchical Model
title_short Robust Real-Time Music Transcription with a Compositional Hierarchical Model
title_sort robust real-time music transcription with a compositional hierarchical model
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5207709/
https://www.ncbi.nlm.nih.gov/pubmed/28046074
http://dx.doi.org/10.1371/journal.pone.0169411
work_keys_str_mv AT pesekmatevz robustrealtimemusictranscriptionwithacompositionalhierarchicalmodel
AT leonardisales robustrealtimemusictranscriptionwithacompositionalhierarchicalmodel
AT maroltmatija robustrealtimemusictranscriptionwithacompositionalhierarchicalmodel