Cargando…
ChoCo: a Chord Corpus and a Data Transformation Workflow for Musical Harmony Knowledge Graphs
Various disconnected chord datasets are currently available for music analysis and information retrieval, but they are often limited by either their size, non-openness, lack of timed information, and interoperability. Together with the lack of overlapping repertoire coverage, this limits cross-corpu...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10511441/ https://www.ncbi.nlm.nih.gov/pubmed/37730822 http://dx.doi.org/10.1038/s41597-023-02410-w |
_version_ | 1785108140959727616 |
---|---|
author | de Berardinis, Jacopo Meroño-Peñuela, Albert Poltronieri, Andrea Presutti, Valentina |
author_facet | de Berardinis, Jacopo Meroño-Peñuela, Albert Poltronieri, Andrea Presutti, Valentina |
author_sort | de Berardinis, Jacopo |
collection | PubMed |
description | Various disconnected chord datasets are currently available for music analysis and information retrieval, but they are often limited by either their size, non-openness, lack of timed information, and interoperability. Together with the lack of overlapping repertoire coverage, this limits cross-corpus studies on harmony over time and across genres, and hampers research in computational music analysis (chord recognition, pattern mining, computational creativity), which needs access to large datasets. We contribute to address this gap, by releasing the Chord Corpus (ChoCo), a large-scale dataset that semantically integrates harmonic data from 18 different sources using heterogeneous representations and formats (Harte, Leadsheet, Roman numerals, ABC, etc.). We rely on JAMS (JSON Annotated Music Specification), a popular data structure for annotations in Music Information Retrieval, to represent and enrich chord-related information (chord, key, mode, etc.) in a uniform way. To achieve semantic integration, we design a novel ontology for modelling music annotations and the entities they involve (artists, scores, etc.), and we build a 30M-triple knowledge graph, including 4 K+ links to other datasets (MIDI-LD, LED). |
format | Online Article Text |
id | pubmed-10511441 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-105114412023-09-22 ChoCo: a Chord Corpus and a Data Transformation Workflow for Musical Harmony Knowledge Graphs de Berardinis, Jacopo Meroño-Peñuela, Albert Poltronieri, Andrea Presutti, Valentina Sci Data Data Descriptor Various disconnected chord datasets are currently available for music analysis and information retrieval, but they are often limited by either their size, non-openness, lack of timed information, and interoperability. Together with the lack of overlapping repertoire coverage, this limits cross-corpus studies on harmony over time and across genres, and hampers research in computational music analysis (chord recognition, pattern mining, computational creativity), which needs access to large datasets. We contribute to address this gap, by releasing the Chord Corpus (ChoCo), a large-scale dataset that semantically integrates harmonic data from 18 different sources using heterogeneous representations and formats (Harte, Leadsheet, Roman numerals, ABC, etc.). We rely on JAMS (JSON Annotated Music Specification), a popular data structure for annotations in Music Information Retrieval, to represent and enrich chord-related information (chord, key, mode, etc.) in a uniform way. To achieve semantic integration, we design a novel ontology for modelling music annotations and the entities they involve (artists, scores, etc.), and we build a 30M-triple knowledge graph, including 4 K+ links to other datasets (MIDI-LD, LED). Nature Publishing Group UK 2023-09-20 /pmc/articles/PMC10511441/ /pubmed/37730822 http://dx.doi.org/10.1038/s41597-023-02410-w Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Data Descriptor de Berardinis, Jacopo Meroño-Peñuela, Albert Poltronieri, Andrea Presutti, Valentina ChoCo: a Chord Corpus and a Data Transformation Workflow for Musical Harmony Knowledge Graphs |
title | ChoCo: a Chord Corpus and a Data Transformation Workflow for Musical Harmony Knowledge Graphs |
title_full | ChoCo: a Chord Corpus and a Data Transformation Workflow for Musical Harmony Knowledge Graphs |
title_fullStr | ChoCo: a Chord Corpus and a Data Transformation Workflow for Musical Harmony Knowledge Graphs |
title_full_unstemmed | ChoCo: a Chord Corpus and a Data Transformation Workflow for Musical Harmony Knowledge Graphs |
title_short | ChoCo: a Chord Corpus and a Data Transformation Workflow for Musical Harmony Knowledge Graphs |
title_sort | choco: a chord corpus and a data transformation workflow for musical harmony knowledge graphs |
topic | Data Descriptor |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10511441/ https://www.ncbi.nlm.nih.gov/pubmed/37730822 http://dx.doi.org/10.1038/s41597-023-02410-w |
work_keys_str_mv | AT deberardinisjacopo chocoachordcorpusandadatatransformationworkflowformusicalharmonyknowledgegraphs AT meronopenuelaalbert chocoachordcorpusandadatatransformationworkflowformusicalharmonyknowledgegraphs AT poltronieriandrea chocoachordcorpusandadatatransformationworkflowformusicalharmonyknowledgegraphs AT presuttivalentina chocoachordcorpusandadatatransformationworkflowformusicalharmonyknowledgegraphs |