Cargando…

A transformers-based approach for fine and coarse-grained classification and generation of MIDI songs and soundtracks

Music is an extremely subjective art form whose commodification via the recording industry in the 20th century has led to an increasingly subdivided set of genre labels that attempt to organize musical styles into definite categories. Music psychology has been studying the processes through which mu...

Descripción completa

Detalles Bibliográficos
Autores principales:	Angioni, Simone, Lincoln-DeCusatis, Nathan, Ibba, Andrea, Reforgiato Recupero, Diego
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	PeerJ Inc. 2023
Materias:	Artificial Intelligence
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10319258/ https://www.ncbi.nlm.nih.gov/pubmed/37409082 http://dx.doi.org/10.7717/peerj-cs.1410

_version_	1785068209777410048
author	Angioni, Simone Lincoln-DeCusatis, Nathan Ibba, Andrea Reforgiato Recupero, Diego
author_facet	Angioni, Simone Lincoln-DeCusatis, Nathan Ibba, Andrea Reforgiato Recupero, Diego
author_sort	Angioni, Simone
collection	PubMed
description	Music is an extremely subjective art form whose commodification via the recording industry in the 20th century has led to an increasingly subdivided set of genre labels that attempt to organize musical styles into definite categories. Music psychology has been studying the processes through which music is perceived, created, responded to, and incorporated into everyday life, and, modern artificial intelligence technology can be exploited in such a direction. Music classification and generation are emerging fields that gained much attention recently, especially with the latest discoveries within deep learning technologies. Self attention networks have in fact brought huge benefits for several tasks of classification and generation in different domains where data of different types were used (text, images, videos, sounds). In this article, we want to analyze the effectiveness of Transformers for both classification and generation tasks and study the performances of classification at different granularity and of generation using different human and automatic metrics. The input data consist of MIDI sounds that we have considered from different datasets: sounds from 397 Nintendo Entertainment System video games, classical pieces, and rock songs from different composers and bands. We have performed classification tasks within each dataset to identify the types or composers of each sample (fine-grained) and classification at a higher level. In the latter, we combined the three datasets together with the goal of identifying for each sample just NES, rock, or classical (coarse-grained) pieces. The proposed transformers-based approach outperformed competitors based on deep learning and machine learning approaches. Finally, the generation task has been carried out on each dataset and the resulting samples have been evaluated using human and automatic metrics (the local alignment).
format	Online Article Text
id	pubmed-10319258
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	PeerJ Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-103192582023-07-05 A transformers-based approach for fine and coarse-grained classification and generation of MIDI songs and soundtracks Angioni, Simone Lincoln-DeCusatis, Nathan Ibba, Andrea Reforgiato Recupero, Diego PeerJ Comput Sci Artificial Intelligence Music is an extremely subjective art form whose commodification via the recording industry in the 20th century has led to an increasingly subdivided set of genre labels that attempt to organize musical styles into definite categories. Music psychology has been studying the processes through which music is perceived, created, responded to, and incorporated into everyday life, and, modern artificial intelligence technology can be exploited in such a direction. Music classification and generation are emerging fields that gained much attention recently, especially with the latest discoveries within deep learning technologies. Self attention networks have in fact brought huge benefits for several tasks of classification and generation in different domains where data of different types were used (text, images, videos, sounds). In this article, we want to analyze the effectiveness of Transformers for both classification and generation tasks and study the performances of classification at different granularity and of generation using different human and automatic metrics. The input data consist of MIDI sounds that we have considered from different datasets: sounds from 397 Nintendo Entertainment System video games, classical pieces, and rock songs from different composers and bands. We have performed classification tasks within each dataset to identify the types or composers of each sample (fine-grained) and classification at a higher level. In the latter, we combined the three datasets together with the goal of identifying for each sample just NES, rock, or classical (coarse-grained) pieces. The proposed transformers-based approach outperformed competitors based on deep learning and machine learning approaches. Finally, the generation task has been carried out on each dataset and the resulting samples have been evaluated using human and automatic metrics (the local alignment). PeerJ Inc. 2023-06-19 /pmc/articles/PMC10319258/ /pubmed/37409082 http://dx.doi.org/10.7717/peerj-cs.1410 Text en ©2023 Angioni et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle	Artificial Intelligence Angioni, Simone Lincoln-DeCusatis, Nathan Ibba, Andrea Reforgiato Recupero, Diego A transformers-based approach for fine and coarse-grained classification and generation of MIDI songs and soundtracks
title	A transformers-based approach for fine and coarse-grained classification and generation of MIDI songs and soundtracks
title_full	A transformers-based approach for fine and coarse-grained classification and generation of MIDI songs and soundtracks
title_fullStr	A transformers-based approach for fine and coarse-grained classification and generation of MIDI songs and soundtracks
title_full_unstemmed	A transformers-based approach for fine and coarse-grained classification and generation of MIDI songs and soundtracks
title_short	A transformers-based approach for fine and coarse-grained classification and generation of MIDI songs and soundtracks
title_sort	transformers-based approach for fine and coarse-grained classification and generation of midi songs and soundtracks
topic	Artificial Intelligence
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10319258/ https://www.ncbi.nlm.nih.gov/pubmed/37409082 http://dx.doi.org/10.7717/peerj-cs.1410
work_keys_str_mv	AT angionisimone atransformersbasedapproachforfineandcoarsegrainedclassificationandgenerationofmidisongsandsoundtracks AT lincolndecusatisnathan atransformersbasedapproachforfineandcoarsegrainedclassificationandgenerationofmidisongsandsoundtracks AT ibbaandrea atransformersbasedapproachforfineandcoarsegrainedclassificationandgenerationofmidisongsandsoundtracks AT reforgiatorecuperodiego atransformersbasedapproachforfineandcoarsegrainedclassificationandgenerationofmidisongsandsoundtracks AT angionisimone transformersbasedapproachforfineandcoarsegrainedclassificationandgenerationofmidisongsandsoundtracks AT lincolndecusatisnathan transformersbasedapproachforfineandcoarsegrainedclassificationandgenerationofmidisongsandsoundtracks AT ibbaandrea transformersbasedapproachforfineandcoarsegrainedclassificationandgenerationofmidisongsandsoundtracks AT reforgiatorecuperodiego transformersbasedapproachforfineandcoarsegrainedclassificationandgenerationofmidisongsandsoundtracks

A transformers-based approach for fine and coarse-grained classification and generation of MIDI songs and soundtracks

Ejemplares similares