Cargando…

Guitar Chord Sensing and Recognition Using Multi-Task Learning and Physical Data Augmentation with Robotics

In recent years, many researchers have shown increasing interest in music information retrieval (MIR) applications, with automatic chord recognition being one of the popular tasks. Many studies have achieved/demonstrated considerable improvement using deep learning based models in automatic chord re...

Descripción completa

Detalles Bibliográficos
Autores principales: Byambatsogt, Gerelmaa, Choimaa, Lodoiravsal, Koutaki, Gou
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7663498/
https://www.ncbi.nlm.nih.gov/pubmed/33114599
http://dx.doi.org/10.3390/s20216077
_version_ 1783609640631664640
author Byambatsogt, Gerelmaa
Choimaa, Lodoiravsal
Koutaki, Gou
author_facet Byambatsogt, Gerelmaa
Choimaa, Lodoiravsal
Koutaki, Gou
author_sort Byambatsogt, Gerelmaa
collection PubMed
description In recent years, many researchers have shown increasing interest in music information retrieval (MIR) applications, with automatic chord recognition being one of the popular tasks. Many studies have achieved/demonstrated considerable improvement using deep learning based models in automatic chord recognition problems. However, most of the existing models have focused on simple chord recognition, which classifies the root note with the major, minor, and seventh chords. Furthermore, in learning-based recognition, it is critical to collect high-quality and large amounts of training data to achieve the desired performance. In this paper, we present a multi-task learning (MTL) model for a guitar chord recognition task, where the model is trained using a relatively large-vocabulary guitar chord dataset. To solve data scarcity issues, a physical data augmentation method that directly records the chord dataset from a robotic performer is employed. Deep learning based MTL is proposed to improve the performance of automatic chord recognition with the proposed physical data augmentation dataset. The proposed MTL model is compared with four baseline models and its corresponding single-task learning model using two types of datasets, including a human dataset and a human combined with the augmented dataset. The proposed methods outperform the baseline models, and the results show that most scores of the proposed multi-task learning model are better than those of the corresponding single-task learning model. The experimental results demonstrate that physical data augmentation is an effective method for increasing the dataset size for guitar chord recognition tasks.
format Online
Article
Text
id pubmed-7663498
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-76634982020-11-14 Guitar Chord Sensing and Recognition Using Multi-Task Learning and Physical Data Augmentation with Robotics Byambatsogt, Gerelmaa Choimaa, Lodoiravsal Koutaki, Gou Sensors (Basel) Letter In recent years, many researchers have shown increasing interest in music information retrieval (MIR) applications, with automatic chord recognition being one of the popular tasks. Many studies have achieved/demonstrated considerable improvement using deep learning based models in automatic chord recognition problems. However, most of the existing models have focused on simple chord recognition, which classifies the root note with the major, minor, and seventh chords. Furthermore, in learning-based recognition, it is critical to collect high-quality and large amounts of training data to achieve the desired performance. In this paper, we present a multi-task learning (MTL) model for a guitar chord recognition task, where the model is trained using a relatively large-vocabulary guitar chord dataset. To solve data scarcity issues, a physical data augmentation method that directly records the chord dataset from a robotic performer is employed. Deep learning based MTL is proposed to improve the performance of automatic chord recognition with the proposed physical data augmentation dataset. The proposed MTL model is compared with four baseline models and its corresponding single-task learning model using two types of datasets, including a human dataset and a human combined with the augmented dataset. The proposed methods outperform the baseline models, and the results show that most scores of the proposed multi-task learning model are better than those of the corresponding single-task learning model. The experimental results demonstrate that physical data augmentation is an effective method for increasing the dataset size for guitar chord recognition tasks. MDPI 2020-10-26 /pmc/articles/PMC7663498/ /pubmed/33114599 http://dx.doi.org/10.3390/s20216077 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Letter
Byambatsogt, Gerelmaa
Choimaa, Lodoiravsal
Koutaki, Gou
Guitar Chord Sensing and Recognition Using Multi-Task Learning and Physical Data Augmentation with Robotics
title Guitar Chord Sensing and Recognition Using Multi-Task Learning and Physical Data Augmentation with Robotics
title_full Guitar Chord Sensing and Recognition Using Multi-Task Learning and Physical Data Augmentation with Robotics
title_fullStr Guitar Chord Sensing and Recognition Using Multi-Task Learning and Physical Data Augmentation with Robotics
title_full_unstemmed Guitar Chord Sensing and Recognition Using Multi-Task Learning and Physical Data Augmentation with Robotics
title_short Guitar Chord Sensing and Recognition Using Multi-Task Learning and Physical Data Augmentation with Robotics
title_sort guitar chord sensing and recognition using multi-task learning and physical data augmentation with robotics
topic Letter
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7663498/
https://www.ncbi.nlm.nih.gov/pubmed/33114599
http://dx.doi.org/10.3390/s20216077
work_keys_str_mv AT byambatsogtgerelmaa guitarchordsensingandrecognitionusingmultitasklearningandphysicaldataaugmentationwithrobotics
AT choimaalodoiravsal guitarchordsensingandrecognitionusingmultitasklearningandphysicaldataaugmentationwithrobotics
AT koutakigou guitarchordsensingandrecognitionusingmultitasklearningandphysicaldataaugmentationwithrobotics