Cargando…

Music Similarity Detection Guided by Deep Learning Model

Digital music has become a hot spot with the rapid development of network technology and digital audio technology. The general public is increasingly interested in music similarity detection (MSD). Similarity detection is mainly for music style classification. The core MSD process is to first extrac...

Descripción completa

Detalles Bibliográficos
Autor principal: Wang, Xiuli
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9970704/
https://www.ncbi.nlm.nih.gov/pubmed/36860420
http://dx.doi.org/10.1155/2023/1263620
Descripción
Sumario:Digital music has become a hot spot with the rapid development of network technology and digital audio technology. The general public is increasingly interested in music similarity detection (MSD). Similarity detection is mainly for music style classification. The core MSD process is to first extract music features, then implement training modeling, and finally input music features into the model for detection. Deep learning (DL) is a relatively new feature extraction technology to improve the extraction efficiency of music features. This paper first introduces the convolutional neural network (CNN) of DL algorithms and MSD. Then, an MSD algorithm is constructed based on CNN. Besides, the Harmony and Percussive Source Separation (HPSS) algorithm separates the original music signal spectrogram and decomposes it into two components: time characteristic harmonics and frequency characteristic shocks. These two elements are input to the CNN together with the data in the original spectrogram for processing. In addition, the training-related hyperparameters are adjusted, and the dataset is expanded to explore the influence of different parameters in the network structure on the music detection rate. Experiments on the GTZAN Genre Collection music dataset show that this method can effectively improve MSD using a single feature. The final detection result is 75.6%, indicating the superiority of this method compared with other classical detection methods.