Cargando…
Music Similarity Detection Guided by Deep Learning Model
Digital music has become a hot spot with the rapid development of network technology and digital audio technology. The general public is increasingly interested in music similarity detection (MSD). Similarity detection is mainly for music style classification. The core MSD process is to first extrac...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9970704/ https://www.ncbi.nlm.nih.gov/pubmed/36860420 http://dx.doi.org/10.1155/2023/1263620 |
Sumario: | Digital music has become a hot spot with the rapid development of network technology and digital audio technology. The general public is increasingly interested in music similarity detection (MSD). Similarity detection is mainly for music style classification. The core MSD process is to first extract music features, then implement training modeling, and finally input music features into the model for detection. Deep learning (DL) is a relatively new feature extraction technology to improve the extraction efficiency of music features. This paper first introduces the convolutional neural network (CNN) of DL algorithms and MSD. Then, an MSD algorithm is constructed based on CNN. Besides, the Harmony and Percussive Source Separation (HPSS) algorithm separates the original music signal spectrogram and decomposes it into two components: time characteristic harmonics and frequency characteristic shocks. These two elements are input to the CNN together with the data in the original spectrogram for processing. In addition, the training-related hyperparameters are adjusted, and the dataset is expanded to explore the influence of different parameters in the network structure on the music detection rate. Experiments on the GTZAN Genre Collection music dataset show that this method can effectively improve MSD using a single feature. The final detection result is 75.6%, indicating the superiority of this method compared with other classical detection methods. |
---|