Cargando…

Statistical and Visual Analysis of Audio, Text, and Image Features for Multi-Modal Music Genre Recognition

We present a multi-modal genre recognition framework that considers the modalities audio, text, and image by features extracted from audio signals, album cover images, and lyrics of music tracks. In contrast to pure learning of features by a neural network as done in the related work, handcrafted fe...

Descripción completa

Detalles Bibliográficos
Autores principales: Wilkes, Ben, Vatolkin, Igor, Müller, Heinrich
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8621318/
https://www.ncbi.nlm.nih.gov/pubmed/34828199
http://dx.doi.org/10.3390/e23111502