Statistical and Visual Analysis of Audio, Text, and Image Features for Multi-Modal Music Genre Recognition
We present a multi-modal genre recognition framework that considers the modalities audio, text, and image by features extracted from audio signals, album cover images, and lyrics of music tracks. In contrast to pure learning of features by a neural network as done in the related work, handcrafted fe...
Main authors: | , , |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2021 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8621318/ https://www.ncbi.nlm.nih.gov/pubmed/34828199 http://dx.doi.org/10.3390/e23111502 |