Cargando…

ACG-EmoCluster: A Novel Framework to Capture Spatial and Temporal Information from Emotional Speech Enhanced by DeepCluster

Speech emotion recognition (SER) is a task that tailors a matching function between the speech features and the emotion labels. Speech data have higher information saturation than images and stronger temporal coherence than text. This makes entirely and effectively learning speech features challengi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhao, Huan, Li, Lixuan, Zha, Xupeng, Wang, Yujiang, Xie, Zhaoxin, Zhang, Zixing
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10223526/ https://www.ncbi.nlm.nih.gov/pubmed/37430691 http://dx.doi.org/10.3390/s23104777

Internet

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10223526/
https://www.ncbi.nlm.nih.gov/pubmed/37430691
http://dx.doi.org/10.3390/s23104777

ACG-EmoCluster: A Novel Framework to Capture Spatial and Temporal Information from Emotional Speech Enhanced by DeepCluster

Internet

Ejemplares similares