Hybrid Attention based Multimodal Network for Spoken Language Classification

We examine the utility of linguistic content and vocal characteristics for multimodal deep learning in human spoken language understanding. We present a deep multimodal network with both feature attention and modality attention to classify utterance-level speech data. The proposed hybrid attention a...

Bibliographic Details
Main Authors: Gu, Yue; Yang, Kangning; Fu, Shiyu; Chen, Shuhong; Li, Xinyu; Marsic, Ivan
Format: Online Article Text
Language: English
Published: 2018
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6217979/
https://www.ncbi.nlm.nih.gov/pubmed/30410219