Multi-Modal Representation via Contrastive Learning with Attention Bottleneck Fusion and Attentive Statistics Features

The integration of information from multiple modalities is a highly active area of research. Previous techniques have predominantly focused on fusing shallow features or high-level representations generated by deep unimodal networks, which only capture a subset of the hierarchical relationships acro...

Bibliographic Details
Main authors: Guo, Qinglang; Liao, Yong; Li, Zhe; Liang, Shenglin
Format: Online Article Text
Language: English
Published: MDPI 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10606612/
https://www.ncbi.nlm.nih.gov/pubmed/37895542
http://dx.doi.org/10.3390/e25101421