Cargando…
Class-dependent and cross-modal memory network considering sentimental features for video-based captioning
The video-based commonsense captioning task aims to add multiple commonsense descriptions to video captions to understand video content better. This paper aims to consider the importance of cross-modal mapping. We propose a combined framework called Class-dependent and Cross-modal Memory Network con...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9975600/ https://www.ncbi.nlm.nih.gov/pubmed/36874867 http://dx.doi.org/10.3389/fpsyg.2023.1124369 |