Cargando…
Natural-Language-Driven Multimodal Representation Learning for Audio-Visual Scene-Aware Dialog System
With the development of multimedia systems in wireless environments, the rising need for artificial intelligence is to design a system that can properly communicate with humans with a comprehensive understanding of various types of information in a human-like manner. Therefore, this paper addresses...
Autores principales: | Heo, Yoonseok, Kang, Sangwoo, Seo, Jungyun |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10536977/ https://www.ncbi.nlm.nih.gov/pubmed/37765933 http://dx.doi.org/10.3390/s23187875 |
Ejemplares similares
-
Long-term memory representations for audio-visual scenes
por: Meyerhoff, Hauke S., et al.
Publicado: (2022) -
An Efficient Framework for Development of Task-Oriented Dialog Systems in a Smart Home Environment
por: Park, Youngmin, et al.
Publicado: (2018) -
A3CarScene: An audio-visual dataset for driving scene understanding
por: Cantarini, Michela, et al.
Publicado: (2023) -
Spoken natural language dialog systems : a practical approach
por: Smith Ronnie W
Publicado: (1944) -
Spoken natural language dialog systems: a practical approach
por: Smith, Ronnie W, et al.
Publicado: (1994)