Cargando…

A Review of Multi-Modal Learning from the Text-Guided Visual Processing Viewpoint

For decades, co-relating different data domains to attain the maximum potential of machines has driven research, especially in neural networks. Similarly, text and visual data (images and videos) are two distinct data domains with extensive research in the past. Recently, using natural language to p...

Descripción completa

Detalles Bibliográficos
Autores principales: Ullah, Ubaid, Lee, Jeong-Sik, An, Chang-Hyeon, Lee, Hyeonjin, Park, Su-Yeong, Baek, Rock-Hyun, Choi, Hyun-Chul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9503702/
https://www.ncbi.nlm.nih.gov/pubmed/36146161
http://dx.doi.org/10.3390/s22186816