A Review of Multi-Modal Learning from the Text-Guided Visual Processing Viewpoint

For decades, correlating different data domains to attain the maximum potential of machines has driven research, especially in neural networks. Similarly, text and visual data (images and videos) are two distinct data domains that have each been researched extensively. Recently, using natural language to p...

Bibliographic Details
Main authors:	Ullah, Ubaid; Lee, Jeong-Sik; An, Chang-Hyeon; Lee, Hyeonjin; Park, Su-Yeong; Baek, Rock-Hyun; Choi, Hyun-Chul
Format:	Online Article Text
Language:	English
Published:	MDPI 2022
Subjects:	Review
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9503702/ https://www.ncbi.nlm.nih.gov/pubmed/36146161 http://dx.doi.org/10.3390/s22186816

Similar items

Learning-Based Ordering Characters on Ancient Document
by: Lee, Hyeonjin, et al.
Published: (2022)

Arbitrary Font Generation by Encoder Learning of Disentangled Features
by: Lee, Jeong-Sik, et al.
Published: (2022)

Cross-Modal Object Recognition Is Viewpoint-Independent
by: Lacey, Simon, et al.
Published: (2007)

Statistical and Visual Analysis of Audio, Text, and Image Features for Multi-Modal Music Genre Recognition
by: Wilkes, Ben, et al.
Published: (2021)

Multi-View Visual Question Answering with Active Viewpoint Selection
by: Qiu, Yue, et al.
Published: (2020)

A Viewpoint on Treatment of Traumatic Bilateral Basal Ganglia Hemorrhage in a Child: Case Report
by: Baek, Kyeong Hee, et al.
Published: (2016)

Retethering: A Neurosurgical Viewpoint
by: Lee, Ji Yeoun, et al.
Published: (2020)

Manipulation Direction: Evaluating Text-Guided Image Manipulation Based on Similarity between Changes in Image and Text Modalities
by: Watanabe, Yuto, et al.
Published: (2023)

An effective assessment of cluster tendency through sampling based multi-viewpoints visual method
by: Prasad, K. Rajendra, et al.
Published: (2021)

[Formula: see text]: Similarity-Aware Multi-modal Fake News Detection
by: Zhou, Xinyi, et al.
Published: (2020)

Object-Level Visual-Text Correlation Graph Hashing for Unsupervised Cross-Modal Retrieval
by: Shi, Ge, et al.
Published: (2022)

Cross-Modal and Intra-Modal Characteristics of Visual Function and Speech Perception Performance in Postlingually Deafened, Cochlear Implant Users
by: Kim, Min-Beom, et al.
Published: (2016)

Benchmark dataset of memes with text transcriptions for automatic detection of multi-modal misogynistic content
by: Gasparini, Francesca, et al.
Published: (2022)

Sensitivity of Inner Spacer Thickness Variations for Sub-3-nm Node Silicon Nanosheet Field-Effect Transistors
by: Lee, Sanguk, et al.
Published: (2022)

Visual Sensory Experiences From the Viewpoint of Autistic Adults
by: Parmar, Ketan R., et al.
Published: (2021)

Visual Scene-Aware Hybrid and Multi-Modal Feature Aggregation for Facial Expression Recognition †
by: Lee, Min Kyu, et al.
Published: (2020)

Multi-modal adaptive gated mechanism for visual question answering
by: Xu, Yangshuyi, et al.
Published: (2023)

Multi-modal recommendation algorithm fusing visual and textual features
by: Hu, Xuefeng, et al.
Published: (2023)

Multi-modal chemical information reconstruction from images and texts for exploring the near-drug space
by: Wang, Jie, et al.
Published: (2022)

Analysis on the Effectiveness and Characteristics of Treatment Modalities for Bowen’s Disease: An Observational Study
by: Park, Hae-Eun, et al.
Published: (2022)

Novel Modeling Approach to Analyze Threshold Voltage Variability in Short Gate-Length (15–22 nm) Nanowire FETs with Various Channel Diameters
by: Lee, Seunghwan, et al.
Published: (2022)

Visualizing with text
by: Brath, Richard
Published: (2020)

Viewpoint
Published: (2017)

Viewpoint
Published: (2017)

Viewpoint
Published: (2017)

Viewpoint
Published: (2017)

Viewpoint
Published: (2017)

Viewpoint
Published: (2017)

Viewpoint
Published: (1995)

Viewpoint
Published: (1995)

Viewpoint
Published: (2002)

Viewpoint
Published: (2002)

Viewpoint
Published: (2002)

Viewpoint
Published: (2002)

Viewpoint
Published: (2003)

Viewpoint
Published: (2000)

Viewpoint
Published: (2002)

Viewpoint
Published: (2002)

Viewpoint
Published: (2002)

Viewpoint
Published: (2003)