A Review of Multi-Modal Learning from the Text-Guided Visual Processing Viewpoint
For decades, correlating different data domains to attain the maximum potential of machines has driven research, especially in neural networks. Text and visual data (images and videos) are two such distinct data domains, each extensively researched in the past. Recently, using natural language to p...
Main authors: | |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2022 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9503702/ https://www.ncbi.nlm.nih.gov/pubmed/36146161 http://dx.doi.org/10.3390/s22186816 |