Visual-Text Reference Pretraining Model for Image Captioning
People can accurately describe an image by constantly referring to its visual information and key textual information. Inspired by this idea, we propose the VTR-PTM (Visual-Text Reference Pretraining Model) for image captioning. First, based on the pretraining model (BERT/UniLM), we design...
Main Authors: Li, Pengfei; Zhang, Min; Lin, Peijie; Wan, Jian; Jiang, Ming

Format: Online Article Text

Language: English

Published: Hindawi, 2022

Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8799330/ · https://www.ncbi.nlm.nih.gov/pubmed/35096050 · http://dx.doi.org/10.1155/2022/9400999
Similar Items
- Medical image captioning via generative pretrained transformers
  by: Selivanov, Alexander, et al.
  Published: (2023)
- Hotel Review Classification Based on the Text Pretraining Heterogeneous Graph Neural Network Model
  by: Zhang, Liyan, et al.
  Published: (2022)
- Comparison of Pretraining Models and Strategies for Health-Related Social Media Text Classification
  by: Guo, Yuting, et al.
  Published: (2022)
- An Improved Math Word Problem (MWP) Model Using Unified Pretrained Language Model (UniLM) for Pretraining
  by: Zhang, Dongqiu, et al.
  Published: (2022)
- To pretrain or not? A systematic analysis of the benefits of pretraining in diabetic retinopathy
  by: Srinivasan, Vignesh, et al.
  Published: (2022)