UAT: Universal Attention Transformer for Video Captioning
Video captioning with encoder–decoder structures is a successful approach to sentence generation. In addition, using several feature extraction networks to obtain multiple kinds of visual features during encoding is a standard way to improve model performance...
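To make the abstract's high-level description concrete, here is a minimal sketch, not the paper's UAT model, of an encoder–decoder video captioner that fuses two hypothetical visual feature streams (e.g., 2D appearance and 3D motion features) with a Transformer. All module names, dimensions, and the concatenation-based fusion are illustrative assumptions.

```python
# Minimal sketch of multi-feature encoder-decoder video captioning (assumed
# design, not the UAT architecture from the paper).
import torch
import torch.nn as nn

class MultiFeatureCaptioner(nn.Module):
    def __init__(self, d_app=2048, d_motion=1024, d_model=512,
                 vocab_size=10000, n_heads=8, n_layers=2):
        super().__init__()
        # Project each visual feature stream into a shared model dimension.
        self.proj_app = nn.Linear(d_app, d_model)
        self.proj_motion = nn.Linear(d_motion, d_model)
        self.encoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True),
            num_layers=n_layers)
        self.embed = nn.Embedding(vocab_size, d_model)
        self.decoder = nn.TransformerDecoder(
            nn.TransformerDecoderLayer(d_model, n_heads, batch_first=True),
            num_layers=n_layers)
        self.out = nn.Linear(d_model, vocab_size)

    def forward(self, app_feats, motion_feats, captions):
        # app_feats: (B, T, d_app); motion_feats: (B, T, d_motion)
        # captions: (B, L) token ids, used for teacher forcing.
        visual = torch.cat([self.proj_app(app_feats),
                            self.proj_motion(motion_feats)], dim=1)
        memory = self.encoder(visual)              # (B, 2T, d_model)
        tgt = self.embed(captions)                 # (B, L, d_model)
        L = captions.size(1)
        # Causal mask so each position attends only to earlier tokens.
        tgt_mask = torch.triu(torch.full((L, L), float('-inf')), diagonal=1)
        dec = self.decoder(tgt, memory, tgt_mask=tgt_mask)
        return self.out(dec)                       # (B, L, vocab_size)

# Example forward pass with random features.
model = MultiFeatureCaptioner()
logits = model(torch.randn(2, 20, 2048), torch.randn(2, 20, 1024),
               torch.randint(0, 10000, (2, 12)))
print(logits.shape)  # torch.Size([2, 12, 10000])
```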
Main Authors: | Im, Heeju; Choi, Yong-Suk
---|---
Format: | Online Article Text
Language: | English
Published: | MDPI, 2022
Subjects: |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9269373/ https://www.ncbi.nlm.nih.gov/pubmed/35808316 http://dx.doi.org/10.3390/s22134817
Similar Items
- Hydrocarbon Sorption in Flexible MOFs—Part I: Thermodynamic Analysis with the Dubinin-Based Universal Adsorption Theory (D-UAT)
  by: Preißler-Kurzhöfer, Hannes, et al.
  Published: (2022)
- UAT defined: a guide to practical user acceptance testing as a silver bullet
  by: Cimperman, Rob
  Published: (2006)
- Video captioning with stacked attention and semantic hard pull
  by: Rahman, Md. Mushfiqur, et al.
  Published: (2021)
- Video captioning based on vision transformer and reinforcement learning
  by: Zhao, Hong, et al.
  Published: (2022)
- Lightweight dense video captioning with cross-modal attention and knowledge-enhanced unbiased scene graph
  by: Han, Shixing, et al.
  Published: (2023)