Cargando…

WLiT: Windows and Linear Transformer for Video Action Recognition

The emergence of Transformer has led to the rapid development of video understanding, but it also brings the problem of high computational complexity. Previously, there were methods to divide the feature maps into windows along the spatiotemporal dimensions and then calculate the attention. There ar...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Ruoxi, Zhang, Tianzhao, Wan, Yong, Zhang, Fuping, Wei, Jianming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9919352/
https://www.ncbi.nlm.nih.gov/pubmed/36772658
http://dx.doi.org/10.3390/s23031616