WLiT: Windows and Linear Transformer for Video Action Recognition
The emergence of Transformer has led to the rapid development of video understanding, but it also brings the problem of high computational complexity. Previously, there were methods to divide the feature maps into windows along the spatiotemporal dimensions and then calculate the attention. There ar...
Main authors: | |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2023 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9919352/ https://www.ncbi.nlm.nih.gov/pubmed/36772658 http://dx.doi.org/10.3390/s23031616 |