What the Appearance Channel from Two-Stream Architectures for Activity Recognition Is Learning?

The automatic recognition of human activities from video data is being led by spatio-temporal Convolutional Neural Networks (3D CNNs), in particular two-stream architectures such as I3D, which reports the best accuracy so far. Despite the high accuracy of this kind of architecture, ver...

Bibliographic Details
Main authors: Oves García, Reinier; Sucar, L. Enrique
Format: Online Article Text
Language: English
Published: 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7297582/
http://dx.doi.org/10.1007/978-3-030-49076-8_24