Cargando…

Video Scene Detection Using Transformer Encoding Linker Network (TELNet)

This paper introduces a transformer encoding linker network (TELNet) for automatically identifying scene boundaries in videos without prior knowledge of their structure. Videos consist of sequences of semantically related shots or chapters, and recognizing scene boundaries is crucial for various vid...

Descripción completa

Detalles Bibliográficos
Autores principales: Tseng, Shu-Ming, Yeh, Zhi-Ting, Wu, Chia-Yang, Chang, Jia-Bin, Norouzi, Mehdi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10458897/
https://www.ncbi.nlm.nih.gov/pubmed/37631590
http://dx.doi.org/10.3390/s23167050