Cargando…
Lip Reading by Alternating between Spatiotemporal and Spatial Convolutions
Lip reading (LR) is the task of predicting the speech utilizing only the visual information of the speaker. In this work, for the first time, the benefits of alternating between spatiotemporal and spatial convolutions for learning effective features from the LR sequences are studied. In this context...
Autores principales: | Tsourounis, Dimitrios, Kastaniotis, Dimitris, Fotopoulos, Spiros |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321361/ https://www.ncbi.nlm.nih.gov/pubmed/34460687 http://dx.doi.org/10.3390/jimaging7050091 |
Ejemplares similares
-
SIFT-CNN: When Convolutional Neural Networks Meet Dense SIFT Descriptors for Image and Sequence Classification
por: Tsourounis, Dimitrios, et al.
Publicado: (2022) -
Lip Reading in the Seventeenth Century
Publicado: (1904) -
Egocentric Gesture Recognition Using 3D Convolutional Neural Networks for the Spatiotemporal Adaptation of Collaborative Robots
por: Papanagiotou, Dimitris, et al.
Publicado: (2021) -
Lip reading role in the hearing aid fitting process
por: Bannwart Dell'Aringa, Ana Helena, et al.
Publicado: (2015) -
Single cell and spatial alternative splicing analysis with long read sequencing
por: Fu, Yuntian, et al.
Publicado: (2023)