Cargando…

Lipreading Architecture Based on Multiple Convolutional Neural Networks for Sentence-Level Visual Speech Recognition

In visual speech recognition (VSR), speech is transcribed using only visual information to interpret tongue and teeth movements. Recently, deep learning has shown outstanding performance in VSR, with accuracy exceeding that of lipreaders on benchmark datasets. However, several problems still exist w...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jeon, Sanghun, Elsharkawy, Ahmed, Kim, Mun Sang
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8747278/ https://www.ncbi.nlm.nih.gov/pubmed/35009612 http://dx.doi.org/10.3390/s22010072

Internet

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8747278/
https://www.ncbi.nlm.nih.gov/pubmed/35009612
http://dx.doi.org/10.3390/s22010072

Lipreading Architecture Based on Multiple Convolutional Neural Networks for Sentence-Level Visual Speech Recognition

Internet

Ejemplares similares