Cargando…

Reliability-Based Large-Vocabulary Audio-Visual Speech Recognition

Audio-visual speech recognition (AVSR) can significantly improve performance over audio-only recognition for small or medium vocabularies. However, current AVSR, whether hybrid or end-to-end (E2E), still does not appear to make optimal use of this secondary information stream as the performance is s...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Wentao, Zeiler, Steffen, Kolossa, Dorothea
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9370936/
https://www.ncbi.nlm.nih.gov/pubmed/35898005
http://dx.doi.org/10.3390/s22155501