Reliability-Based Large-Vocabulary Audio-Visual Speech Recognition
Audio-visual speech recognition (AVSR) can significantly improve performance over audio-only recognition for small or medium vocabularies. However, current AVSR, whether hybrid or end-to-end (E2E), still does not appear to make optimal use of this secondary information stream as the performance is s...
Main authors: | Yu, Wentao; Zeiler, Steffen; Kolossa, Dorothea |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2022 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9370936/ https://www.ncbi.nlm.nih.gov/pubmed/35898005 http://dx.doi.org/10.3390/s22155501 |
Similar items
- Audio-Visual Speech and Gesture Recognition by Sensors of Mobile Devices
  by: Ryumin, Dmitry, et al.
  Published: (2023)
- Noise-Robust Multimodal Audio-Visual Speech Recognition System for Speech-Based Interaction Applications
  by: Jeon, Sanghun, et al.
  Published: (2022)
- Deep Spiking Neural Networks for Large Vocabulary Automatic Speech Recognition
  by: Wu, Jibin, et al.
  Published: (2020)
- Using Morphological Data in Language Modeling for Serbian Large Vocabulary Speech Recognition
  by: Pakoci, Edvin, et al.
  Published: (2019)
- Audio-Visual Speech Cue Combination
  by: Arnold, Derek H., et al.
  Published: (2010)