End-to-End Sentence-Level Multi-View Lipreading Architecture with Spatial Attention Module Integrated Multiple CNNs and Cascaded Local Self-Attention-CTC
Concomitant with the recent advances in deep learning, automatic speech recognition and visual speech recognition (VSR) have received considerable attention. However, although VSR systems must identify speech from both frontal and profile faces in real-world scenarios, most VSR studies have focused...
Main authors: | Jeon, Sanghun; Kim, Mun Sang |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2022 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9099765/ https://www.ncbi.nlm.nih.gov/pubmed/35591284 http://dx.doi.org/10.3390/s22093597 |
Similar items
- Lipreading Architecture Based on Multiple Convolutional Neural Networks for Sentence-Level Visual Speech Recognition
  by: Jeon, Sanghun, et al.
  Published: (2021)
- End-to-End Automatic Pronunciation Error Detection Based on Improved Hybrid CTC/Attention Architecture
  by: Zhang, Long, et al.
  Published: (2020)
- End-to-End Lip-Reading Open Cloud-Based Speech Architecture
  by: Jeon, Sanghun, et al.
  Published: (2022)
- Improving Hybrid CTC/Attention Architecture for Agglutinative Language Speech Recognition
  by: Ren, Zeyu, et al.
  Published: (2022)
- End-To-End Deep Learning Architecture for Continuous Blood Pressure Estimation Using Attention Mechanism
  by: Eom, Heesang, et al.
  Published: (2020)