Lipreading Architecture Based on Multiple Convolutional Neural Networks for Sentence-Level Visual Speech Recognition
In visual speech recognition (VSR), speech is transcribed using only visual information to interpret tongue and teeth movements. Recently, deep learning has shown outstanding performance in VSR, with accuracy exceeding that of lipreaders on benchmark datasets. However, several problems still exist w...
Main Authors: Jeon, Sanghun; Elsharkawy, Ahmed; Kim, Mun Sang
Format: Online Article Text
Language: English
Published: MDPI, 2021
Online Access:
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8747278/
https://www.ncbi.nlm.nih.gov/pubmed/35009612
http://dx.doi.org/10.3390/s22010072
Similar Items
- End-to-End Sentence-Level Multi-View Lipreading Architecture with Spatial Attention Module Integrated Multiple CNNs and Cascaded Local Self-Attention-CTC
  by: Jeon, Sanghun, et al.
  Published: (2022)
- Noise-Robust Multimodal Audio-Visual Speech Recognition System for Speech-Based Interaction Applications
  by: Jeon, Sanghun, et al.
  Published: (2022)
- Visual Cortical Entrainment to Motion and Categorical Speech Features during Silent Lipreading
  by: O’Sullivan, Aisling E., et al.
  Published: (2017)
- End-to-End Lip-Reading Open Cloud-Based Speech Architecture
  by: Jeon, Sanghun, et al.
  Published: (2022)
- The Neural Basis of Speech Perception through Lipreading and Manual Cues: Evidence from Deaf Native Users of Cued Speech
  by: Aparicio, Mario, et al.
  Published: (2017)