A Facial Feature and Lip Movement Enhanced Audio-Visual Speech Separation Model
The cocktail party problem can be addressed more effectively by combining the speaker's visual and audio information. This paper proposes a method that improves audio separation using two visual cues: facial features and lip movement. Firstly, residual connections are introduced in the audio sep...
Main authors: | Li, Guizhu; Fu, Min; Sun, Mengnan; Liu, Xuefeng; Zheng, Bing |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | MDPI 2023 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10647675/ https://www.ncbi.nlm.nih.gov/pubmed/37960477 http://dx.doi.org/10.3390/s23218770 |
Similar items
- Audio source separation and speech enhancement
  by: Vincent, Emmanuel, et al.
  Published: (2018)
- Audio-Visual Speech Cue Combination
  by: Arnold, Derek H., et al.
  Published: (2010)
- Differential Auditory and Visual Phase-Locking Are Observed during Audio-Visual Benefit and Silent Lip-Reading for Speech Perception
  by: Aller, Máté, et al.
  Published: (2022)
- Audio-Visual Speech Timing Sensitivity Is Enhanced in Cluttered Conditions
  by: Roseboom, Warrick, et al.
  Published: (2011)
- Talker variability in audio-visual speech perception
  by: Heald, Shannon L. M., et al.
  Published: (2014)