Cargando…
Attention-Based Fusion of Ultrashort Voice Utterances and Depth Videos for Multimodal Person Identification
Multimodal deep learning, in the context of biometrics, encounters significant challenges due to the dependence on long speech utterances and RGB images, which are often impractical in certain situations. This paper presents a novel solution addressing these issues by leveraging ultrashort voice utt...
Autores principales: | Moufidi, Abderrazzaq, Rousseau, David, Rasti, Pejman |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10346165/ https://www.ncbi.nlm.nih.gov/pubmed/37447739 http://dx.doi.org/10.3390/s23135890 |
Ejemplares similares
-
Enhancing the Tracking of Seedling Growth Using RGB-Depth Fusion and Deep Learning
por: Garbouge, Hadhami, et al.
Publicado: (2021) -
Two-Stage Voice Application Recommender System for Unhandled Utterances in Intelligent Personal Assistant
por: Xiao, Wei, et al.
Publicado: (2022) -
Oscillatory brain responses to own names uttered by unfamiliar and familiar voices
por: del Giudice, Renata, et al.
Publicado: (2014) -
Integrating Voice Quality Cues in the Pitch Perception of Speech and Non-speech Utterances
por: Kuang, Jianjing, et al.
Publicado: (2018) -
Hierarchical Attention-Based Multimodal Fusion Network for Video Emotion Recognition
por: Liu, Xiaodong, et al.
Publicado: (2021)