Attention-Based Fusion of Ultrashort Voice Utterances and Depth Videos for Multimodal Person Identification

Multimodal deep learning, in the context of biometrics, encounters significant challenges due to the dependence on long speech utterances and RGB images, which are often impractical in certain situations. This paper presents a novel solution addressing these issues by leveraging ultrashort voice utt...

Bibliographic Details
Main authors: Moufidi, Abderrazzaq; Rousseau, David; Rasti, Pejman
Format: Online Article Text
Language: English
Published: MDPI 2023
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10346165/
https://www.ncbi.nlm.nih.gov/pubmed/37447739
http://dx.doi.org/10.3390/s23135890