Attention-Based Fusion of Ultrashort Voice Utterances and Depth Videos for Multimodal Person Identification

Multimodal deep learning, in the context of biometrics, encounters significant challenges due to the dependence on long speech utterances and RGB images, which are often impractical in certain situations. This paper presents a novel solution addressing these issues by leveraging ultrashort voice utt...

Bibliographic Details
Main authors:	Moufidi, Abderrazzaq; Rousseau, David; Rasti, Pejman
Format:	Online Article Text
Language:	English
Published:	MDPI 2023
Subjects:	Article
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10346165/ https://www.ncbi.nlm.nih.gov/pubmed/37447739 http://dx.doi.org/10.3390/s23135890
