Cargando…

Attention-Based Fusion of Ultrashort Voice Utterances and Depth Videos for Multimodal Person Identification

Multimodal deep learning, in the context of biometrics, encounters significant challenges due to the dependence on long speech utterances and RGB images, which are often impractical in certain situations. This paper presents a novel solution addressing these issues by leveraging ultrashort voice utt...

Descripción completa

Detalles Bibliográficos
Autores principales:	Moufidi, Abderrazzaq, Rousseau, David, Rasti, Pejman
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2023
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10346165/ https://www.ncbi.nlm.nih.gov/pubmed/37447739 http://dx.doi.org/10.3390/s23135890

Ejemplares similares

Enhancing the Tracking of Seedling Growth Using RGB-Depth Fusion and Deep Learning
por: Garbouge, Hadhami, et al.
Publicado: (2021)

Two-Stage Voice Application Recommender System for Unhandled Utterances in Intelligent Personal Assistant
por: Xiao, Wei, et al.
Publicado: (2022)

Oscillatory brain responses to own names uttered by unfamiliar and familiar voices
por: del Giudice, Renata, et al.
Publicado: (2014)

Integrating Voice Quality Cues in the Pitch Perception of Speech and Non-speech Utterances
por: Kuang, Jianjing, et al.
Publicado: (2018)

Hierarchical Attention-Based Multimodal Fusion Network for Video Emotion Recognition
por: Liu, Xiaodong, et al.
Publicado: (2021)

Understanding utterances /
por: Blakemore, Diane
Publicado: (1992)

Multi-Level Fusion Temporal–Spatial Co-Attention for Video-Based Person Re-Identification
por: Pei, Shengyu, et al.
Publicado: (2021)

Performance Evaluation of a Voice-Based Depression Assessment System Considering the Number and Type of Input Utterances
por: Higuchi, Masakazu, et al.
Publicado: (2021)

Pettenkofer's Last Utterance on Cholera
Publicado: (1883)

‘All changed, changed utterly’
por: Trimble, Michael
Publicado: (2020)

Attention in post-lexical processes of utterance production: Dual-task cost in younger and older adults
por: Fournet, Maryll, et al.
Publicado: (2021)

A Cognitive Architecture for the Coordination of Utterances
por: Gambi, Chiara, et al.
Publicado: (2011)

Acoustic correlates of perceived personality from Korean utterances in a formal communicative setting
por: Song, Jieun, et al.
Publicado: (2023)

Spatio-Temporal Variation of Conversational Utterances on Twitter
por: Alis, Christian M., et al.
Publicado: (2013)

Sequential information in a great ape utterance
por: Fedurek, Pawel, et al.
Publicado: (2016)

“Without this journal, I am in utter darkness”
por: Ellison, Elmien Wolvaardt
Publicado: (2017)

Utterance Clustering Using Stereo Audio Channels
por: Dong, Yingjun, et al.
Publicado: (2021)

Numerical simulations of tunable ultrashort power splitters based on slotted multimode interference couplers
por: Huang, Chia-Chien, et al.
Publicado: (2019)

The Role of Auditory Feedback at Vocalization Onset and Mid-Utterance
por: Scheerer, Nichole E., et al.
Publicado: (2018)

Assessing Without Words: Verbally Incomplete Utterances in Complaints
por: Skogmyr Marian, Klara
Publicado: (2021)

NUVA: A Naming Utterance Verifier for Aphasia Treatment
por: Barbera, David S., et al.
Publicado: (2021)

A Criticism of Some Recent Utterances on Ectopic Gestation
por: Tait, Lawson
Publicado: (1890)

Utterances as Signals for Sharing Tacit Images in Collective Interaction
por: Shoji, Naoto, et al.
Publicado: (2022)

Online supervised attention-based recurrent depth estimation from monocular video
por: Maslov, Dmitrii, et al.
Publicado: (2020)

SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams
por: Abdrakhmanova, Madina, et al.
Publicado: (2021)

Voice and video conferencing fundamentals
por: Firestone, Scott, et al.
Publicado: (2007)

The evolution of flexible bronchoscopy: From historical luxury to utter necessity!!
por: Vaidya, Preyas J, et al.
Publicado: (2015)

Theory of mind in utterance interpretation: the case from clinical pragmatics
por: Cummings, Louise
Publicado: (2015)

Predicting mild cognitive impairment from spontaneous spoken utterances
por: Asgari, Meysam, et al.
Publicado: (2017)

Visual Grouping in Accordance With Utterance Planning Facilitates Speech Production
por: Zhao, Liming, et al.
Publicado: (2018)

LINGUISTIC MEASURES OF SPOKEN UTTERANCES FOR DETECTING MILD COGNITIVE IMPAIRMENT
por: Asgari, Meysam, et al.
Publicado: (2019)

Short-time speaker verification with different speaking style utterances
por: Mao, Hongwei, et al.
Publicado: (2020)

Modality attention fusion model with hybrid multi-head self-attention for video understanding
por: Zhuang, Xuqiang, et al.
Publicado: (2022)

ROSE-X: an annotated data set for evaluation of 3D plant organ segmentation methods
por: Dutagaci, Helin, et al.
Publicado: (2020)

Multimodal Attention Dynamic Fusion Network for Facial Micro-Expression Recognition
por: Yang, Hongling, et al.
Publicado: (2023)

Language Identification in Short Utterances Using Long Short-Term Memory (LSTM) Recurrent Neural Networks
por: Zazo, Ruben, et al.
Publicado: (2016)

Multiscale Attention Fusion for Depth Map Super-Resolution Generative Adversarial Networks
por: Xu, Dan, et al.
Publicado: (2023)

Rate distortion bounds for voice and video
por: Gibson, Jerry D, et al.
Publicado: (2014)

Atomic nuclei utter disintegration into nucleons by high energy nuclear projectiles
por: Strugalski, Z
Publicado: (1994)

Dynamics of Vocalization-Induced Modulation of Auditory Cortical Activity at Mid-utterance
por: Chen, Zhaocong, et al.
Publicado: (2013)

Cannot write session to /tmp/vufind_sessions/sess_npbv9p9t349hspbma7gv03emg0