Cargando…

Face mask effects on speaker verification performance in the presence of noise

Due to its spread via physical contact and the regulations on wearing face masks, COVID-19 has resulted in tough challenges for speaker recognition. Masks may aid in preventing COVID-19 transmission, although the implications of the mask on system performance in a clean environment and with varying...

Descripción completa

Detalles Bibliográficos
Autor principal: Al-Karawi, Khamis A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10225750/
https://www.ncbi.nlm.nih.gov/pubmed/37362700
http://dx.doi.org/10.1007/s11042-023-15824-w
Descripción
Sumario:Due to its spread via physical contact and the regulations on wearing face masks, COVID-19 has resulted in tough challenges for speaker recognition. Masks may aid in preventing COVID-19 transmission, although the implications of the mask on system performance in a clean environment and with varying levels of background noise are unclear. The face mask has an impact on speech output. The task of comprehending speech while wearing a face mask is made more difficult by the mask's frequency response and radiation qualities, which is vary depending on the material and design of the mask. In this study, we recorded speech while wearing a face mask to see how different masks affected a state-of-the-art text-independent speaker verification system using an i-vector speaker identification system. This research investigates the influence of facial coverings on speaker verification. To address this, we investigated the effect of fabric masks on speaker identification in a cafeteria setting. These results present preliminary speaker recognition rates as well as mask verification trials. The result shows that masks had little to no effect in low background noise, with an EER of 2.4–2.5% in 20 dB SNR for both masks compared to no mask at the same level. In noisy conditions, accuracy was 12.7–13.0% lowers than without a mask with a 5 dB SNR, indicating that while different masks perform similarly in low background noise levels, they become more noticeable in high noise levels.