Cargando…

Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications

Two important tasks in many e-commerce applications are identity verification of the user accessing the system and determining the level of rights that the user has for accessing and manipulating system’s resources. The performance of these tasks is directly dependent on the certainty of establishin...

Descripción completa

Detalles Bibliográficos
Autores principales: Krčadinac, Olja, Šošević, Uroš, Starčević, Dušan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8473232/
https://www.ncbi.nlm.nih.gov/pubmed/34577440
http://dx.doi.org/10.3390/s21186231
_version_ 1784574939001520128
author Krčadinac, Olja
Šošević, Uroš
Starčević, Dušan
author_facet Krčadinac, Olja
Šošević, Uroš
Starčević, Dušan
author_sort Krčadinac, Olja
collection PubMed
description Two important tasks in many e-commerce applications are identity verification of the user accessing the system and determining the level of rights that the user has for accessing and manipulating system’s resources. The performance of these tasks is directly dependent on the certainty of establishing the identity of the user. The main research focus of this paper is user identity verification approach based on voice recognition techniques. The paper presents research results connected to the usage of open-source speaker recognition technologies in e-commerce applications with an emphasis on evaluating the performance of the algorithms they use. Four open-source speaker recognition solutions (SPEAR, MARF, ALIZE, and HTK) have been evaluated in cases of mismatched conditions during training and recognition phases. In practice, mismatched conditions are influenced by various lengths of spoken sentences, different types of recording devices, and the usage of different languages in training and recognition phases. All tests conducted in this research were performed in laboratory conditions using the specially designed framework for multimodal biometrics. The obtained results show consistency with the findings of recent research which proves that i-vectors and solutions based on probabilistic linear discriminant analysis (PLDA) continue to be the dominant speaker recognition approaches for text-independent tasks.
format Online
Article
Text
id pubmed-8473232
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-84732322021-09-28 Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications Krčadinac, Olja Šošević, Uroš Starčević, Dušan Sensors (Basel) Communication Two important tasks in many e-commerce applications are identity verification of the user accessing the system and determining the level of rights that the user has for accessing and manipulating system’s resources. The performance of these tasks is directly dependent on the certainty of establishing the identity of the user. The main research focus of this paper is user identity verification approach based on voice recognition techniques. The paper presents research results connected to the usage of open-source speaker recognition technologies in e-commerce applications with an emphasis on evaluating the performance of the algorithms they use. Four open-source speaker recognition solutions (SPEAR, MARF, ALIZE, and HTK) have been evaluated in cases of mismatched conditions during training and recognition phases. In practice, mismatched conditions are influenced by various lengths of spoken sentences, different types of recording devices, and the usage of different languages in training and recognition phases. All tests conducted in this research were performed in laboratory conditions using the specially designed framework for multimodal biometrics. The obtained results show consistency with the findings of recent research which proves that i-vectors and solutions based on probabilistic linear discriminant analysis (PLDA) continue to be the dominant speaker recognition approaches for text-independent tasks. MDPI 2021-09-17 /pmc/articles/PMC8473232/ /pubmed/34577440 http://dx.doi.org/10.3390/s21186231 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Communication
Krčadinac, Olja
Šošević, Uroš
Starčević, Dušan
Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications
title Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications
title_full Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications
title_fullStr Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications
title_full_unstemmed Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications
title_short Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications
title_sort evaluating the performance of speaker recognition solutions in e-commerce applications
topic Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8473232/
https://www.ncbi.nlm.nih.gov/pubmed/34577440
http://dx.doi.org/10.3390/s21186231
work_keys_str_mv AT krcadinacolja evaluatingtheperformanceofspeakerrecognitionsolutionsinecommerceapplications
AT sosevicuros evaluatingtheperformanceofspeakerrecognitionsolutionsinecommerceapplications
AT starcevicdusan evaluatingtheperformanceofspeakerrecognitionsolutionsinecommerceapplications