Cargando…
Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications
Two important tasks in many e-commerce applications are identity verification of the user accessing the system and determining the level of rights that the user has for accessing and manipulating system’s resources. The performance of these tasks is directly dependent on the certainty of establishin...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8473232/ https://www.ncbi.nlm.nih.gov/pubmed/34577440 http://dx.doi.org/10.3390/s21186231 |
_version_ | 1784574939001520128 |
---|---|
author | Krčadinac, Olja Šošević, Uroš Starčević, Dušan |
author_facet | Krčadinac, Olja Šošević, Uroš Starčević, Dušan |
author_sort | Krčadinac, Olja |
collection | PubMed |
description | Two important tasks in many e-commerce applications are identity verification of the user accessing the system and determining the level of rights that the user has for accessing and manipulating system’s resources. The performance of these tasks is directly dependent on the certainty of establishing the identity of the user. The main research focus of this paper is user identity verification approach based on voice recognition techniques. The paper presents research results connected to the usage of open-source speaker recognition technologies in e-commerce applications with an emphasis on evaluating the performance of the algorithms they use. Four open-source speaker recognition solutions (SPEAR, MARF, ALIZE, and HTK) have been evaluated in cases of mismatched conditions during training and recognition phases. In practice, mismatched conditions are influenced by various lengths of spoken sentences, different types of recording devices, and the usage of different languages in training and recognition phases. All tests conducted in this research were performed in laboratory conditions using the specially designed framework for multimodal biometrics. The obtained results show consistency with the findings of recent research which proves that i-vectors and solutions based on probabilistic linear discriminant analysis (PLDA) continue to be the dominant speaker recognition approaches for text-independent tasks. |
format | Online Article Text |
id | pubmed-8473232 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-84732322021-09-28 Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications Krčadinac, Olja Šošević, Uroš Starčević, Dušan Sensors (Basel) Communication Two important tasks in many e-commerce applications are identity verification of the user accessing the system and determining the level of rights that the user has for accessing and manipulating system’s resources. The performance of these tasks is directly dependent on the certainty of establishing the identity of the user. The main research focus of this paper is user identity verification approach based on voice recognition techniques. The paper presents research results connected to the usage of open-source speaker recognition technologies in e-commerce applications with an emphasis on evaluating the performance of the algorithms they use. Four open-source speaker recognition solutions (SPEAR, MARF, ALIZE, and HTK) have been evaluated in cases of mismatched conditions during training and recognition phases. In practice, mismatched conditions are influenced by various lengths of spoken sentences, different types of recording devices, and the usage of different languages in training and recognition phases. All tests conducted in this research were performed in laboratory conditions using the specially designed framework for multimodal biometrics. The obtained results show consistency with the findings of recent research which proves that i-vectors and solutions based on probabilistic linear discriminant analysis (PLDA) continue to be the dominant speaker recognition approaches for text-independent tasks. MDPI 2021-09-17 /pmc/articles/PMC8473232/ /pubmed/34577440 http://dx.doi.org/10.3390/s21186231 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Communication Krčadinac, Olja Šošević, Uroš Starčević, Dušan Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications |
title | Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications |
title_full | Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications |
title_fullStr | Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications |
title_full_unstemmed | Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications |
title_short | Evaluating the Performance of Speaker Recognition Solutions in E-Commerce Applications |
title_sort | evaluating the performance of speaker recognition solutions in e-commerce applications |
topic | Communication |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8473232/ https://www.ncbi.nlm.nih.gov/pubmed/34577440 http://dx.doi.org/10.3390/s21186231 |
work_keys_str_mv | AT krcadinacolja evaluatingtheperformanceofspeakerrecognitionsolutionsinecommerceapplications AT sosevicuros evaluatingtheperformanceofspeakerrecognitionsolutionsinecommerceapplications AT starcevicdusan evaluatingtheperformanceofspeakerrecognitionsolutionsinecommerceapplications |