Cargando…

Lambda-vector modeling temporal and channel interactions for text-independent speaker verification

Most of the current excellent models in speaker verification are ResNet-based deep models and attention-based models. These models have a general weakness, which is the large number of parameters and high hardware requirements. On the other hand, many deep structures only generate embedding features...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wei, Guangcun, Min, Hang, Xu, Yunfei, Zhang, Yanna
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9616814/ https://www.ncbi.nlm.nih.gov/pubmed/36307520 http://dx.doi.org/10.1038/s41598-022-22977-5

_version_	1784820722355404800
author	Wei, Guangcun Min, Hang Xu, Yunfei Zhang, Yanna
author_facet	Wei, Guangcun Min, Hang Xu, Yunfei Zhang, Yanna
author_sort	Wei, Guangcun
collection	PubMed
description	Most of the current excellent models in speaker verification are ResNet-based deep models and attention-based models. These models have a general weakness, which is the large number of parameters and high hardware requirements. On the other hand, many deep structures only generate embedding features from the features extracted by the last frame-level layer, which causes shallow features and channel-related features to be ignored. To solve these problems, this paper proposed a shallow speaker verification model based on Lambda-vector, its main structure is composed of three Lambda-SE modules. The module extracts long-distance dependencies between frame-level features and channel-related interaction information to enhance representation of features. Meanwhile, so that adequately mine the information in deep and shallow features, the model introduces multi-layer feature aggregation to fuse the features of different frame-level layers together. It can increase the detailed information in the deep features and improve the model's ability to represent complex information. The experimental results on the public datasets Voxceleb1 and Voxceleb2 show that the model has more stable training speed, fewer model parameters, and better identification performances than baseline models.
format	Online Article Text
id	pubmed-9616814
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Nature Publishing Group UK
record_format	MEDLINE/PubMed
spelling	pubmed-96168142022-10-30 Lambda-vector modeling temporal and channel interactions for text-independent speaker verification Wei, Guangcun Min, Hang Xu, Yunfei Zhang, Yanna Sci Rep Article Most of the current excellent models in speaker verification are ResNet-based deep models and attention-based models. These models have a general weakness, which is the large number of parameters and high hardware requirements. On the other hand, many deep structures only generate embedding features from the features extracted by the last frame-level layer, which causes shallow features and channel-related features to be ignored. To solve these problems, this paper proposed a shallow speaker verification model based on Lambda-vector, its main structure is composed of three Lambda-SE modules. The module extracts long-distance dependencies between frame-level features and channel-related interaction information to enhance representation of features. Meanwhile, so that adequately mine the information in deep and shallow features, the model introduces multi-layer feature aggregation to fuse the features of different frame-level layers together. It can increase the detailed information in the deep features and improve the model's ability to represent complex information. The experimental results on the public datasets Voxceleb1 and Voxceleb2 show that the model has more stable training speed, fewer model parameters, and better identification performances than baseline models. Nature Publishing Group UK 2022-10-28 /pmc/articles/PMC9616814/ /pubmed/36307520 http://dx.doi.org/10.1038/s41598-022-22977-5 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Article Wei, Guangcun Min, Hang Xu, Yunfei Zhang, Yanna Lambda-vector modeling temporal and channel interactions for text-independent speaker verification
title	Lambda-vector modeling temporal and channel interactions for text-independent speaker verification
title_full	Lambda-vector modeling temporal and channel interactions for text-independent speaker verification
title_fullStr	Lambda-vector modeling temporal and channel interactions for text-independent speaker verification
title_full_unstemmed	Lambda-vector modeling temporal and channel interactions for text-independent speaker verification
title_short	Lambda-vector modeling temporal and channel interactions for text-independent speaker verification
title_sort	lambda-vector modeling temporal and channel interactions for text-independent speaker verification
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9616814/ https://www.ncbi.nlm.nih.gov/pubmed/36307520 http://dx.doi.org/10.1038/s41598-022-22977-5
work_keys_str_mv	AT weiguangcun lambdavectormodelingtemporalandchannelinteractionsfortextindependentspeakerverification AT minhang lambdavectormodelingtemporalandchannelinteractionsfortextindependentspeakerverification AT xuyunfei lambdavectormodelingtemporalandchannelinteractionsfortextindependentspeakerverification AT zhangyanna lambdavectormodelingtemporalandchannelinteractionsfortextindependentspeakerverification

Lambda-vector modeling temporal and channel interactions for text-independent speaker verification

Ejemplares similares