Cargando…

Voiceprint Identification for Limited Dataset Using the Deep Migration Hybrid Model Based on Transfer Learning

The convolutional neural network (CNN) has made great strides in the area of voiceprint recognition; but it needs a huge number of data samples to train a deep neural network. In practice, it is too difficult to get a large number of training samples, and it cannot achieve a better convergence state...

Descripción completa

Detalles Bibliográficos
Autores principales:	Sun, Cunwei, Yang, Yuxin, Wen, Chang, Xie, Kai, Wen, Fangqing
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2018
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6068892/ https://www.ncbi.nlm.nih.gov/pubmed/30041500 http://dx.doi.org/10.3390/s18072399

_version_	1783343371150950400
author	Sun, Cunwei Yang, Yuxin Wen, Chang Xie, Kai Wen, Fangqing
author_facet	Sun, Cunwei Yang, Yuxin Wen, Chang Xie, Kai Wen, Fangqing
author_sort	Sun, Cunwei
collection	PubMed
description	The convolutional neural network (CNN) has made great strides in the area of voiceprint recognition; but it needs a huge number of data samples to train a deep neural network. In practice, it is too difficult to get a large number of training samples, and it cannot achieve a better convergence state due to the limited dataset. In order to solve this question, a new method using a deep migration hybrid model is put forward, which makes it easier to realize voiceprint recognition for small samples. Firstly, it uses Transfer Learning to transfer the trained network from the big sample voiceprint dataset to our limited voiceprint dataset for the further training. Fully-connected layers of a pre-training model are replaced by restricted Boltzmann machine layers. Secondly, the approach of Data Augmentation is adopted to increase the number of voiceprint datasets. Finally, we introduce fast batch normalization algorithms to improve the speed of the network convergence and shorten the training time. Our new voiceprint recognition approach uses the TLCNN-RBM (convolutional neural network mixed restricted Boltzmann machine based on transfer learning) model, which is the deep migration hybrid model that is used to achieve an average accuracy of over 97%, which is higher than that when using either CNN or the TL-CNN network (convolutional neural network based on transfer learning). Thus, an effective method for a small sample of voiceprint recognition has been provided.
format	Online Article Text
id	pubmed-6068892
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-60688922018-08-07 Voiceprint Identification for Limited Dataset Using the Deep Migration Hybrid Model Based on Transfer Learning Sun, Cunwei Yang, Yuxin Wen, Chang Xie, Kai Wen, Fangqing Sensors (Basel) Article The convolutional neural network (CNN) has made great strides in the area of voiceprint recognition; but it needs a huge number of data samples to train a deep neural network. In practice, it is too difficult to get a large number of training samples, and it cannot achieve a better convergence state due to the limited dataset. In order to solve this question, a new method using a deep migration hybrid model is put forward, which makes it easier to realize voiceprint recognition for small samples. Firstly, it uses Transfer Learning to transfer the trained network from the big sample voiceprint dataset to our limited voiceprint dataset for the further training. Fully-connected layers of a pre-training model are replaced by restricted Boltzmann machine layers. Secondly, the approach of Data Augmentation is adopted to increase the number of voiceprint datasets. Finally, we introduce fast batch normalization algorithms to improve the speed of the network convergence and shorten the training time. Our new voiceprint recognition approach uses the TLCNN-RBM (convolutional neural network mixed restricted Boltzmann machine based on transfer learning) model, which is the deep migration hybrid model that is used to achieve an average accuracy of over 97%, which is higher than that when using either CNN or the TL-CNN network (convolutional neural network based on transfer learning). Thus, an effective method for a small sample of voiceprint recognition has been provided. MDPI 2018-07-23 /pmc/articles/PMC6068892/ /pubmed/30041500 http://dx.doi.org/10.3390/s18072399 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Sun, Cunwei Yang, Yuxin Wen, Chang Xie, Kai Wen, Fangqing Voiceprint Identification for Limited Dataset Using the Deep Migration Hybrid Model Based on Transfer Learning
title	Voiceprint Identification for Limited Dataset Using the Deep Migration Hybrid Model Based on Transfer Learning
title_full	Voiceprint Identification for Limited Dataset Using the Deep Migration Hybrid Model Based on Transfer Learning
title_fullStr	Voiceprint Identification for Limited Dataset Using the Deep Migration Hybrid Model Based on Transfer Learning
title_full_unstemmed	Voiceprint Identification for Limited Dataset Using the Deep Migration Hybrid Model Based on Transfer Learning
title_short	Voiceprint Identification for Limited Dataset Using the Deep Migration Hybrid Model Based on Transfer Learning
title_sort	voiceprint identification for limited dataset using the deep migration hybrid model based on transfer learning
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6068892/ https://www.ncbi.nlm.nih.gov/pubmed/30041500 http://dx.doi.org/10.3390/s18072399
work_keys_str_mv	AT suncunwei voiceprintidentificationforlimiteddatasetusingthedeepmigrationhybridmodelbasedontransferlearning AT yangyuxin voiceprintidentificationforlimiteddatasetusingthedeepmigrationhybridmodelbasedontransferlearning AT wenchang voiceprintidentificationforlimiteddatasetusingthedeepmigrationhybridmodelbasedontransferlearning AT xiekai voiceprintidentificationforlimiteddatasetusingthedeepmigrationhybridmodelbasedontransferlearning AT wenfangqing voiceprintidentificationforlimiteddatasetusingthedeepmigrationhybridmodelbasedontransferlearning

Voiceprint Identification for Limited Dataset Using the Deep Migration Hybrid Model Based on Transfer Learning

Ejemplares similares