Cargando…

Cost-Sensitive Learning for Emotion Robust Speaker Recognition

In the field of information security, voice is one of the most important parts in biometrics. Especially, with the development of voice communication through the Internet or telephone system, huge voice data resources are accessed. In speaker recognition, voiceprint can be applied as the unique pass...

Descripción completa

Detalles Bibliográficos
Autores principales:	Li, Dongdong, Yang, Yingchun, Dai, Weihui
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi Publishing Corporation 2014
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4066940/ https://www.ncbi.nlm.nih.gov/pubmed/24999492 http://dx.doi.org/10.1155/2014/628516

_version_	1782322239784353792
author	Li, Dongdong Yang, Yingchun Dai, Weihui
author_facet	Li, Dongdong Yang, Yingchun Dai, Weihui
author_sort	Li, Dongdong
collection	PubMed
description	In the field of information security, voice is one of the most important parts in biometrics. Especially, with the development of voice communication through the Internet or telephone system, huge voice data resources are accessed. In speaker recognition, voiceprint can be applied as the unique password for the user to prove his/her identity. However, speech with various emotions can cause an unacceptably high error rate and aggravate the performance of speaker recognition system. This paper deals with this problem by introducing a cost-sensitive learning technology to reweight the probability of test affective utterances in the pitch envelop level, which can enhance the robustness in emotion-dependent speaker recognition effectively. Based on that technology, a new architecture of recognition system as well as its components is proposed in this paper. The experiment conducted on the Mandarin Affective Speech Corpus shows that an improvement of 8% identification rate over the traditional speaker recognition is achieved.
format	Online Article Text
id	pubmed-4066940
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	Hindawi Publishing Corporation
record_format	MEDLINE/PubMed
spelling	pubmed-40669402014-07-06 Cost-Sensitive Learning for Emotion Robust Speaker Recognition Li, Dongdong Yang, Yingchun Dai, Weihui ScientificWorldJournal Research Article In the field of information security, voice is one of the most important parts in biometrics. Especially, with the development of voice communication through the Internet or telephone system, huge voice data resources are accessed. In speaker recognition, voiceprint can be applied as the unique password for the user to prove his/her identity. However, speech with various emotions can cause an unacceptably high error rate and aggravate the performance of speaker recognition system. This paper deals with this problem by introducing a cost-sensitive learning technology to reweight the probability of test affective utterances in the pitch envelop level, which can enhance the robustness in emotion-dependent speaker recognition effectively. Based on that technology, a new architecture of recognition system as well as its components is proposed in this paper. The experiment conducted on the Mandarin Affective Speech Corpus shows that an improvement of 8% identification rate over the traditional speaker recognition is achieved. Hindawi Publishing Corporation 2014 2014-06-04 /pmc/articles/PMC4066940/ /pubmed/24999492 http://dx.doi.org/10.1155/2014/628516 Text en Copyright © 2014 Dongdong Li et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Li, Dongdong Yang, Yingchun Dai, Weihui Cost-Sensitive Learning for Emotion Robust Speaker Recognition
title	Cost-Sensitive Learning for Emotion Robust Speaker Recognition
title_full	Cost-Sensitive Learning for Emotion Robust Speaker Recognition
title_fullStr	Cost-Sensitive Learning for Emotion Robust Speaker Recognition
title_full_unstemmed	Cost-Sensitive Learning for Emotion Robust Speaker Recognition
title_short	Cost-Sensitive Learning for Emotion Robust Speaker Recognition
title_sort	cost-sensitive learning for emotion robust speaker recognition
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4066940/ https://www.ncbi.nlm.nih.gov/pubmed/24999492 http://dx.doi.org/10.1155/2014/628516
work_keys_str_mv	AT lidongdong costsensitivelearningforemotionrobustspeakerrecognition AT yangyingchun costsensitivelearningforemotionrobustspeakerrecognition AT daiweihui costsensitivelearningforemotionrobustspeakerrecognition

Cost-Sensitive Learning for Emotion Robust Speaker Recognition

Ejemplares similares