Cargando…

Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition

Humans are quite adept at communicating in presence of noise. However most speech processing systems, like automatic speech and speaker recognition systems, suffer from a significant drop in performance when speech signals are corrupted with unseen background distortions. The proposed work explores...

Descripción completa

Detalles Bibliográficos
Autores principales: Nemala, Sridhar Krishna, Patil, Kailash, Elhilali, Mounya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4579853/
https://www.ncbi.nlm.nih.gov/pubmed/26412979
http://dx.doi.org/10.1007/s10772-012-9184-y
_version_ 1782391333616353280
author Nemala, Sridhar Krishna
Patil, Kailash
Elhilali, Mounya
author_facet Nemala, Sridhar Krishna
Patil, Kailash
Elhilali, Mounya
author_sort Nemala, Sridhar Krishna
collection PubMed
description Humans are quite adept at communicating in presence of noise. However most speech processing systems, like automatic speech and speaker recognition systems, suffer from a significant drop in performance when speech signals are corrupted with unseen background distortions. The proposed work explores the use of a biologically-motivated multi-resolution spectral analysis for speech representation. This approach focuses on the information-rich spectral attributes of speech and presents an intricate yet computationally-efficient analysis of the speech signal by careful choice of model parameters. Further, the approach takes advantage of an information-theoretic analysis of the message and speaker dominant regions in the speech signal, and defines feature representations to address two diverse tasks such as speech and speaker recognition. The proposed analysis surpasses the standard Mel-Frequency Cepstral Coefficients (MFCC), and its enhanced variants (via mean subtraction, variance normalization and time sequence filtering) and yields significant improvements over a state-of-the-art noise robust feature scheme, on both speech and speaker recognition tasks.
format Online
Article
Text
id pubmed-4579853
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-45798532015-09-25 Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition Nemala, Sridhar Krishna Patil, Kailash Elhilali, Mounya Int J Speech Technol Article Humans are quite adept at communicating in presence of noise. However most speech processing systems, like automatic speech and speaker recognition systems, suffer from a significant drop in performance when speech signals are corrupted with unseen background distortions. The proposed work explores the use of a biologically-motivated multi-resolution spectral analysis for speech representation. This approach focuses on the information-rich spectral attributes of speech and presents an intricate yet computationally-efficient analysis of the speech signal by careful choice of model parameters. Further, the approach takes advantage of an information-theoretic analysis of the message and speaker dominant regions in the speech signal, and defines feature representations to address two diverse tasks such as speech and speaker recognition. The proposed analysis surpasses the standard Mel-Frequency Cepstral Coefficients (MFCC), and its enhanced variants (via mean subtraction, variance normalization and time sequence filtering) and yields significant improvements over a state-of-the-art noise robust feature scheme, on both speech and speaker recognition tasks. Springer US 2012-12-18 2013 /pmc/articles/PMC4579853/ /pubmed/26412979 http://dx.doi.org/10.1007/s10772-012-9184-y Text en © The Author(s) 2012 https://creativecommons.org/licenses/by/4.0/ This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.
spellingShingle Article
Nemala, Sridhar Krishna
Patil, Kailash
Elhilali, Mounya
Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition
title Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition
title_full Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition
title_fullStr Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition
title_full_unstemmed Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition
title_short Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition
title_sort recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4579853/
https://www.ncbi.nlm.nih.gov/pubmed/26412979
http://dx.doi.org/10.1007/s10772-012-9184-y
work_keys_str_mv AT nemalasridharkrishna recognizingthemessageandthemessengerbiomimeticspectralanalysisforrobustspeechandspeakerrecognition
AT patilkailash recognizingthemessageandthemessengerbiomimeticspectralanalysisforrobustspeechandspeakerrecognition
AT elhilalimounya recognizingthemessageandthemessengerbiomimeticspectralanalysisforrobustspeechandspeakerrecognition