Cargando…
Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition
Humans are quite adept at communicating in presence of noise. However most speech processing systems, like automatic speech and speaker recognition systems, suffer from a significant drop in performance when speech signals are corrupted with unseen background distortions. The proposed work explores...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4579853/ https://www.ncbi.nlm.nih.gov/pubmed/26412979 http://dx.doi.org/10.1007/s10772-012-9184-y |
_version_ | 1782391333616353280 |
---|---|
author | Nemala, Sridhar Krishna Patil, Kailash Elhilali, Mounya |
author_facet | Nemala, Sridhar Krishna Patil, Kailash Elhilali, Mounya |
author_sort | Nemala, Sridhar Krishna |
collection | PubMed |
description | Humans are quite adept at communicating in presence of noise. However most speech processing systems, like automatic speech and speaker recognition systems, suffer from a significant drop in performance when speech signals are corrupted with unseen background distortions. The proposed work explores the use of a biologically-motivated multi-resolution spectral analysis for speech representation. This approach focuses on the information-rich spectral attributes of speech and presents an intricate yet computationally-efficient analysis of the speech signal by careful choice of model parameters. Further, the approach takes advantage of an information-theoretic analysis of the message and speaker dominant regions in the speech signal, and defines feature representations to address two diverse tasks such as speech and speaker recognition. The proposed analysis surpasses the standard Mel-Frequency Cepstral Coefficients (MFCC), and its enhanced variants (via mean subtraction, variance normalization and time sequence filtering) and yields significant improvements over a state-of-the-art noise robust feature scheme, on both speech and speaker recognition tasks. |
format | Online Article Text |
id | pubmed-4579853 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-45798532015-09-25 Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition Nemala, Sridhar Krishna Patil, Kailash Elhilali, Mounya Int J Speech Technol Article Humans are quite adept at communicating in presence of noise. However most speech processing systems, like automatic speech and speaker recognition systems, suffer from a significant drop in performance when speech signals are corrupted with unseen background distortions. The proposed work explores the use of a biologically-motivated multi-resolution spectral analysis for speech representation. This approach focuses on the information-rich spectral attributes of speech and presents an intricate yet computationally-efficient analysis of the speech signal by careful choice of model parameters. Further, the approach takes advantage of an information-theoretic analysis of the message and speaker dominant regions in the speech signal, and defines feature representations to address two diverse tasks such as speech and speaker recognition. The proposed analysis surpasses the standard Mel-Frequency Cepstral Coefficients (MFCC), and its enhanced variants (via mean subtraction, variance normalization and time sequence filtering) and yields significant improvements over a state-of-the-art noise robust feature scheme, on both speech and speaker recognition tasks. Springer US 2012-12-18 2013 /pmc/articles/PMC4579853/ /pubmed/26412979 http://dx.doi.org/10.1007/s10772-012-9184-y Text en © The Author(s) 2012 https://creativecommons.org/licenses/by/4.0/ This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited. |
spellingShingle | Article Nemala, Sridhar Krishna Patil, Kailash Elhilali, Mounya Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition |
title | Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition |
title_full | Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition |
title_fullStr | Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition |
title_full_unstemmed | Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition |
title_short | Recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition |
title_sort | recognizing the message and the messenger: biomimetic spectral analysis for robust speech and speaker recognition |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4579853/ https://www.ncbi.nlm.nih.gov/pubmed/26412979 http://dx.doi.org/10.1007/s10772-012-9184-y |
work_keys_str_mv | AT nemalasridharkrishna recognizingthemessageandthemessengerbiomimeticspectralanalysisforrobustspeechandspeakerrecognition AT patilkailash recognizingthemessageandthemessengerbiomimeticspectralanalysisforrobustspeechandspeakerrecognition AT elhilalimounya recognizingthemessageandthemessengerbiomimeticspectralanalysisforrobustspeechandspeakerrecognition |