Cargando…

Age Estimation Based on Children's Voice: A Fuzzy-Based Decision Fusion Strategy

Automatic estimation of a speaker's age is a challenging research topic in the area of speech analysis. In this paper, a novel approach to estimate a speaker's age is presented. The method features a “divide and conquer” strategy wherein the speech data are divided into six groups based on...

Descripción completa

Detalles Bibliográficos
Autores principales: Mirhassani, Seyed Mostafa, Zourmand, Alireza, Ting, Hua-Nong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4070543/
https://www.ncbi.nlm.nih.gov/pubmed/25006595
http://dx.doi.org/10.1155/2014/534064
_version_ 1782322704884432896
author Mirhassani, Seyed Mostafa
Zourmand, Alireza
Ting, Hua-Nong
author_facet Mirhassani, Seyed Mostafa
Zourmand, Alireza
Ting, Hua-Nong
author_sort Mirhassani, Seyed Mostafa
collection PubMed
description Automatic estimation of a speaker's age is a challenging research topic in the area of speech analysis. In this paper, a novel approach to estimate a speaker's age is presented. The method features a “divide and conquer” strategy wherein the speech data are divided into six groups based on the vowel classes. There are two reasons behind this strategy. First, reduction in the complicated distribution of the processing data improves the classifier's learning performance. Second, different vowel classes contain complementary information for age estimation. Mel-frequency cepstral coefficients are computed for each group and single layer feed-forward neural networks based on self-adaptive extreme learning machine are applied to the features to make a primary decision. Subsequently, fuzzy data fusion is employed to provide an overall decision by aggregating the classifier's outputs. The results are then compared with a number of state-of-the-art age estimation methods. Experiments conducted based on six age groups including children aged between 7 and 12 years revealed that fuzzy fusion of the classifier's outputs resulted in considerable improvement of up to 53.33% in age estimation accuracy. Moreover, the fuzzy fusion of decisions aggregated the complementary information of a speaker's age from various speech sources.
format Online
Article
Text
id pubmed-4070543
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-40705432014-07-08 Age Estimation Based on Children's Voice: A Fuzzy-Based Decision Fusion Strategy Mirhassani, Seyed Mostafa Zourmand, Alireza Ting, Hua-Nong ScientificWorldJournal Research Article Automatic estimation of a speaker's age is a challenging research topic in the area of speech analysis. In this paper, a novel approach to estimate a speaker's age is presented. The method features a “divide and conquer” strategy wherein the speech data are divided into six groups based on the vowel classes. There are two reasons behind this strategy. First, reduction in the complicated distribution of the processing data improves the classifier's learning performance. Second, different vowel classes contain complementary information for age estimation. Mel-frequency cepstral coefficients are computed for each group and single layer feed-forward neural networks based on self-adaptive extreme learning machine are applied to the features to make a primary decision. Subsequently, fuzzy data fusion is employed to provide an overall decision by aggregating the classifier's outputs. The results are then compared with a number of state-of-the-art age estimation methods. Experiments conducted based on six age groups including children aged between 7 and 12 years revealed that fuzzy fusion of the classifier's outputs resulted in considerable improvement of up to 53.33% in age estimation accuracy. Moreover, the fuzzy fusion of decisions aggregated the complementary information of a speaker's age from various speech sources. Hindawi Publishing Corporation 2014 2014-06-05 /pmc/articles/PMC4070543/ /pubmed/25006595 http://dx.doi.org/10.1155/2014/534064 Text en Copyright © 2014 Seyed Mostafa Mirhassani et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Mirhassani, Seyed Mostafa
Zourmand, Alireza
Ting, Hua-Nong
Age Estimation Based on Children's Voice: A Fuzzy-Based Decision Fusion Strategy
title Age Estimation Based on Children's Voice: A Fuzzy-Based Decision Fusion Strategy
title_full Age Estimation Based on Children's Voice: A Fuzzy-Based Decision Fusion Strategy
title_fullStr Age Estimation Based on Children's Voice: A Fuzzy-Based Decision Fusion Strategy
title_full_unstemmed Age Estimation Based on Children's Voice: A Fuzzy-Based Decision Fusion Strategy
title_short Age Estimation Based on Children's Voice: A Fuzzy-Based Decision Fusion Strategy
title_sort age estimation based on children's voice: a fuzzy-based decision fusion strategy
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4070543/
https://www.ncbi.nlm.nih.gov/pubmed/25006595
http://dx.doi.org/10.1155/2014/534064
work_keys_str_mv AT mirhassaniseyedmostafa ageestimationbasedonchildrensvoiceafuzzybaseddecisionfusionstrategy
AT zourmandalireza ageestimationbasedonchildrensvoiceafuzzybaseddecisionfusionstrategy
AT tinghuanong ageestimationbasedonchildrensvoiceafuzzybaseddecisionfusionstrategy