Cargando…

A corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets

A new wavelet-based method is presented in this work for estimating and tracking the pitch period. The main idea of the proposed new approach consists in extracting the cepstrum excitation signal and applying on it a wavelet transform whose resulting approximation coefficients are smoothed, for a be...

Descripción completa

Detalles Bibliográficos
Autores principales: Bahja, Fadoua, Di Martino, Joseph, Ibn Elhaj, Elhassan, Aboutajdine, Driss
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4859324/
https://www.ncbi.nlm.nih.gov/pubmed/27213131
http://dx.doi.org/10.1186/s40064-016-2162-0
_version_ 1782430947185000448
author Bahja, Fadoua
Di Martino, Joseph
Ibn Elhaj, Elhassan
Aboutajdine, Driss
author_facet Bahja, Fadoua
Di Martino, Joseph
Ibn Elhaj, Elhassan
Aboutajdine, Driss
author_sort Bahja, Fadoua
collection PubMed
description A new wavelet-based method is presented in this work for estimating and tracking the pitch period. The main idea of the proposed new approach consists in extracting the cepstrum excitation signal and applying on it a wavelet transform whose resulting approximation coefficients are smoothed, for a better pitch determination. Although the principle of the algorithms proposed has already been considered previously, the novelty of our methods relies in the use of powerful wavelet transforms well adapted to pitch determination. The wavelet transforms considered in this article are the discrete wavelet transform and the dual tree complex wavelet transform. This article, by all the provided experimental results, corroborates the idea of decomposing the cepstrum excitation by using wavelet transforms for improving pitch detection. Another interesting point of this article relies in using a simple but efficient voicing decision (which actually improves a similar voicing criterion we proposed in a preceding published study) which on one hand respects the real-time process with low latency and on the other hand allows obtaining low classifications errors. The accuracy of the proposed pitch tracking algorithms has been evaluated using the international Bagshaw and the Keele databases which include male and female speakers. Our various experimental results demonstrate that the proposed methods provide important performance improvements when compared with previously published pitch determination algorithms.
format Online
Article
Text
id pubmed-4859324
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-48593242016-05-21 A corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets Bahja, Fadoua Di Martino, Joseph Ibn Elhaj, Elhassan Aboutajdine, Driss Springerplus Research A new wavelet-based method is presented in this work for estimating and tracking the pitch period. The main idea of the proposed new approach consists in extracting the cepstrum excitation signal and applying on it a wavelet transform whose resulting approximation coefficients are smoothed, for a better pitch determination. Although the principle of the algorithms proposed has already been considered previously, the novelty of our methods relies in the use of powerful wavelet transforms well adapted to pitch determination. The wavelet transforms considered in this article are the discrete wavelet transform and the dual tree complex wavelet transform. This article, by all the provided experimental results, corroborates the idea of decomposing the cepstrum excitation by using wavelet transforms for improving pitch detection. Another interesting point of this article relies in using a simple but efficient voicing decision (which actually improves a similar voicing criterion we proposed in a preceding published study) which on one hand respects the real-time process with low latency and on the other hand allows obtaining low classifications errors. The accuracy of the proposed pitch tracking algorithms has been evaluated using the international Bagshaw and the Keele databases which include male and female speakers. Our various experimental results demonstrate that the proposed methods provide important performance improvements when compared with previously published pitch determination algorithms. Springer International Publishing 2016-05-06 /pmc/articles/PMC4859324/ /pubmed/27213131 http://dx.doi.org/10.1186/s40064-016-2162-0 Text en © Bahja et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Research
Bahja, Fadoua
Di Martino, Joseph
Ibn Elhaj, Elhassan
Aboutajdine, Driss
A corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets
title A corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets
title_full A corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets
title_fullStr A corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets
title_full_unstemmed A corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets
title_short A corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets
title_sort corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4859324/
https://www.ncbi.nlm.nih.gov/pubmed/27213131
http://dx.doi.org/10.1186/s40064-016-2162-0
work_keys_str_mv AT bahjafadoua acorroborativestudyonimprovingpitchdeterminationbytimefrequencycepstrumdecompositionusingwavelets
AT dimartinojoseph acorroborativestudyonimprovingpitchdeterminationbytimefrequencycepstrumdecompositionusingwavelets
AT ibnelhajelhassan acorroborativestudyonimprovingpitchdeterminationbytimefrequencycepstrumdecompositionusingwavelets
AT aboutajdinedriss acorroborativestudyonimprovingpitchdeterminationbytimefrequencycepstrumdecompositionusingwavelets
AT bahjafadoua corroborativestudyonimprovingpitchdeterminationbytimefrequencycepstrumdecompositionusingwavelets
AT dimartinojoseph corroborativestudyonimprovingpitchdeterminationbytimefrequencycepstrumdecompositionusingwavelets
AT ibnelhajelhassan corroborativestudyonimprovingpitchdeterminationbytimefrequencycepstrumdecompositionusingwavelets
AT aboutajdinedriss corroborativestudyonimprovingpitchdeterminationbytimefrequencycepstrumdecompositionusingwavelets