Cargando…
A corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets
A new wavelet-based method is presented in this work for estimating and tracking the pitch period. The main idea of the proposed new approach consists in extracting the cepstrum excitation signal and applying on it a wavelet transform whose resulting approximation coefficients are smoothed, for a be...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4859324/ https://www.ncbi.nlm.nih.gov/pubmed/27213131 http://dx.doi.org/10.1186/s40064-016-2162-0 |
_version_ | 1782430947185000448 |
---|---|
author | Bahja, Fadoua Di Martino, Joseph Ibn Elhaj, Elhassan Aboutajdine, Driss |
author_facet | Bahja, Fadoua Di Martino, Joseph Ibn Elhaj, Elhassan Aboutajdine, Driss |
author_sort | Bahja, Fadoua |
collection | PubMed |
description | A new wavelet-based method is presented in this work for estimating and tracking the pitch period. The main idea of the proposed new approach consists in extracting the cepstrum excitation signal and applying on it a wavelet transform whose resulting approximation coefficients are smoothed, for a better pitch determination. Although the principle of the algorithms proposed has already been considered previously, the novelty of our methods relies in the use of powerful wavelet transforms well adapted to pitch determination. The wavelet transforms considered in this article are the discrete wavelet transform and the dual tree complex wavelet transform. This article, by all the provided experimental results, corroborates the idea of decomposing the cepstrum excitation by using wavelet transforms for improving pitch detection. Another interesting point of this article relies in using a simple but efficient voicing decision (which actually improves a similar voicing criterion we proposed in a preceding published study) which on one hand respects the real-time process with low latency and on the other hand allows obtaining low classifications errors. The accuracy of the proposed pitch tracking algorithms has been evaluated using the international Bagshaw and the Keele databases which include male and female speakers. Our various experimental results demonstrate that the proposed methods provide important performance improvements when compared with previously published pitch determination algorithms. |
format | Online Article Text |
id | pubmed-4859324 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-48593242016-05-21 A corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets Bahja, Fadoua Di Martino, Joseph Ibn Elhaj, Elhassan Aboutajdine, Driss Springerplus Research A new wavelet-based method is presented in this work for estimating and tracking the pitch period. The main idea of the proposed new approach consists in extracting the cepstrum excitation signal and applying on it a wavelet transform whose resulting approximation coefficients are smoothed, for a better pitch determination. Although the principle of the algorithms proposed has already been considered previously, the novelty of our methods relies in the use of powerful wavelet transforms well adapted to pitch determination. The wavelet transforms considered in this article are the discrete wavelet transform and the dual tree complex wavelet transform. This article, by all the provided experimental results, corroborates the idea of decomposing the cepstrum excitation by using wavelet transforms for improving pitch detection. Another interesting point of this article relies in using a simple but efficient voicing decision (which actually improves a similar voicing criterion we proposed in a preceding published study) which on one hand respects the real-time process with low latency and on the other hand allows obtaining low classifications errors. The accuracy of the proposed pitch tracking algorithms has been evaluated using the international Bagshaw and the Keele databases which include male and female speakers. Our various experimental results demonstrate that the proposed methods provide important performance improvements when compared with previously published pitch determination algorithms. Springer International Publishing 2016-05-06 /pmc/articles/PMC4859324/ /pubmed/27213131 http://dx.doi.org/10.1186/s40064-016-2162-0 Text en © Bahja et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Research Bahja, Fadoua Di Martino, Joseph Ibn Elhaj, Elhassan Aboutajdine, Driss A corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets |
title | A corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets |
title_full | A corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets |
title_fullStr | A corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets |
title_full_unstemmed | A corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets |
title_short | A corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets |
title_sort | corroborative study on improving pitch determination by time–frequency cepstrum decomposition using wavelets |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4859324/ https://www.ncbi.nlm.nih.gov/pubmed/27213131 http://dx.doi.org/10.1186/s40064-016-2162-0 |
work_keys_str_mv | AT bahjafadoua acorroborativestudyonimprovingpitchdeterminationbytimefrequencycepstrumdecompositionusingwavelets AT dimartinojoseph acorroborativestudyonimprovingpitchdeterminationbytimefrequencycepstrumdecompositionusingwavelets AT ibnelhajelhassan acorroborativestudyonimprovingpitchdeterminationbytimefrequencycepstrumdecompositionusingwavelets AT aboutajdinedriss acorroborativestudyonimprovingpitchdeterminationbytimefrequencycepstrumdecompositionusingwavelets AT bahjafadoua corroborativestudyonimprovingpitchdeterminationbytimefrequencycepstrumdecompositionusingwavelets AT dimartinojoseph corroborativestudyonimprovingpitchdeterminationbytimefrequencycepstrumdecompositionusingwavelets AT ibnelhajelhassan corroborativestudyonimprovingpitchdeterminationbytimefrequencycepstrumdecompositionusingwavelets AT aboutajdinedriss corroborativestudyonimprovingpitchdeterminationbytimefrequencycepstrumdecompositionusingwavelets |