
A Robust and Low Computational Cost Pitch Estimation Method

Pitch estimation is widely used in speech and audio signal processing. However, current methods of modeling harmonic structure for pitch estimation cannot always match the harmonic distribution of actual signals. Owing to the structure of the vocal tract, the acoustic nature of musical equipment,...


Bibliographic Details
Main authors: Wang, Desheng; Wei, Yangjie; Wang, Yi; Wang, Jing
Format: Online Article Text
Language: English
Published: MDPI 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9414051/
https://www.ncbi.nlm.nih.gov/pubmed/36015789
http://dx.doi.org/10.3390/s22166026
author Wang, Desheng
Wei, Yangjie
Wang, Yi
Wang, Jing
collection PubMed
description Pitch estimation is widely used in speech and audio signal processing. However, current methods of modeling harmonic structure for pitch estimation cannot always match the harmonic distribution of actual signals. Owing to the structure of the vocal tract, the acoustic nature of musical equipment, and spectral leakage, the harmonic frequencies of speech and audio signals often deviate slightly from integer multiples of the pitch. This paper starts from the summation of residual harmonics (SRH) method and makes two main modifications. First, the constraint that spectral peaks sit at strict integer multiples is relaxed to allow slight deviation, which helps capture harmonics. Second, a main pitch segment extension scheme with low computational cost is proposed to use the smoothness prior of pitch more efficiently. In addition, the pitch segment extension scheme is integrated into the SRH method's voiced/unvoiced decision to reduce short-term errors. Accuracy comparison experiments with ten pitch estimation methods show that the proposed method has better overall accuracy and robustness. Time cost experiments show that the time cost of the proposed method is around 1/8 that of the state-of-the-art fast NLS method on the experimental computer.
format Online
Article
Text
id pubmed-9414051
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-9414051 2022-08-27 A Robust and Low Computational Cost Pitch Estimation Method Wang, Desheng Wei, Yangjie Wang, Yi Wang, Jing Sensors (Basel) Article MDPI 2022-08-12 /pmc/articles/PMC9414051/ /pubmed/36015789 http://dx.doi.org/10.3390/s22166026 Text en © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title A Robust and Low Computational Cost Pitch Estimation Method
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9414051/
https://www.ncbi.nlm.nih.gov/pubmed/36015789
http://dx.doi.org/10.3390/s22166026
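The description above says the strict integer-multiple constraint on spectral peak positions is relaxed to allow slight deviation. The sketch below is one way to picture that idea, not the authors' implementation. It scores a pitch candidate in the style of SRH (harmonic magnitudes minus the magnitudes halfway between harmonics) and optionally searches a small window around each harmonic. The function name `srh_score`, the fixed window `tol` measured in bins, and the toy spectrum are all assumptions made for illustration. SRH works on the spectrum of the linear-prediction residual, which is taken as given here.

```python
# Illustrative sketch only: the function name, the fixed bin tolerance, and the
# toy spectrum are assumptions, not the method described in the paper.

def srh_score(spectrum, f0_bin, n_harm=5, tol=0):
    """SRH-style score of a candidate pitch located at bin `f0_bin`.

    spectrum: magnitudes per frequency bin (SRH uses the spectrum of the
              linear-prediction residual; here it is simply given).
    tol:      half-width, in bins, of the search window around each harmonic;
              tol=0 reproduces the strict integer-multiple constraint.
    """
    last = len(spectrum) - 1

    def peak(center):
        # Largest magnitude within +/- tol bins of the nominal position.
        lo, hi = max(0, center - tol), min(last, center + tol)
        return max(spectrum[lo:hi + 1]) if lo <= hi else 0.0

    def at(index):
        # Magnitude at one bin, or 0.0 when the bin is out of range.
        return spectrum[index] if 0 <= index <= last else 0.0

    score = peak(f0_bin)
    for k in range(2, n_harm + 1):
        between = round((k - 0.5) * f0_bin)  # midpoint between harmonics
        score += peak(k * f0_bin) - at(between)
    return score


# Toy spectrum: five partials, each drifting one bin further from k * 100.
spectrum = [0.0] * 600
for k in range(1, 6):
    spectrum[100 * k + (k - 1)] = 1.0  # bins 100, 201, 302, 403, 504

strict = srh_score(spectrum, 100, tol=0)
tolerant = srh_score(spectrum, 100, tol=4)
print(strict, tolerant)  # 1.0 5.0
```

With strict integer multiples only the fundamental is counted. With a four-bin window all five slightly shifted partials contribute, which illustrates why a small tolerance can help capture harmonics. A fixed window also makes neighbouring candidates score alike, so a practical version would need a tie-breaking or smoothing rule, which this sketch does not attempt.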