Cargando…
An Automatic Prolongation Detection Approach in Continuous Speech With Robustness Against Speaking Rate Variations
In recent years, many methods have been introduced for supporting the diagnosis of stuttering for automatic detection of prolongation in the speech of people who stutter. However, less attention has been paid to treatment processes in which clients learn to speak more slowly. The aim of this study w...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Medknow Publications & Media Pvt Ltd
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5394801/ https://www.ncbi.nlm.nih.gov/pubmed/28487827 |
_version_ | 1783229774202667008 |
---|---|
author | Esmaili, Iman Dabanloo, Nader Jafarnia Vali, Mansour |
author_facet | Esmaili, Iman Dabanloo, Nader Jafarnia Vali, Mansour |
author_sort | Esmaili, Iman |
collection | PubMed |
description | In recent years, many methods have been introduced for supporting the diagnosis of stuttering for automatic detection of prolongation in the speech of people who stutter. However, less attention has been paid to treatment processes in which clients learn to speak more slowly. The aim of this study was to develop a method to help speech-language pathologists (SLPs) during diagnosis and treatment sessions. To this end, speech signals were initially parameterized to perceptual linear predictive (PLP) features. To detect the prolonged segments, the similarities between successive frames of speech signals were calculated based on correlation similarity measures. The segments were labeled as prolongation when the duration of highly similar successive frames exceeded a threshold specified by the speaking rate. The proposed method was evaluated by UCLASS and self-recorded Persian speech databases. The results were also compared with three high-performance studies in automatic prolongation detection. The best accuracies of prolongation detection were 99 and 97.1% for UCLASS and Persian databases, respectively. The proposed method also indicated promising robustness against artificial variation of speaking rate from 70 to 130% of normal speaking rate. |
format | Online Article Text |
id | pubmed-5394801 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Medknow Publications & Media Pvt Ltd |
record_format | MEDLINE/PubMed |
spelling | pubmed-53948012017-05-09 An Automatic Prolongation Detection Approach in Continuous Speech With Robustness Against Speaking Rate Variations Esmaili, Iman Dabanloo, Nader Jafarnia Vali, Mansour J Med Signals Sens Original Article In recent years, many methods have been introduced for supporting the diagnosis of stuttering for automatic detection of prolongation in the speech of people who stutter. However, less attention has been paid to treatment processes in which clients learn to speak more slowly. The aim of this study was to develop a method to help speech-language pathologists (SLPs) during diagnosis and treatment sessions. To this end, speech signals were initially parameterized to perceptual linear predictive (PLP) features. To detect the prolonged segments, the similarities between successive frames of speech signals were calculated based on correlation similarity measures. The segments were labeled as prolongation when the duration of highly similar successive frames exceeded a threshold specified by the speaking rate. The proposed method was evaluated by UCLASS and self-recorded Persian speech databases. The results were also compared with three high-performance studies in automatic prolongation detection. The best accuracies of prolongation detection were 99 and 97.1% for UCLASS and Persian databases, respectively. The proposed method also indicated promising robustness against artificial variation of speaking rate from 70 to 130% of normal speaking rate. Medknow Publications & Media Pvt Ltd 2017 /pmc/articles/PMC5394801/ /pubmed/28487827 Text en Copyright: © 2017 Journal of Medical Signals & Sensors http://creativecommons.org/licenses/by-nc-sa/3.0 This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as the author is credited and the new creations are licensed under the identical terms. |
spellingShingle | Original Article Esmaili, Iman Dabanloo, Nader Jafarnia Vali, Mansour An Automatic Prolongation Detection Approach in Continuous Speech With Robustness Against Speaking Rate Variations |
title | An Automatic Prolongation Detection Approach in Continuous Speech With Robustness Against Speaking Rate Variations |
title_full | An Automatic Prolongation Detection Approach in Continuous Speech With Robustness Against Speaking Rate Variations |
title_fullStr | An Automatic Prolongation Detection Approach in Continuous Speech With Robustness Against Speaking Rate Variations |
title_full_unstemmed | An Automatic Prolongation Detection Approach in Continuous Speech With Robustness Against Speaking Rate Variations |
title_short | An Automatic Prolongation Detection Approach in Continuous Speech With Robustness Against Speaking Rate Variations |
title_sort | automatic prolongation detection approach in continuous speech with robustness against speaking rate variations |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5394801/ https://www.ncbi.nlm.nih.gov/pubmed/28487827 |
work_keys_str_mv | AT esmailiiman anautomaticprolongationdetectionapproachincontinuousspeechwithrobustnessagainstspeakingratevariations AT dabanloonaderjafarnia anautomaticprolongationdetectionapproachincontinuousspeechwithrobustnessagainstspeakingratevariations AT valimansour anautomaticprolongationdetectionapproachincontinuousspeechwithrobustnessagainstspeakingratevariations AT esmailiiman automaticprolongationdetectionapproachincontinuousspeechwithrobustnessagainstspeakingratevariations AT dabanloonaderjafarnia automaticprolongationdetectionapproachincontinuousspeechwithrobustnessagainstspeakingratevariations AT valimansour automaticprolongationdetectionapproachincontinuousspeechwithrobustnessagainstspeakingratevariations |