Cargando…

An Automatic Prolongation Detection Approach in Continuous Speech With Robustness Against Speaking Rate Variations

In recent years, many methods have been introduced for supporting the diagnosis of stuttering for automatic detection of prolongation in the speech of people who stutter. However, less attention has been paid to treatment processes in which clients learn to speak more slowly. The aim of this study w...

Descripción completa

Detalles Bibliográficos
Autores principales: Esmaili, Iman, Dabanloo, Nader Jafarnia, Vali, Mansour
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Medknow Publications & Media Pvt Ltd 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5394801/
https://www.ncbi.nlm.nih.gov/pubmed/28487827
_version_ 1783229774202667008
author Esmaili, Iman
Dabanloo, Nader Jafarnia
Vali, Mansour
author_facet Esmaili, Iman
Dabanloo, Nader Jafarnia
Vali, Mansour
author_sort Esmaili, Iman
collection PubMed
description In recent years, many methods have been introduced for supporting the diagnosis of stuttering for automatic detection of prolongation in the speech of people who stutter. However, less attention has been paid to treatment processes in which clients learn to speak more slowly. The aim of this study was to develop a method to help speech-language pathologists (SLPs) during diagnosis and treatment sessions. To this end, speech signals were initially parameterized to perceptual linear predictive (PLP) features. To detect the prolonged segments, the similarities between successive frames of speech signals were calculated based on correlation similarity measures. The segments were labeled as prolongation when the duration of highly similar successive frames exceeded a threshold specified by the speaking rate. The proposed method was evaluated by UCLASS and self-recorded Persian speech databases. The results were also compared with three high-performance studies in automatic prolongation detection. The best accuracies of prolongation detection were 99 and 97.1% for UCLASS and Persian databases, respectively. The proposed method also indicated promising robustness against artificial variation of speaking rate from 70 to 130% of normal speaking rate.
format Online
Article
Text
id pubmed-5394801
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Medknow Publications & Media Pvt Ltd
record_format MEDLINE/PubMed
spelling pubmed-53948012017-05-09 An Automatic Prolongation Detection Approach in Continuous Speech With Robustness Against Speaking Rate Variations Esmaili, Iman Dabanloo, Nader Jafarnia Vali, Mansour J Med Signals Sens Original Article In recent years, many methods have been introduced for supporting the diagnosis of stuttering for automatic detection of prolongation in the speech of people who stutter. However, less attention has been paid to treatment processes in which clients learn to speak more slowly. The aim of this study was to develop a method to help speech-language pathologists (SLPs) during diagnosis and treatment sessions. To this end, speech signals were initially parameterized to perceptual linear predictive (PLP) features. To detect the prolonged segments, the similarities between successive frames of speech signals were calculated based on correlation similarity measures. The segments were labeled as prolongation when the duration of highly similar successive frames exceeded a threshold specified by the speaking rate. The proposed method was evaluated by UCLASS and self-recorded Persian speech databases. The results were also compared with three high-performance studies in automatic prolongation detection. The best accuracies of prolongation detection were 99 and 97.1% for UCLASS and Persian databases, respectively. The proposed method also indicated promising robustness against artificial variation of speaking rate from 70 to 130% of normal speaking rate. Medknow Publications & Media Pvt Ltd 2017 /pmc/articles/PMC5394801/ /pubmed/28487827 Text en Copyright: © 2017 Journal of Medical Signals & Sensors http://creativecommons.org/licenses/by-nc-sa/3.0 This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as the author is credited and the new creations are licensed under the identical terms.
spellingShingle Original Article
Esmaili, Iman
Dabanloo, Nader Jafarnia
Vali, Mansour
An Automatic Prolongation Detection Approach in Continuous Speech With Robustness Against Speaking Rate Variations
title An Automatic Prolongation Detection Approach in Continuous Speech With Robustness Against Speaking Rate Variations
title_full An Automatic Prolongation Detection Approach in Continuous Speech With Robustness Against Speaking Rate Variations
title_fullStr An Automatic Prolongation Detection Approach in Continuous Speech With Robustness Against Speaking Rate Variations
title_full_unstemmed An Automatic Prolongation Detection Approach in Continuous Speech With Robustness Against Speaking Rate Variations
title_short An Automatic Prolongation Detection Approach in Continuous Speech With Robustness Against Speaking Rate Variations
title_sort automatic prolongation detection approach in continuous speech with robustness against speaking rate variations
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5394801/
https://www.ncbi.nlm.nih.gov/pubmed/28487827
work_keys_str_mv AT esmailiiman anautomaticprolongationdetectionapproachincontinuousspeechwithrobustnessagainstspeakingratevariations
AT dabanloonaderjafarnia anautomaticprolongationdetectionapproachincontinuousspeechwithrobustnessagainstspeakingratevariations
AT valimansour anautomaticprolongationdetectionapproachincontinuousspeechwithrobustnessagainstspeakingratevariations
AT esmailiiman automaticprolongationdetectionapproachincontinuousspeechwithrobustnessagainstspeakingratevariations
AT dabanloonaderjafarnia automaticprolongationdetectionapproachincontinuousspeechwithrobustnessagainstspeakingratevariations
AT valimansour automaticprolongationdetectionapproachincontinuousspeechwithrobustnessagainstspeakingratevariations