Cargando…

Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals

The purpose of this study was to investigate the feasibility of using neck-surface acceleration signals to discriminate between modal, breathy and pressed voice. Voice data for five English single vowels were collected from 31 female native Canadian English speakers using a portable Neck Surface Acc...

Descripción completa

Detalles Bibliográficos
Autores principales: Lei, Zhengdong, Kennedy, Evan, Fasanella, Laura, Li-Jessen, Nicole Yee-Key, Mongeau, Luc
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7055909/
https://www.ncbi.nlm.nih.gov/pubmed/32133204
http://dx.doi.org/10.3390/app9071505
_version_ 1783503447825317888
author Lei, Zhengdong
Kennedy, Evan
Fasanella, Laura
Li-Jessen, Nicole Yee-Key
Mongeau, Luc
author_facet Lei, Zhengdong
Kennedy, Evan
Fasanella, Laura
Li-Jessen, Nicole Yee-Key
Mongeau, Luc
author_sort Lei, Zhengdong
collection PubMed
description The purpose of this study was to investigate the feasibility of using neck-surface acceleration signals to discriminate between modal, breathy and pressed voice. Voice data for five English single vowels were collected from 31 female native Canadian English speakers using a portable Neck Surface Accelerometer (NSA) and a condenser microphone. Firstly, auditory-perceptual ratings were conducted by five clinically-certificated Speech Language Pathologists (SLPs) to categorize voice type using the audio recordings. Intra- and inter-rater analyses were used to determine the SLPs’ reliability for the perceptual categorization task. Mixed-type samples were screened out, and congruent samples were kept for the subsequent classification task. Secondly, features such as spectral harmonics, jitter, shimmer and spectral entropy were extracted from the NSA data. Supervised learning algorithms were used to map feature vectors to voice type categories. A feature wrapper strategy was used to evaluate the contribution of each feature or feature combinations to the classification between different voice types. The results showed that the highest classification accuracy on a full set was 82.5%. The breathy voice classification accuracy was notably greater (approximately 12%) than those of the other two voice types. Shimmer and spectral entropy were the best correlated metrics for the classification accuracy.
format Online
Article
Text
id pubmed-7055909
institution National Center for Biotechnology Information
language English
publishDate 2019
record_format MEDLINE/PubMed
spelling pubmed-70559092020-03-04 Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals Lei, Zhengdong Kennedy, Evan Fasanella, Laura Li-Jessen, Nicole Yee-Key Mongeau, Luc Appl Sci (Basel) Article The purpose of this study was to investigate the feasibility of using neck-surface acceleration signals to discriminate between modal, breathy and pressed voice. Voice data for five English single vowels were collected from 31 female native Canadian English speakers using a portable Neck Surface Accelerometer (NSA) and a condenser microphone. Firstly, auditory-perceptual ratings were conducted by five clinically-certificated Speech Language Pathologists (SLPs) to categorize voice type using the audio recordings. Intra- and inter-rater analyses were used to determine the SLPs’ reliability for the perceptual categorization task. Mixed-type samples were screened out, and congruent samples were kept for the subsequent classification task. Secondly, features such as spectral harmonics, jitter, shimmer and spectral entropy were extracted from the NSA data. Supervised learning algorithms were used to map feature vectors to voice type categories. A feature wrapper strategy was used to evaluate the contribution of each feature or feature combinations to the classification between different voice types. The results showed that the highest classification accuracy on a full set was 82.5%. The breathy voice classification accuracy was notably greater (approximately 12%) than those of the other two voice types. Shimmer and spectral entropy were the best correlated metrics for the classification accuracy. 2019-04-11 2019-04 /pmc/articles/PMC7055909/ /pubmed/32133204 http://dx.doi.org/10.3390/app9071505 Text en Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lei, Zhengdong
Kennedy, Evan
Fasanella, Laura
Li-Jessen, Nicole Yee-Key
Mongeau, Luc
Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals
title Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals
title_full Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals
title_fullStr Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals
title_full_unstemmed Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals
title_short Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals
title_sort discrimination between modal, breathy and pressed voice for single vowels using neck-surface vibration signals
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7055909/
https://www.ncbi.nlm.nih.gov/pubmed/32133204
http://dx.doi.org/10.3390/app9071505
work_keys_str_mv AT leizhengdong discriminationbetweenmodalbreathyandpressedvoiceforsinglevowelsusingnecksurfacevibrationsignals
AT kennedyevan discriminationbetweenmodalbreathyandpressedvoiceforsinglevowelsusingnecksurfacevibrationsignals
AT fasanellalaura discriminationbetweenmodalbreathyandpressedvoiceforsinglevowelsusingnecksurfacevibrationsignals
AT lijessennicoleyeekey discriminationbetweenmodalbreathyandpressedvoiceforsinglevowelsusingnecksurfacevibrationsignals
AT mongeauluc discriminationbetweenmodalbreathyandpressedvoiceforsinglevowelsusingnecksurfacevibrationsignals