Cargando…
Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals
The purpose of this study was to investigate the feasibility of using neck-surface acceleration signals to discriminate between modal, breathy and pressed voice. Voice data for five English single vowels were collected from 31 female native Canadian English speakers using a portable Neck Surface Acc...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7055909/ https://www.ncbi.nlm.nih.gov/pubmed/32133204 http://dx.doi.org/10.3390/app9071505 |
_version_ | 1783503447825317888 |
---|---|
author | Lei, Zhengdong Kennedy, Evan Fasanella, Laura Li-Jessen, Nicole Yee-Key Mongeau, Luc |
author_facet | Lei, Zhengdong Kennedy, Evan Fasanella, Laura Li-Jessen, Nicole Yee-Key Mongeau, Luc |
author_sort | Lei, Zhengdong |
collection | PubMed |
description | The purpose of this study was to investigate the feasibility of using neck-surface acceleration signals to discriminate between modal, breathy and pressed voice. Voice data for five English single vowels were collected from 31 female native Canadian English speakers using a portable Neck Surface Accelerometer (NSA) and a condenser microphone. Firstly, auditory-perceptual ratings were conducted by five clinically-certificated Speech Language Pathologists (SLPs) to categorize voice type using the audio recordings. Intra- and inter-rater analyses were used to determine the SLPs’ reliability for the perceptual categorization task. Mixed-type samples were screened out, and congruent samples were kept for the subsequent classification task. Secondly, features such as spectral harmonics, jitter, shimmer and spectral entropy were extracted from the NSA data. Supervised learning algorithms were used to map feature vectors to voice type categories. A feature wrapper strategy was used to evaluate the contribution of each feature or feature combinations to the classification between different voice types. The results showed that the highest classification accuracy on a full set was 82.5%. The breathy voice classification accuracy was notably greater (approximately 12%) than those of the other two voice types. Shimmer and spectral entropy were the best correlated metrics for the classification accuracy. |
format | Online Article Text |
id | pubmed-7055909 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
record_format | MEDLINE/PubMed |
spelling | pubmed-70559092020-03-04 Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals Lei, Zhengdong Kennedy, Evan Fasanella, Laura Li-Jessen, Nicole Yee-Key Mongeau, Luc Appl Sci (Basel) Article The purpose of this study was to investigate the feasibility of using neck-surface acceleration signals to discriminate between modal, breathy and pressed voice. Voice data for five English single vowels were collected from 31 female native Canadian English speakers using a portable Neck Surface Accelerometer (NSA) and a condenser microphone. Firstly, auditory-perceptual ratings were conducted by five clinically-certificated Speech Language Pathologists (SLPs) to categorize voice type using the audio recordings. Intra- and inter-rater analyses were used to determine the SLPs’ reliability for the perceptual categorization task. Mixed-type samples were screened out, and congruent samples were kept for the subsequent classification task. Secondly, features such as spectral harmonics, jitter, shimmer and spectral entropy were extracted from the NSA data. Supervised learning algorithms were used to map feature vectors to voice type categories. A feature wrapper strategy was used to evaluate the contribution of each feature or feature combinations to the classification between different voice types. The results showed that the highest classification accuracy on a full set was 82.5%. The breathy voice classification accuracy was notably greater (approximately 12%) than those of the other two voice types. Shimmer and spectral entropy were the best correlated metrics for the classification accuracy. 2019-04-11 2019-04 /pmc/articles/PMC7055909/ /pubmed/32133204 http://dx.doi.org/10.3390/app9071505 Text en Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Lei, Zhengdong Kennedy, Evan Fasanella, Laura Li-Jessen, Nicole Yee-Key Mongeau, Luc Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals |
title | Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals |
title_full | Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals |
title_fullStr | Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals |
title_full_unstemmed | Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals |
title_short | Discrimination between Modal, Breathy and Pressed Voice for Single Vowels Using Neck-Surface Vibration Signals |
title_sort | discrimination between modal, breathy and pressed voice for single vowels using neck-surface vibration signals |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7055909/ https://www.ncbi.nlm.nih.gov/pubmed/32133204 http://dx.doi.org/10.3390/app9071505 |
work_keys_str_mv | AT leizhengdong discriminationbetweenmodalbreathyandpressedvoiceforsinglevowelsusingnecksurfacevibrationsignals AT kennedyevan discriminationbetweenmodalbreathyandpressedvoiceforsinglevowelsusingnecksurfacevibrationsignals AT fasanellalaura discriminationbetweenmodalbreathyandpressedvoiceforsinglevowelsusingnecksurfacevibrationsignals AT lijessennicoleyeekey discriminationbetweenmodalbreathyandpressedvoiceforsinglevowelsusingnecksurfacevibrationsignals AT mongeauluc discriminationbetweenmodalbreathyandpressedvoiceforsinglevowelsusingnecksurfacevibrationsignals |