Cargando…
Detecting Parkinson’s disease from sustained phonation and speech signals
This study investigates signals from sustained phonation and text-dependent speech modalities for Parkinson’s disease screening. Phonation corresponds to the vowel /a/ voicing task and speech to the pronunciation of a short sentence in Lithuanian language. Signals were recorded through two channels...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5628839/ https://www.ncbi.nlm.nih.gov/pubmed/28982171 http://dx.doi.org/10.1371/journal.pone.0185613 |
_version_ | 1783268949490663424 |
---|---|
author | Vaiciukynas, Evaldas Verikas, Antanas Gelzinis, Adas Bacauskiene, Marija |
author_facet | Vaiciukynas, Evaldas Verikas, Antanas Gelzinis, Adas Bacauskiene, Marija |
author_sort | Vaiciukynas, Evaldas |
collection | PubMed |
description | This study investigates signals from sustained phonation and text-dependent speech modalities for Parkinson’s disease screening. Phonation corresponds to the vowel /a/ voicing task and speech to the pronunciation of a short sentence in Lithuanian language. Signals were recorded through two channels simultaneously, namely, acoustic cardioid (AC) and smart phone (SP) microphones. Additional modalities were obtained by splitting speech recording into voiced and unvoiced parts. Information in each modality is summarized by 18 well-known audio feature sets. Random forest (RF) is used as a machine learning algorithm, both for individual feature sets and for decision-level fusion. Detection performance is measured by the out-of-bag equal error rate (EER) and the cost of log-likelihood-ratio. Essentia audio feature set was the best using the AC speech modality and YAAFE audio feature set was the best using the SP unvoiced modality, achieving EER of 20.30% and 25.57%, respectively. Fusion of all feature sets and modalities resulted in EER of 19.27% for the AC and 23.00% for the SP channel. Non-linear projection of a RF-based proximity matrix into the 2D space enriched medical decision support by visualization. |
format | Online Article Text |
id | pubmed-5628839 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-56288392017-10-20 Detecting Parkinson’s disease from sustained phonation and speech signals Vaiciukynas, Evaldas Verikas, Antanas Gelzinis, Adas Bacauskiene, Marija PLoS One Research Article This study investigates signals from sustained phonation and text-dependent speech modalities for Parkinson’s disease screening. Phonation corresponds to the vowel /a/ voicing task and speech to the pronunciation of a short sentence in Lithuanian language. Signals were recorded through two channels simultaneously, namely, acoustic cardioid (AC) and smart phone (SP) microphones. Additional modalities were obtained by splitting speech recording into voiced and unvoiced parts. Information in each modality is summarized by 18 well-known audio feature sets. Random forest (RF) is used as a machine learning algorithm, both for individual feature sets and for decision-level fusion. Detection performance is measured by the out-of-bag equal error rate (EER) and the cost of log-likelihood-ratio. Essentia audio feature set was the best using the AC speech modality and YAAFE audio feature set was the best using the SP unvoiced modality, achieving EER of 20.30% and 25.57%, respectively. Fusion of all feature sets and modalities resulted in EER of 19.27% for the AC and 23.00% for the SP channel. Non-linear projection of a RF-based proximity matrix into the 2D space enriched medical decision support by visualization. Public Library of Science 2017-10-05 /pmc/articles/PMC5628839/ /pubmed/28982171 http://dx.doi.org/10.1371/journal.pone.0185613 Text en © 2017 Vaiciukynas et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Vaiciukynas, Evaldas Verikas, Antanas Gelzinis, Adas Bacauskiene, Marija Detecting Parkinson’s disease from sustained phonation and speech signals |
title | Detecting Parkinson’s disease from sustained phonation and speech signals |
title_full | Detecting Parkinson’s disease from sustained phonation and speech signals |
title_fullStr | Detecting Parkinson’s disease from sustained phonation and speech signals |
title_full_unstemmed | Detecting Parkinson’s disease from sustained phonation and speech signals |
title_short | Detecting Parkinson’s disease from sustained phonation and speech signals |
title_sort | detecting parkinson’s disease from sustained phonation and speech signals |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5628839/ https://www.ncbi.nlm.nih.gov/pubmed/28982171 http://dx.doi.org/10.1371/journal.pone.0185613 |
work_keys_str_mv | AT vaiciukynasevaldas detectingparkinsonsdiseasefromsustainedphonationandspeechsignals AT verikasantanas detectingparkinsonsdiseasefromsustainedphonationandspeechsignals AT gelzinisadas detectingparkinsonsdiseasefromsustainedphonationandspeechsignals AT bacauskienemarija detectingparkinsonsdiseasefromsustainedphonationandspeechsignals |