Cargando…

Detecting Parkinson’s disease from sustained phonation and speech signals

This study investigates signals from sustained phonation and text-dependent speech modalities for Parkinson’s disease screening. Phonation corresponds to the vowel /a/ voicing task and speech to the pronunciation of a short sentence in Lithuanian language. Signals were recorded through two channels...

Descripción completa

Detalles Bibliográficos
Autores principales: Vaiciukynas, Evaldas, Verikas, Antanas, Gelzinis, Adas, Bacauskiene, Marija
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5628839/
https://www.ncbi.nlm.nih.gov/pubmed/28982171
http://dx.doi.org/10.1371/journal.pone.0185613
_version_ 1783268949490663424
author Vaiciukynas, Evaldas
Verikas, Antanas
Gelzinis, Adas
Bacauskiene, Marija
author_facet Vaiciukynas, Evaldas
Verikas, Antanas
Gelzinis, Adas
Bacauskiene, Marija
author_sort Vaiciukynas, Evaldas
collection PubMed
description This study investigates signals from sustained phonation and text-dependent speech modalities for Parkinson’s disease screening. Phonation corresponds to the vowel /a/ voicing task and speech to the pronunciation of a short sentence in Lithuanian language. Signals were recorded through two channels simultaneously, namely, acoustic cardioid (AC) and smart phone (SP) microphones. Additional modalities were obtained by splitting speech recording into voiced and unvoiced parts. Information in each modality is summarized by 18 well-known audio feature sets. Random forest (RF) is used as a machine learning algorithm, both for individual feature sets and for decision-level fusion. Detection performance is measured by the out-of-bag equal error rate (EER) and the cost of log-likelihood-ratio. Essentia audio feature set was the best using the AC speech modality and YAAFE audio feature set was the best using the SP unvoiced modality, achieving EER of 20.30% and 25.57%, respectively. Fusion of all feature sets and modalities resulted in EER of 19.27% for the AC and 23.00% for the SP channel. Non-linear projection of a RF-based proximity matrix into the 2D space enriched medical decision support by visualization.
format Online
Article
Text
id pubmed-5628839
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-56288392017-10-20 Detecting Parkinson’s disease from sustained phonation and speech signals Vaiciukynas, Evaldas Verikas, Antanas Gelzinis, Adas Bacauskiene, Marija PLoS One Research Article This study investigates signals from sustained phonation and text-dependent speech modalities for Parkinson’s disease screening. Phonation corresponds to the vowel /a/ voicing task and speech to the pronunciation of a short sentence in Lithuanian language. Signals were recorded through two channels simultaneously, namely, acoustic cardioid (AC) and smart phone (SP) microphones. Additional modalities were obtained by splitting speech recording into voiced and unvoiced parts. Information in each modality is summarized by 18 well-known audio feature sets. Random forest (RF) is used as a machine learning algorithm, both for individual feature sets and for decision-level fusion. Detection performance is measured by the out-of-bag equal error rate (EER) and the cost of log-likelihood-ratio. Essentia audio feature set was the best using the AC speech modality and YAAFE audio feature set was the best using the SP unvoiced modality, achieving EER of 20.30% and 25.57%, respectively. Fusion of all feature sets and modalities resulted in EER of 19.27% for the AC and 23.00% for the SP channel. Non-linear projection of a RF-based proximity matrix into the 2D space enriched medical decision support by visualization. Public Library of Science 2017-10-05 /pmc/articles/PMC5628839/ /pubmed/28982171 http://dx.doi.org/10.1371/journal.pone.0185613 Text en © 2017 Vaiciukynas et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Vaiciukynas, Evaldas
Verikas, Antanas
Gelzinis, Adas
Bacauskiene, Marija
Detecting Parkinson’s disease from sustained phonation and speech signals
title Detecting Parkinson’s disease from sustained phonation and speech signals
title_full Detecting Parkinson’s disease from sustained phonation and speech signals
title_fullStr Detecting Parkinson’s disease from sustained phonation and speech signals
title_full_unstemmed Detecting Parkinson’s disease from sustained phonation and speech signals
title_short Detecting Parkinson’s disease from sustained phonation and speech signals
title_sort detecting parkinson’s disease from sustained phonation and speech signals
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5628839/
https://www.ncbi.nlm.nih.gov/pubmed/28982171
http://dx.doi.org/10.1371/journal.pone.0185613
work_keys_str_mv AT vaiciukynasevaldas detectingparkinsonsdiseasefromsustainedphonationandspeechsignals
AT verikasantanas detectingparkinsonsdiseasefromsustainedphonationandspeechsignals
AT gelzinisadas detectingparkinsonsdiseasefromsustainedphonationandspeechsignals
AT bacauskienemarija detectingparkinsonsdiseasefromsustainedphonationandspeechsignals