Cargando…

Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech

Alzheimer's Disease (AD) is a form of dementia that affects the memory, cognition, and motor skills of patients. Extensive research has been done to develop accessible, cost-effective, and non-invasive techniques for the automatic detection of AD. Previous research has shown that speech can be...

Descripción completa

Detalles Bibliográficos
Autores principales:	Haulcy, R'mani, Glass, James
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2021
Materias:	Psychology
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7845557/ https://www.ncbi.nlm.nih.gov/pubmed/33519651 http://dx.doi.org/10.3389/fpsyg.2020.624137

_version_	1783644575638749184
author	Haulcy, R'mani Glass, James
author_facet	Haulcy, R'mani Glass, James
author_sort	Haulcy, R'mani
collection	PubMed
description	Alzheimer's Disease (AD) is a form of dementia that affects the memory, cognition, and motor skills of patients. Extensive research has been done to develop accessible, cost-effective, and non-invasive techniques for the automatic detection of AD. Previous research has shown that speech can be used to distinguish between healthy patients and afflicted patients. In this paper, the ADReSS dataset, a dataset balanced by gender and age, was used to automatically classify AD from spontaneous speech. The performance of five classifiers, as well as a convolutional neural network and long short-term memory network, was compared when trained on audio features (i-vectors and x-vectors) and text features (word vectors, BERT embeddings, LIWC features, and CLAN features). The same audio and text features were used to train five regression models to predict the Mini-Mental State Examination score for each patient, a score that has a maximum value of 30. The top-performing classification models were the support vector machine and random forest classifiers trained on BERT embeddings, which both achieved an accuracy of 85.4% on the test set. The best-performing regression model was the gradient boosting regression model trained on BERT embeddings and CLAN features, which had a root mean squared error of 4.56 on the test set. The performance on both tasks illustrates the feasibility of using speech to classify AD and predict neuropsychological scores.
format	Online Article Text
id	pubmed-7845557
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-78455572021-01-30 Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech Haulcy, R'mani Glass, James Front Psychol Psychology Alzheimer's Disease (AD) is a form of dementia that affects the memory, cognition, and motor skills of patients. Extensive research has been done to develop accessible, cost-effective, and non-invasive techniques for the automatic detection of AD. Previous research has shown that speech can be used to distinguish between healthy patients and afflicted patients. In this paper, the ADReSS dataset, a dataset balanced by gender and age, was used to automatically classify AD from spontaneous speech. The performance of five classifiers, as well as a convolutional neural network and long short-term memory network, was compared when trained on audio features (i-vectors and x-vectors) and text features (word vectors, BERT embeddings, LIWC features, and CLAN features). The same audio and text features were used to train five regression models to predict the Mini-Mental State Examination score for each patient, a score that has a maximum value of 30. The top-performing classification models were the support vector machine and random forest classifiers trained on BERT embeddings, which both achieved an accuracy of 85.4% on the test set. The best-performing regression model was the gradient boosting regression model trained on BERT embeddings and CLAN features, which had a root mean squared error of 4.56 on the test set. The performance on both tasks illustrates the feasibility of using speech to classify AD and predict neuropsychological scores. Frontiers Media S.A. 2021-01-15 /pmc/articles/PMC7845557/ /pubmed/33519651 http://dx.doi.org/10.3389/fpsyg.2020.624137 Text en Copyright © 2021 Haulcy and Glass. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Psychology Haulcy, R'mani Glass, James Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech
title	Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech
title_full	Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech
title_fullStr	Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech
title_full_unstemmed	Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech
title_short	Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech
title_sort	classifying alzheimer's disease using audio and text-based representations of speech
topic	Psychology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7845557/ https://www.ncbi.nlm.nih.gov/pubmed/33519651 http://dx.doi.org/10.3389/fpsyg.2020.624137
work_keys_str_mv	AT haulcyrmani classifyingalzheimersdiseaseusingaudioandtextbasedrepresentationsofspeech AT glassjames classifyingalzheimersdiseaseusingaudioandtextbasedrepresentationsofspeech

Classifying Alzheimer's Disease Using Audio and Text-Based Representations of Speech

Ejemplares similares