Cargando…

Predicting dementia from spontaneous speech using large language models

Language impairment is an important biomarker of neurodegenerative disorders such as Alzheimer’s disease (AD). Artificial intelligence (AI), particularly natural language processing (NLP), has recently been increasingly used for early prediction of AD through speech. Yet, relatively few studies exis...

Descripción completa

Detalles Bibliográficos
Autores principales:	Agbavor, Felix, Liang, Hualou
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2022
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9931366/ https://www.ncbi.nlm.nih.gov/pubmed/36812634 http://dx.doi.org/10.1371/journal.pdig.0000168

_version_	1784889233719164928
author	Agbavor, Felix Liang, Hualou
author_facet	Agbavor, Felix Liang, Hualou
author_sort	Agbavor, Felix
collection	PubMed
description	Language impairment is an important biomarker of neurodegenerative disorders such as Alzheimer’s disease (AD). Artificial intelligence (AI), particularly natural language processing (NLP), has recently been increasingly used for early prediction of AD through speech. Yet, relatively few studies exist on using large language models, especially GPT-3, to aid in the early diagnosis of dementia. In this work, we show for the first time that GPT-3 can be utilized to predict dementia from spontaneous speech. Specifically, we leverage the vast semantic knowledge encoded in the GPT-3 model to generate text embedding, a vector representation of the transcribed text from speech, that captures the semantic meaning of the input. We demonstrate that the text embedding can be reliably used to (1) distinguish individuals with AD from healthy controls, and (2) infer the subject’s cognitive testing score, both solely based on speech data. We further show that text embedding considerably outperforms the conventional acoustic feature-based approach and even performs competitively with prevailing fine-tuned models. Together, our results suggest that GPT-3 based text embedding is a viable approach for AD assessment directly from speech and has the potential to improve early diagnosis of dementia.
format	Online Article Text
id	pubmed-9931366
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-99313662023-02-16 Predicting dementia from spontaneous speech using large language models Agbavor, Felix Liang, Hualou PLOS Digit Health Research Article Language impairment is an important biomarker of neurodegenerative disorders such as Alzheimer’s disease (AD). Artificial intelligence (AI), particularly natural language processing (NLP), has recently been increasingly used for early prediction of AD through speech. Yet, relatively few studies exist on using large language models, especially GPT-3, to aid in the early diagnosis of dementia. In this work, we show for the first time that GPT-3 can be utilized to predict dementia from spontaneous speech. Specifically, we leverage the vast semantic knowledge encoded in the GPT-3 model to generate text embedding, a vector representation of the transcribed text from speech, that captures the semantic meaning of the input. We demonstrate that the text embedding can be reliably used to (1) distinguish individuals with AD from healthy controls, and (2) infer the subject’s cognitive testing score, both solely based on speech data. We further show that text embedding considerably outperforms the conventional acoustic feature-based approach and even performs competitively with prevailing fine-tuned models. Together, our results suggest that GPT-3 based text embedding is a viable approach for AD assessment directly from speech and has the potential to improve early diagnosis of dementia. Public Library of Science 2022-12-22 /pmc/articles/PMC9931366/ /pubmed/36812634 http://dx.doi.org/10.1371/journal.pdig.0000168 Text en © 2022 Agbavor, Liang https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Agbavor, Felix Liang, Hualou Predicting dementia from spontaneous speech using large language models
title	Predicting dementia from spontaneous speech using large language models
title_full	Predicting dementia from spontaneous speech using large language models
title_fullStr	Predicting dementia from spontaneous speech using large language models
title_full_unstemmed	Predicting dementia from spontaneous speech using large language models
title_short	Predicting dementia from spontaneous speech using large language models
title_sort	predicting dementia from spontaneous speech using large language models
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9931366/ https://www.ncbi.nlm.nih.gov/pubmed/36812634 http://dx.doi.org/10.1371/journal.pdig.0000168
work_keys_str_mv	AT agbavorfelix predictingdementiafromspontaneousspeechusinglargelanguagemodels AT lianghualou predictingdementiafromspontaneousspeechusinglargelanguagemodels

Predicting dementia from spontaneous speech using large language models

Ejemplares similares