Cargando…

Automated analysis of lexical features in Frontotemporal Degeneration

We implemented an automated analysis of lexical aspects of semi-structured speech produced by healthy elderly controls (n=37) and three patient groups with frontotemporal degeneration (FTD): behavioral variant FTD (n=74), semantic variant primary progressive aphasia (svPPA, n=42), and nonfluent/agra...

Descripción completa

Detalles Bibliográficos
Autores principales: Cho, Sunghye, Nevler, Naomi, Ash, Sharon, Shellikeri, Sanjana, Irwin, David J., Massimo, Lauren, Rascovsky, Katya, Olm, Christopher, Grossman, Murray, Liberman, Mark
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7654918/
https://www.ncbi.nlm.nih.gov/pubmed/33173922
http://dx.doi.org/10.1101/2020.09.10.20192054
_version_ 1783608140011405312
author Cho, Sunghye
Nevler, Naomi
Ash, Sharon
Shellikeri, Sanjana
Irwin, David J.
Massimo, Lauren
Rascovsky, Katya
Olm, Christopher
Grossman, Murray
Liberman, Mark
author_facet Cho, Sunghye
Nevler, Naomi
Ash, Sharon
Shellikeri, Sanjana
Irwin, David J.
Massimo, Lauren
Rascovsky, Katya
Olm, Christopher
Grossman, Murray
Liberman, Mark
author_sort Cho, Sunghye
collection PubMed
description We implemented an automated analysis of lexical aspects of semi-structured speech produced by healthy elderly controls (n=37) and three patient groups with frontotemporal degeneration (FTD): behavioral variant FTD (n=74), semantic variant primary progressive aphasia (svPPA, n=42), and nonfluent/agrammatic PPA (naPPA, n=22). Based on previous findings, we hypothesized that the three patient groups and controls would differ in the counts of part-of-speech (POS) categories and several lexical measures. With a natural language processing program, we automatically tagged POS categories of all words produced during a picture description task. We further counted the number of wh-words, and we rated nouns for abstractness, ambiguity, frequency, familiarity, and age of acquisition. We also computed the cross-entropy estimation, which is a measure of word predictability, and lexical diversity for each description. We validated a subset of the POS data that were automatically tagged with the Google Universal POS scheme using gold-standard POS data tagged by a linguist, and we found that the POS categories from our automated methods were more than 90% accurate. For svPPA patients, we found fewer unique nouns than in naPPA and more pronouns and wh-words than in the other groups. We also found high abstractness, ambiguity, frequency, and familiarity for nouns and the lowest cross-entropy estimation among all groups. These measures were associated with cortical thinning in the left temporal lobe. In naPPA patients, we found increased speech errors and partial words compared to controls, and these impairments were associated with cortical thinning in the left middle frontal gyrus. bvFTD patients’ adjective production was decreased compared to controls and was correlated with their apathy scores. Their adjective production was associated with cortical thinning in the dorsolateral frontal and orbitofrontal gyri. Our results demonstrate distinct language profiles in subgroups of FTD patients and validate our automated method of analyzing FTD patients’ speech.
format Online
Article
Text
id pubmed-7654918
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-76549182020-11-11 Automated analysis of lexical features in Frontotemporal Degeneration Cho, Sunghye Nevler, Naomi Ash, Sharon Shellikeri, Sanjana Irwin, David J. Massimo, Lauren Rascovsky, Katya Olm, Christopher Grossman, Murray Liberman, Mark medRxiv Article We implemented an automated analysis of lexical aspects of semi-structured speech produced by healthy elderly controls (n=37) and three patient groups with frontotemporal degeneration (FTD): behavioral variant FTD (n=74), semantic variant primary progressive aphasia (svPPA, n=42), and nonfluent/agrammatic PPA (naPPA, n=22). Based on previous findings, we hypothesized that the three patient groups and controls would differ in the counts of part-of-speech (POS) categories and several lexical measures. With a natural language processing program, we automatically tagged POS categories of all words produced during a picture description task. We further counted the number of wh-words, and we rated nouns for abstractness, ambiguity, frequency, familiarity, and age of acquisition. We also computed the cross-entropy estimation, which is a measure of word predictability, and lexical diversity for each description. We validated a subset of the POS data that were automatically tagged with the Google Universal POS scheme using gold-standard POS data tagged by a linguist, and we found that the POS categories from our automated methods were more than 90% accurate. For svPPA patients, we found fewer unique nouns than in naPPA and more pronouns and wh-words than in the other groups. We also found high abstractness, ambiguity, frequency, and familiarity for nouns and the lowest cross-entropy estimation among all groups. These measures were associated with cortical thinning in the left temporal lobe. In naPPA patients, we found increased speech errors and partial words compared to controls, and these impairments were associated with cortical thinning in the left middle frontal gyrus. bvFTD patients’ adjective production was decreased compared to controls and was correlated with their apathy scores. Their adjective production was associated with cortical thinning in the dorsolateral frontal and orbitofrontal gyri. Our results demonstrate distinct language profiles in subgroups of FTD patients and validate our automated method of analyzing FTD patients’ speech. Cold Spring Harbor Laboratory 2020-11-04 /pmc/articles/PMC7654918/ /pubmed/33173922 http://dx.doi.org/10.1101/2020.09.10.20192054 Text en http://creativecommons.org/licenses/by-nc-nd/4.0/It is made available under a CC-BY-NC-ND 4.0 International license (http://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Article
Cho, Sunghye
Nevler, Naomi
Ash, Sharon
Shellikeri, Sanjana
Irwin, David J.
Massimo, Lauren
Rascovsky, Katya
Olm, Christopher
Grossman, Murray
Liberman, Mark
Automated analysis of lexical features in Frontotemporal Degeneration
title Automated analysis of lexical features in Frontotemporal Degeneration
title_full Automated analysis of lexical features in Frontotemporal Degeneration
title_fullStr Automated analysis of lexical features in Frontotemporal Degeneration
title_full_unstemmed Automated analysis of lexical features in Frontotemporal Degeneration
title_short Automated analysis of lexical features in Frontotemporal Degeneration
title_sort automated analysis of lexical features in frontotemporal degeneration
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7654918/
https://www.ncbi.nlm.nih.gov/pubmed/33173922
http://dx.doi.org/10.1101/2020.09.10.20192054
work_keys_str_mv AT chosunghye automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration
AT nevlernaomi automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration
AT ashsharon automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration
AT shellikerisanjana automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration
AT irwindavidj automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration
AT massimolauren automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration
AT rascovskykatya automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration
AT olmchristopher automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration
AT grossmanmurray automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration
AT libermanmark automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration