Cargando…
Automated analysis of lexical features in Frontotemporal Degeneration
We implemented an automated analysis of lexical aspects of semi-structured speech produced by healthy elderly controls (n=37) and three patient groups with frontotemporal degeneration (FTD): behavioral variant FTD (n=74), semantic variant primary progressive aphasia (svPPA, n=42), and nonfluent/agra...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7654918/ https://www.ncbi.nlm.nih.gov/pubmed/33173922 http://dx.doi.org/10.1101/2020.09.10.20192054 |
_version_ | 1783608140011405312 |
---|---|
author | Cho, Sunghye Nevler, Naomi Ash, Sharon Shellikeri, Sanjana Irwin, David J. Massimo, Lauren Rascovsky, Katya Olm, Christopher Grossman, Murray Liberman, Mark |
author_facet | Cho, Sunghye Nevler, Naomi Ash, Sharon Shellikeri, Sanjana Irwin, David J. Massimo, Lauren Rascovsky, Katya Olm, Christopher Grossman, Murray Liberman, Mark |
author_sort | Cho, Sunghye |
collection | PubMed |
description | We implemented an automated analysis of lexical aspects of semi-structured speech produced by healthy elderly controls (n=37) and three patient groups with frontotemporal degeneration (FTD): behavioral variant FTD (n=74), semantic variant primary progressive aphasia (svPPA, n=42), and nonfluent/agrammatic PPA (naPPA, n=22). Based on previous findings, we hypothesized that the three patient groups and controls would differ in the counts of part-of-speech (POS) categories and several lexical measures. With a natural language processing program, we automatically tagged POS categories of all words produced during a picture description task. We further counted the number of wh-words, and we rated nouns for abstractness, ambiguity, frequency, familiarity, and age of acquisition. We also computed the cross-entropy estimation, which is a measure of word predictability, and lexical diversity for each description. We validated a subset of the POS data that were automatically tagged with the Google Universal POS scheme using gold-standard POS data tagged by a linguist, and we found that the POS categories from our automated methods were more than 90% accurate. For svPPA patients, we found fewer unique nouns than in naPPA and more pronouns and wh-words than in the other groups. We also found high abstractness, ambiguity, frequency, and familiarity for nouns and the lowest cross-entropy estimation among all groups. These measures were associated with cortical thinning in the left temporal lobe. In naPPA patients, we found increased speech errors and partial words compared to controls, and these impairments were associated with cortical thinning in the left middle frontal gyrus. bvFTD patients’ adjective production was decreased compared to controls and was correlated with their apathy scores. Their adjective production was associated with cortical thinning in the dorsolateral frontal and orbitofrontal gyri. Our results demonstrate distinct language profiles in subgroups of FTD patients and validate our automated method of analyzing FTD patients’ speech. |
format | Online Article Text |
id | pubmed-7654918 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-76549182020-11-11 Automated analysis of lexical features in Frontotemporal Degeneration Cho, Sunghye Nevler, Naomi Ash, Sharon Shellikeri, Sanjana Irwin, David J. Massimo, Lauren Rascovsky, Katya Olm, Christopher Grossman, Murray Liberman, Mark medRxiv Article We implemented an automated analysis of lexical aspects of semi-structured speech produced by healthy elderly controls (n=37) and three patient groups with frontotemporal degeneration (FTD): behavioral variant FTD (n=74), semantic variant primary progressive aphasia (svPPA, n=42), and nonfluent/agrammatic PPA (naPPA, n=22). Based on previous findings, we hypothesized that the three patient groups and controls would differ in the counts of part-of-speech (POS) categories and several lexical measures. With a natural language processing program, we automatically tagged POS categories of all words produced during a picture description task. We further counted the number of wh-words, and we rated nouns for abstractness, ambiguity, frequency, familiarity, and age of acquisition. We also computed the cross-entropy estimation, which is a measure of word predictability, and lexical diversity for each description. We validated a subset of the POS data that were automatically tagged with the Google Universal POS scheme using gold-standard POS data tagged by a linguist, and we found that the POS categories from our automated methods were more than 90% accurate. For svPPA patients, we found fewer unique nouns than in naPPA and more pronouns and wh-words than in the other groups. We also found high abstractness, ambiguity, frequency, and familiarity for nouns and the lowest cross-entropy estimation among all groups. These measures were associated with cortical thinning in the left temporal lobe. In naPPA patients, we found increased speech errors and partial words compared to controls, and these impairments were associated with cortical thinning in the left middle frontal gyrus. bvFTD patients’ adjective production was decreased compared to controls and was correlated with their apathy scores. Their adjective production was associated with cortical thinning in the dorsolateral frontal and orbitofrontal gyri. Our results demonstrate distinct language profiles in subgroups of FTD patients and validate our automated method of analyzing FTD patients’ speech. Cold Spring Harbor Laboratory 2020-11-04 /pmc/articles/PMC7654918/ /pubmed/33173922 http://dx.doi.org/10.1101/2020.09.10.20192054 Text en http://creativecommons.org/licenses/by-nc-nd/4.0/It is made available under a CC-BY-NC-ND 4.0 International license (http://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Article Cho, Sunghye Nevler, Naomi Ash, Sharon Shellikeri, Sanjana Irwin, David J. Massimo, Lauren Rascovsky, Katya Olm, Christopher Grossman, Murray Liberman, Mark Automated analysis of lexical features in Frontotemporal Degeneration |
title | Automated analysis of lexical features in Frontotemporal Degeneration |
title_full | Automated analysis of lexical features in Frontotemporal Degeneration |
title_fullStr | Automated analysis of lexical features in Frontotemporal Degeneration |
title_full_unstemmed | Automated analysis of lexical features in Frontotemporal Degeneration |
title_short | Automated analysis of lexical features in Frontotemporal Degeneration |
title_sort | automated analysis of lexical features in frontotemporal degeneration |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7654918/ https://www.ncbi.nlm.nih.gov/pubmed/33173922 http://dx.doi.org/10.1101/2020.09.10.20192054 |
work_keys_str_mv | AT chosunghye automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration AT nevlernaomi automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration AT ashsharon automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration AT shellikerisanjana automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration AT irwindavidj automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration AT massimolauren automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration AT rascovskykatya automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration AT olmchristopher automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration AT grossmanmurray automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration AT libermanmark automatedanalysisoflexicalfeaturesinfrontotemporaldegeneration |