Cargando…

Multilingual automation of transcript preprocessing in Alzheimer's disease detection

INTRODUCTION: Analyzing linguistic functions can improve early detection of Alzheimer's disease (AD). To date, no studies have focused on creating a universal pipeline for clinical transcript preprocessing. METHODS: This article presents a simple and efficient method for processing linguistic a...

Descripción completa

Detalles Bibliográficos
Autores principales: Abiven, Frédéric, Ratté, Sylvie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7975846/
https://www.ncbi.nlm.nih.gov/pubmed/33763518
http://dx.doi.org/10.1002/trc2.12147
Descripción
Sumario:INTRODUCTION: Analyzing linguistic functions can improve early detection of Alzheimer's disease (AD). To date, no studies have focused on creating a universal pipeline for clinical transcript preprocessing. METHODS: This article presents a simple and efficient method for processing linguistic and phonetic data, sequencing subproblems of cleaning, normalization, and measure extraction tasks. Because some of these tasks are language‐ and context‐ dependent, they were designed to be easily configurable, thus increasing their scalability when dealing with new corpora. RESULTS: Results show improved performances over previous studies in this time‐consuming preprocessing task. Moreover, our findings showed that some discursive markers extracted from transcripts revealed a significant correlation (>0.5) with cognitive impairment severity. DISCUSSION: This article contributes to the literature on AD by presenting an efficient pipeline that allows speeding up the transcripts preprocessing task. We further invite other researchers to contribute to this work to help improve the quality of this pipeline (https://github.com/LiNCS-lab/usAge).