Cargando…

Evaluating Web-Based Automatic Transcription for Alzheimer Speech Data: Transcript Comparison and Machine Learning Analysis

BACKGROUND: Speech data for medical research can be collected noninvasively and in large volumes. Speech analysis has shown promise in diagnosing neurodegenerative disease. To effectively leverage speech data, transcription is important, as there is valuable information contained in lexical content....

Descripción completa

Detalles Bibliográficos
Autores principales: Soroski, Thomas, da Cunha Vasco, Thiago, Newton-Mason, Sally, Granby, Saffrin, Lewis, Caitlin, Harisinghani, Anuj, Rizzo, Matteo, Conati, Cristina, Murray, Gabriel, Carenini, Giuseppe, Field, Thalia S, Jang, Hyeju
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536526/
https://www.ncbi.nlm.nih.gov/pubmed/36129754
http://dx.doi.org/10.2196/33460
_version_ 1784802998837313536
author Soroski, Thomas
da Cunha Vasco, Thiago
Newton-Mason, Sally
Granby, Saffrin
Lewis, Caitlin
Harisinghani, Anuj
Rizzo, Matteo
Conati, Cristina
Murray, Gabriel
Carenini, Giuseppe
Field, Thalia S
Jang, Hyeju
author_facet Soroski, Thomas
da Cunha Vasco, Thiago
Newton-Mason, Sally
Granby, Saffrin
Lewis, Caitlin
Harisinghani, Anuj
Rizzo, Matteo
Conati, Cristina
Murray, Gabriel
Carenini, Giuseppe
Field, Thalia S
Jang, Hyeju
author_sort Soroski, Thomas
collection PubMed
description BACKGROUND: Speech data for medical research can be collected noninvasively and in large volumes. Speech analysis has shown promise in diagnosing neurodegenerative disease. To effectively leverage speech data, transcription is important, as there is valuable information contained in lexical content. Manual transcription, while highly accurate, limits the potential scalability and cost savings associated with language-based screening. OBJECTIVE: To better understand the use of automatic transcription for classification of neurodegenerative disease, namely, Alzheimer disease (AD), mild cognitive impairment (MCI), or subjective memory complaints (SMC) versus healthy controls, we compared automatically generated transcripts against transcripts that went through manual correction. METHODS: We recruited individuals from a memory clinic (“patients”) with a diagnosis of mild-to-moderate AD, (n=44, 30%), MCI (n=20, 13%), SMC (n=8, 5%), as well as healthy controls (n=77, 52%) living in the community. Participants were asked to describe a standardized picture, read a paragraph, and recall a pleasant life experience. We compared transcripts generated using Google speech-to-text software to manually verified transcripts by examining transcription confidence scores, transcription error rates, and machine learning classification accuracy. For the classification tasks, logistic regression, Gaussian naive Bayes, and random forests were used. RESULTS: The transcription software showed higher confidence scores (P<.001) and lower error rates (P>.05) for speech from healthy controls compared with patients. Classification models using human-verified transcripts significantly (P<.001) outperformed automatically generated transcript models for both spontaneous speech tasks. This comparison showed no difference in the reading task. Manually adding pauses to transcripts had no impact on classification performance. However, manually correcting both spontaneous speech tasks led to significantly higher performances in the machine learning models. CONCLUSIONS: We found that automatically transcribed speech data could be used to distinguish patients with a diagnosis of AD, MCI, or SMC from controls. We recommend a human verification step to improve the performance of automatic transcripts, especially for spontaneous tasks. Moreover, human verification can focus on correcting errors and adding punctuation to transcripts. However, manual addition of pauses is not needed, which can simplify the human verification step to more efficiently process large volumes of speech data.
format Online
Article
Text
id pubmed-9536526
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-95365262022-10-07 Evaluating Web-Based Automatic Transcription for Alzheimer Speech Data: Transcript Comparison and Machine Learning Analysis Soroski, Thomas da Cunha Vasco, Thiago Newton-Mason, Sally Granby, Saffrin Lewis, Caitlin Harisinghani, Anuj Rizzo, Matteo Conati, Cristina Murray, Gabriel Carenini, Giuseppe Field, Thalia S Jang, Hyeju JMIR Aging Original Paper BACKGROUND: Speech data for medical research can be collected noninvasively and in large volumes. Speech analysis has shown promise in diagnosing neurodegenerative disease. To effectively leverage speech data, transcription is important, as there is valuable information contained in lexical content. Manual transcription, while highly accurate, limits the potential scalability and cost savings associated with language-based screening. OBJECTIVE: To better understand the use of automatic transcription for classification of neurodegenerative disease, namely, Alzheimer disease (AD), mild cognitive impairment (MCI), or subjective memory complaints (SMC) versus healthy controls, we compared automatically generated transcripts against transcripts that went through manual correction. METHODS: We recruited individuals from a memory clinic (“patients”) with a diagnosis of mild-to-moderate AD, (n=44, 30%), MCI (n=20, 13%), SMC (n=8, 5%), as well as healthy controls (n=77, 52%) living in the community. Participants were asked to describe a standardized picture, read a paragraph, and recall a pleasant life experience. We compared transcripts generated using Google speech-to-text software to manually verified transcripts by examining transcription confidence scores, transcription error rates, and machine learning classification accuracy. For the classification tasks, logistic regression, Gaussian naive Bayes, and random forests were used. RESULTS: The transcription software showed higher confidence scores (P<.001) and lower error rates (P>.05) for speech from healthy controls compared with patients. Classification models using human-verified transcripts significantly (P<.001) outperformed automatically generated transcript models for both spontaneous speech tasks. This comparison showed no difference in the reading task. Manually adding pauses to transcripts had no impact on classification performance. However, manually correcting both spontaneous speech tasks led to significantly higher performances in the machine learning models. CONCLUSIONS: We found that automatically transcribed speech data could be used to distinguish patients with a diagnosis of AD, MCI, or SMC from controls. We recommend a human verification step to improve the performance of automatic transcripts, especially for spontaneous tasks. Moreover, human verification can focus on correcting errors and adding punctuation to transcripts. However, manual addition of pauses is not needed, which can simplify the human verification step to more efficiently process large volumes of speech data. JMIR Publications 2022-09-21 /pmc/articles/PMC9536526/ /pubmed/36129754 http://dx.doi.org/10.2196/33460 Text en ©Thomas Soroski, Thiago da Cunha Vasco, Sally Newton-Mason, Saffrin Granby, Caitlin Lewis, Anuj Harisinghani, Matteo Rizzo, Cristina Conati, Gabriel Murray, Giuseppe Carenini, Thalia S Field, Hyeju Jang. Originally published in JMIR Aging (https://aging.jmir.org), 21.09.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Aging, is properly cited. The complete bibliographic information, a link to the original publication on https://aging.jmir.org, as well as this copyright and license information must be included.
spellingShingle Original Paper
Soroski, Thomas
da Cunha Vasco, Thiago
Newton-Mason, Sally
Granby, Saffrin
Lewis, Caitlin
Harisinghani, Anuj
Rizzo, Matteo
Conati, Cristina
Murray, Gabriel
Carenini, Giuseppe
Field, Thalia S
Jang, Hyeju
Evaluating Web-Based Automatic Transcription for Alzheimer Speech Data: Transcript Comparison and Machine Learning Analysis
title Evaluating Web-Based Automatic Transcription for Alzheimer Speech Data: Transcript Comparison and Machine Learning Analysis
title_full Evaluating Web-Based Automatic Transcription for Alzheimer Speech Data: Transcript Comparison and Machine Learning Analysis
title_fullStr Evaluating Web-Based Automatic Transcription for Alzheimer Speech Data: Transcript Comparison and Machine Learning Analysis
title_full_unstemmed Evaluating Web-Based Automatic Transcription for Alzheimer Speech Data: Transcript Comparison and Machine Learning Analysis
title_short Evaluating Web-Based Automatic Transcription for Alzheimer Speech Data: Transcript Comparison and Machine Learning Analysis
title_sort evaluating web-based automatic transcription for alzheimer speech data: transcript comparison and machine learning analysis
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536526/
https://www.ncbi.nlm.nih.gov/pubmed/36129754
http://dx.doi.org/10.2196/33460
work_keys_str_mv AT soroskithomas evaluatingwebbasedautomatictranscriptionforalzheimerspeechdatatranscriptcomparisonandmachinelearninganalysis
AT dacunhavascothiago evaluatingwebbasedautomatictranscriptionforalzheimerspeechdatatranscriptcomparisonandmachinelearninganalysis
AT newtonmasonsally evaluatingwebbasedautomatictranscriptionforalzheimerspeechdatatranscriptcomparisonandmachinelearninganalysis
AT granbysaffrin evaluatingwebbasedautomatictranscriptionforalzheimerspeechdatatranscriptcomparisonandmachinelearninganalysis
AT lewiscaitlin evaluatingwebbasedautomatictranscriptionforalzheimerspeechdatatranscriptcomparisonandmachinelearninganalysis
AT harisinghanianuj evaluatingwebbasedautomatictranscriptionforalzheimerspeechdatatranscriptcomparisonandmachinelearninganalysis
AT rizzomatteo evaluatingwebbasedautomatictranscriptionforalzheimerspeechdatatranscriptcomparisonandmachinelearninganalysis
AT conaticristina evaluatingwebbasedautomatictranscriptionforalzheimerspeechdatatranscriptcomparisonandmachinelearninganalysis
AT murraygabriel evaluatingwebbasedautomatictranscriptionforalzheimerspeechdatatranscriptcomparisonandmachinelearninganalysis
AT careninigiuseppe evaluatingwebbasedautomatictranscriptionforalzheimerspeechdatatranscriptcomparisonandmachinelearninganalysis
AT fieldthalias evaluatingwebbasedautomatictranscriptionforalzheimerspeechdatatranscriptcomparisonandmachinelearninganalysis
AT janghyeju evaluatingwebbasedautomatictranscriptionforalzheimerspeechdatatranscriptcomparisonandmachinelearninganalysis