Cargando…

Assessing the accuracy of automatic speech recognition for psychotherapy

Accurate transcription of audio recordings in psychotherapy would improve therapy effectiveness, clinician training, and safety monitoring. Although automatic speech recognition software is commercially available, its accuracy in mental health settings has not been well described. It is unclear whic...

Descripción completa

Detalles Bibliográficos
Autores principales: Miner, Adam S., Haque, Albert, Fries, Jason A., Fleming, Scott L., Wilfley, Denise E., Terence Wilson, G., Milstein, Arnold, Jurafsky, Dan, Arnow, Bruce A., Stewart Agras, W., Fei-Fei, Li, Shah, Nigam H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7270106/
https://www.ncbi.nlm.nih.gov/pubmed/32550644
http://dx.doi.org/10.1038/s41746-020-0285-8
_version_ 1783541850780467200
author Miner, Adam S.
Haque, Albert
Fries, Jason A.
Fleming, Scott L.
Wilfley, Denise E.
Terence Wilson, G.
Milstein, Arnold
Jurafsky, Dan
Arnow, Bruce A.
Stewart Agras, W.
Fei-Fei, Li
Shah, Nigam H.
author_facet Miner, Adam S.
Haque, Albert
Fries, Jason A.
Fleming, Scott L.
Wilfley, Denise E.
Terence Wilson, G.
Milstein, Arnold
Jurafsky, Dan
Arnow, Bruce A.
Stewart Agras, W.
Fei-Fei, Li
Shah, Nigam H.
author_sort Miner, Adam S.
collection PubMed
description Accurate transcription of audio recordings in psychotherapy would improve therapy effectiveness, clinician training, and safety monitoring. Although automatic speech recognition software is commercially available, its accuracy in mental health settings has not been well described. It is unclear which metrics and thresholds are appropriate for different clinical use cases, which may range from population descriptions to individual safety monitoring. Here we show that automatic speech recognition is feasible in psychotherapy, but further improvements in accuracy are needed before widespread use. Our HIPAA-compliant automatic speech recognition system demonstrated a transcription word error rate of 25%. For depression-related utterances, sensitivity was 80% and positive predictive value was 83%. For clinician-identified harm-related sentences, the word error rate was 34%. These results suggest that automatic speech recognition may support understanding of language patterns and subgroup variation in existing treatments but may not be ready for individual-level safety surveillance.
format Online
Article
Text
id pubmed-7270106
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-72701062020-06-16 Assessing the accuracy of automatic speech recognition for psychotherapy Miner, Adam S. Haque, Albert Fries, Jason A. Fleming, Scott L. Wilfley, Denise E. Terence Wilson, G. Milstein, Arnold Jurafsky, Dan Arnow, Bruce A. Stewart Agras, W. Fei-Fei, Li Shah, Nigam H. NPJ Digit Med Article Accurate transcription of audio recordings in psychotherapy would improve therapy effectiveness, clinician training, and safety monitoring. Although automatic speech recognition software is commercially available, its accuracy in mental health settings has not been well described. It is unclear which metrics and thresholds are appropriate for different clinical use cases, which may range from population descriptions to individual safety monitoring. Here we show that automatic speech recognition is feasible in psychotherapy, but further improvements in accuracy are needed before widespread use. Our HIPAA-compliant automatic speech recognition system demonstrated a transcription word error rate of 25%. For depression-related utterances, sensitivity was 80% and positive predictive value was 83%. For clinician-identified harm-related sentences, the word error rate was 34%. These results suggest that automatic speech recognition may support understanding of language patterns and subgroup variation in existing treatments but may not be ready for individual-level safety surveillance. Nature Publishing Group UK 2020-06-03 /pmc/articles/PMC7270106/ /pubmed/32550644 http://dx.doi.org/10.1038/s41746-020-0285-8 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Miner, Adam S.
Haque, Albert
Fries, Jason A.
Fleming, Scott L.
Wilfley, Denise E.
Terence Wilson, G.
Milstein, Arnold
Jurafsky, Dan
Arnow, Bruce A.
Stewart Agras, W.
Fei-Fei, Li
Shah, Nigam H.
Assessing the accuracy of automatic speech recognition for psychotherapy
title Assessing the accuracy of automatic speech recognition for psychotherapy
title_full Assessing the accuracy of automatic speech recognition for psychotherapy
title_fullStr Assessing the accuracy of automatic speech recognition for psychotherapy
title_full_unstemmed Assessing the accuracy of automatic speech recognition for psychotherapy
title_short Assessing the accuracy of automatic speech recognition for psychotherapy
title_sort assessing the accuracy of automatic speech recognition for psychotherapy
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7270106/
https://www.ncbi.nlm.nih.gov/pubmed/32550644
http://dx.doi.org/10.1038/s41746-020-0285-8
work_keys_str_mv AT mineradams assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy
AT haquealbert assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy
AT friesjasona assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy
AT flemingscottl assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy
AT wilfleydenisee assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy
AT terencewilsong assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy
AT milsteinarnold assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy
AT jurafskydan assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy
AT arnowbrucea assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy
AT stewartagrasw assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy
AT feifeili assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy
AT shahnigamh assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy