Cargando…
Assessing the accuracy of automatic speech recognition for psychotherapy
Accurate transcription of audio recordings in psychotherapy would improve therapy effectiveness, clinician training, and safety monitoring. Although automatic speech recognition software is commercially available, its accuracy in mental health settings has not been well described. It is unclear whic...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7270106/ https://www.ncbi.nlm.nih.gov/pubmed/32550644 http://dx.doi.org/10.1038/s41746-020-0285-8 |
_version_ | 1783541850780467200 |
---|---|
author | Miner, Adam S. Haque, Albert Fries, Jason A. Fleming, Scott L. Wilfley, Denise E. Terence Wilson, G. Milstein, Arnold Jurafsky, Dan Arnow, Bruce A. Stewart Agras, W. Fei-Fei, Li Shah, Nigam H. |
author_facet | Miner, Adam S. Haque, Albert Fries, Jason A. Fleming, Scott L. Wilfley, Denise E. Terence Wilson, G. Milstein, Arnold Jurafsky, Dan Arnow, Bruce A. Stewart Agras, W. Fei-Fei, Li Shah, Nigam H. |
author_sort | Miner, Adam S. |
collection | PubMed |
description | Accurate transcription of audio recordings in psychotherapy would improve therapy effectiveness, clinician training, and safety monitoring. Although automatic speech recognition software is commercially available, its accuracy in mental health settings has not been well described. It is unclear which metrics and thresholds are appropriate for different clinical use cases, which may range from population descriptions to individual safety monitoring. Here we show that automatic speech recognition is feasible in psychotherapy, but further improvements in accuracy are needed before widespread use. Our HIPAA-compliant automatic speech recognition system demonstrated a transcription word error rate of 25%. For depression-related utterances, sensitivity was 80% and positive predictive value was 83%. For clinician-identified harm-related sentences, the word error rate was 34%. These results suggest that automatic speech recognition may support understanding of language patterns and subgroup variation in existing treatments but may not be ready for individual-level safety surveillance. |
format | Online Article Text |
id | pubmed-7270106 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-72701062020-06-16 Assessing the accuracy of automatic speech recognition for psychotherapy Miner, Adam S. Haque, Albert Fries, Jason A. Fleming, Scott L. Wilfley, Denise E. Terence Wilson, G. Milstein, Arnold Jurafsky, Dan Arnow, Bruce A. Stewart Agras, W. Fei-Fei, Li Shah, Nigam H. NPJ Digit Med Article Accurate transcription of audio recordings in psychotherapy would improve therapy effectiveness, clinician training, and safety monitoring. Although automatic speech recognition software is commercially available, its accuracy in mental health settings has not been well described. It is unclear which metrics and thresholds are appropriate for different clinical use cases, which may range from population descriptions to individual safety monitoring. Here we show that automatic speech recognition is feasible in psychotherapy, but further improvements in accuracy are needed before widespread use. Our HIPAA-compliant automatic speech recognition system demonstrated a transcription word error rate of 25%. For depression-related utterances, sensitivity was 80% and positive predictive value was 83%. For clinician-identified harm-related sentences, the word error rate was 34%. These results suggest that automatic speech recognition may support understanding of language patterns and subgroup variation in existing treatments but may not be ready for individual-level safety surveillance. Nature Publishing Group UK 2020-06-03 /pmc/articles/PMC7270106/ /pubmed/32550644 http://dx.doi.org/10.1038/s41746-020-0285-8 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Miner, Adam S. Haque, Albert Fries, Jason A. Fleming, Scott L. Wilfley, Denise E. Terence Wilson, G. Milstein, Arnold Jurafsky, Dan Arnow, Bruce A. Stewart Agras, W. Fei-Fei, Li Shah, Nigam H. Assessing the accuracy of automatic speech recognition for psychotherapy |
title | Assessing the accuracy of automatic speech recognition for psychotherapy |
title_full | Assessing the accuracy of automatic speech recognition for psychotherapy |
title_fullStr | Assessing the accuracy of automatic speech recognition for psychotherapy |
title_full_unstemmed | Assessing the accuracy of automatic speech recognition for psychotherapy |
title_short | Assessing the accuracy of automatic speech recognition for psychotherapy |
title_sort | assessing the accuracy of automatic speech recognition for psychotherapy |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7270106/ https://www.ncbi.nlm.nih.gov/pubmed/32550644 http://dx.doi.org/10.1038/s41746-020-0285-8 |
work_keys_str_mv | AT mineradams assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy AT haquealbert assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy AT friesjasona assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy AT flemingscottl assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy AT wilfleydenisee assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy AT terencewilsong assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy AT milsteinarnold assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy AT jurafskydan assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy AT arnowbrucea assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy AT stewartagrasw assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy AT feifeili assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy AT shahnigamh assessingtheaccuracyofautomaticspeechrecognitionforpsychotherapy |