Cargando…
Predicting Depression Severity from Spontaneous Speech as Prompted by a Virtual Agent
INTRODUCTION: One of the major challenges in clinical psychiatry remains the absence of well established objective measures of symptoms’ severity. Clinical insights are mainly provided through keen behavioral observation and subjective questionnaires and scales. OBJECTIVES: The aim of this paper is...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cambridge University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10596522/ http://dx.doi.org/10.1192/j.eurpsy.2023.387 |
_version_ | 1785125124271243264 |
---|---|
author | König, A. Mina, M. Schäfer, S. Linz, N. Tröger, J. |
author_facet | König, A. Mina, M. Schäfer, S. Linz, N. Tröger, J. |
author_sort | König, A. |
collection | PubMed |
description | INTRODUCTION: One of the major challenges in clinical psychiatry remains the absence of well established objective measures of symptoms’ severity. Clinical insights are mainly provided through keen behavioral observation and subjective questionnaires and scales. OBJECTIVES: The aim of this paper is to predict depression severity through speech using the features extracted from the speech as provided by participants during a semi-structured dialogue with a virtual avatar. METHODS: We use data from a subset of the DAICWOZ dataset consisting in 142 dialogues between participants and a virtual avatar during which the avatar uses several prompts to maintain a conversation with the participant. The avatar uses prompts involving the topics of travel, dream jobs, and memorable experiences. From the speech generated from the dialogue, we extract participant utterances separated by prompt and extract features from the three sets of transcripts. We extract content features from the transcript and acoustic features from the excerpt corresponding to the speech from the participant for the prompt in question.We perform regression experiments on the PHQ8 items using the features extracted from each set of transcripts. Furthermore, we combine the features extracted from each set of transcripts and compute partial spearman correlations between them and the PHQ8 items using gender as a covariate. RESULTS: With our best regression model we obtain an R2 of 0.1, explaining 10% of the variance in the PHQ total score. Additionally, we obtain a mean absolute error of 1.25, suggesting that the regressor can detect with more or less precision clinically meaningful differences in depression severity between participants. Partial correlations between the total score and the features show significant correlations between features dependent on the amount of speech generated by each participant, along with the complexity of syntactic structures used. CONCLUSIONS: Automatic analysis of spontaneous speech could help with the detection and monitoring of signs of depression. By combining the use of this technology with timely intervention strategies for instance provided by a virtual agent it could contribute to timely prevention. DISCLOSURE OF INTEREST: A. König: None Declared, M. Mina Employee of: ki:elements GmbH, S. Schäfer Employee of: ki:elements GmbH, N. Linz Shareolder of: ki:elements GmbH, Employee of: ki:elements GmbH, J. Tröger Shareolder of: ki:elements GmbH, Employee of: ki:elements GmbH |
format | Online Article Text |
id | pubmed-10596522 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cambridge University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-105965222023-10-25 Predicting Depression Severity from Spontaneous Speech as Prompted by a Virtual Agent König, A. Mina, M. Schäfer, S. Linz, N. Tröger, J. Eur Psychiatry Abstract INTRODUCTION: One of the major challenges in clinical psychiatry remains the absence of well established objective measures of symptoms’ severity. Clinical insights are mainly provided through keen behavioral observation and subjective questionnaires and scales. OBJECTIVES: The aim of this paper is to predict depression severity through speech using the features extracted from the speech as provided by participants during a semi-structured dialogue with a virtual avatar. METHODS: We use data from a subset of the DAICWOZ dataset consisting in 142 dialogues between participants and a virtual avatar during which the avatar uses several prompts to maintain a conversation with the participant. The avatar uses prompts involving the topics of travel, dream jobs, and memorable experiences. From the speech generated from the dialogue, we extract participant utterances separated by prompt and extract features from the three sets of transcripts. We extract content features from the transcript and acoustic features from the excerpt corresponding to the speech from the participant for the prompt in question.We perform regression experiments on the PHQ8 items using the features extracted from each set of transcripts. Furthermore, we combine the features extracted from each set of transcripts and compute partial spearman correlations between them and the PHQ8 items using gender as a covariate. RESULTS: With our best regression model we obtain an R2 of 0.1, explaining 10% of the variance in the PHQ total score. Additionally, we obtain a mean absolute error of 1.25, suggesting that the regressor can detect with more or less precision clinically meaningful differences in depression severity between participants. Partial correlations between the total score and the features show significant correlations between features dependent on the amount of speech generated by each participant, along with the complexity of syntactic structures used. CONCLUSIONS: Automatic analysis of spontaneous speech could help with the detection and monitoring of signs of depression. By combining the use of this technology with timely intervention strategies for instance provided by a virtual agent it could contribute to timely prevention. DISCLOSURE OF INTEREST: A. König: None Declared, M. Mina Employee of: ki:elements GmbH, S. Schäfer Employee of: ki:elements GmbH, N. Linz Shareolder of: ki:elements GmbH, Employee of: ki:elements GmbH, J. Tröger Shareolder of: ki:elements GmbH, Employee of: ki:elements GmbH Cambridge University Press 2023-07-19 /pmc/articles/PMC10596522/ http://dx.doi.org/10.1192/j.eurpsy.2023.387 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Abstract König, A. Mina, M. Schäfer, S. Linz, N. Tröger, J. Predicting Depression Severity from Spontaneous Speech as Prompted by a Virtual Agent |
title | Predicting Depression Severity from Spontaneous Speech as Prompted by a Virtual Agent |
title_full | Predicting Depression Severity from Spontaneous Speech as Prompted by a Virtual Agent |
title_fullStr | Predicting Depression Severity from Spontaneous Speech as Prompted by a Virtual Agent |
title_full_unstemmed | Predicting Depression Severity from Spontaneous Speech as Prompted by a Virtual Agent |
title_short | Predicting Depression Severity from Spontaneous Speech as Prompted by a Virtual Agent |
title_sort | predicting depression severity from spontaneous speech as prompted by a virtual agent |
topic | Abstract |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10596522/ http://dx.doi.org/10.1192/j.eurpsy.2023.387 |
work_keys_str_mv | AT koniga predictingdepressionseverityfromspontaneousspeechaspromptedbyavirtualagent AT minam predictingdepressionseverityfromspontaneousspeechaspromptedbyavirtualagent AT schafers predictingdepressionseverityfromspontaneousspeechaspromptedbyavirtualagent AT linzn predictingdepressionseverityfromspontaneousspeechaspromptedbyavirtualagent AT trogerj predictingdepressionseverityfromspontaneousspeechaspromptedbyavirtualagent |