Cargando…
Validating Automated Sentiment Analysis of Online Cognitive Behavioral Therapy Patient Texts: An Exploratory Study
INTRODUCTION: Sentiment analysis may be a useful technique to derive a user’s emotional state from free text input, allowing for more empathic automated feedback in online cognitive behavioral therapy (iCBT) interventions for psychological disorders such as depression. As guided iCBT is considered m...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6530336/ https://www.ncbi.nlm.nih.gov/pubmed/31156504 http://dx.doi.org/10.3389/fpsyg.2019.01065 |
_version_ | 1783420620133892096 |
---|---|
author | Provoost, Simon Ruwaard, Jeroen van Breda, Ward Riper, Heleen Bosse, Tibor |
author_facet | Provoost, Simon Ruwaard, Jeroen van Breda, Ward Riper, Heleen Bosse, Tibor |
author_sort | Provoost, Simon |
collection | PubMed |
description | INTRODUCTION: Sentiment analysis may be a useful technique to derive a user’s emotional state from free text input, allowing for more empathic automated feedback in online cognitive behavioral therapy (iCBT) interventions for psychological disorders such as depression. As guided iCBT is considered more effective than unguided iCBT, such automated feedback may help close the gap between the two. The accuracy of automated sentiment analysis is domain dependent, and it is unclear how well the technology is applicable to iCBT. This paper presents an empirical study in which automated sentiment analysis by an algorithm for the Dutch language is validated against human judgment. METHODS: A total of 493 iCBT user texts were evaluated on overall sentiment and the presence of five specific emotions by an algorithm, and by 52 psychology students who evaluated 75 randomly selected texts each, providing about eight human evaluations per text. Inter-rater agreement (IRR) between algorithm and humans, and humans among each other, was analyzed by calculating the intra-class correlation under a numerical interpretation of the data, and Cohen’s kappa, and Krippendorff’s alpha under a categorical interpretation. RESULTS: All analyses indicated moderate agreement between the algorithm and average human judgment with respect to evaluating overall sentiment, and low agreement for the specific emotions. Somewhat surprisingly, the same was the case for the IRR among human judges, which means that the algorithm performed about as well as a randomly selected human judge. Thus, considering average human judgment as a benchmark for the applicability of automated sentiment analysis, the technique can be considered for practical application. DISCUSSION/CONCLUSION: The low human-human agreement on the presence of emotions may be due to the nature of the texts, it may simply be difficult for humans to agree on the presence of the selected emotions, or perhaps trained therapists would have reached more consensus. Future research may focus on validating the algorithm against a more solid benchmark, on applying the algorithm in an application in which empathic feedback is provided, for example, by an embodied conversational agent, or on improving the algorithm for the iCBT domain with a bottom-up machine learning approach. |
format | Online Article Text |
id | pubmed-6530336 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-65303362019-05-31 Validating Automated Sentiment Analysis of Online Cognitive Behavioral Therapy Patient Texts: An Exploratory Study Provoost, Simon Ruwaard, Jeroen van Breda, Ward Riper, Heleen Bosse, Tibor Front Psychol Psychology INTRODUCTION: Sentiment analysis may be a useful technique to derive a user’s emotional state from free text input, allowing for more empathic automated feedback in online cognitive behavioral therapy (iCBT) interventions for psychological disorders such as depression. As guided iCBT is considered more effective than unguided iCBT, such automated feedback may help close the gap between the two. The accuracy of automated sentiment analysis is domain dependent, and it is unclear how well the technology is applicable to iCBT. This paper presents an empirical study in which automated sentiment analysis by an algorithm for the Dutch language is validated against human judgment. METHODS: A total of 493 iCBT user texts were evaluated on overall sentiment and the presence of five specific emotions by an algorithm, and by 52 psychology students who evaluated 75 randomly selected texts each, providing about eight human evaluations per text. Inter-rater agreement (IRR) between algorithm and humans, and humans among each other, was analyzed by calculating the intra-class correlation under a numerical interpretation of the data, and Cohen’s kappa, and Krippendorff’s alpha under a categorical interpretation. RESULTS: All analyses indicated moderate agreement between the algorithm and average human judgment with respect to evaluating overall sentiment, and low agreement for the specific emotions. Somewhat surprisingly, the same was the case for the IRR among human judges, which means that the algorithm performed about as well as a randomly selected human judge. Thus, considering average human judgment as a benchmark for the applicability of automated sentiment analysis, the technique can be considered for practical application. DISCUSSION/CONCLUSION: The low human-human agreement on the presence of emotions may be due to the nature of the texts, it may simply be difficult for humans to agree on the presence of the selected emotions, or perhaps trained therapists would have reached more consensus. Future research may focus on validating the algorithm against a more solid benchmark, on applying the algorithm in an application in which empathic feedback is provided, for example, by an embodied conversational agent, or on improving the algorithm for the iCBT domain with a bottom-up machine learning approach. Frontiers Media S.A. 2019-05-14 /pmc/articles/PMC6530336/ /pubmed/31156504 http://dx.doi.org/10.3389/fpsyg.2019.01065 Text en Copyright © 2019 Provoost, Ruwaard, van Breda, Riper and Bosse. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Psychology Provoost, Simon Ruwaard, Jeroen van Breda, Ward Riper, Heleen Bosse, Tibor Validating Automated Sentiment Analysis of Online Cognitive Behavioral Therapy Patient Texts: An Exploratory Study |
title | Validating Automated Sentiment Analysis of Online Cognitive Behavioral Therapy Patient Texts: An Exploratory Study |
title_full | Validating Automated Sentiment Analysis of Online Cognitive Behavioral Therapy Patient Texts: An Exploratory Study |
title_fullStr | Validating Automated Sentiment Analysis of Online Cognitive Behavioral Therapy Patient Texts: An Exploratory Study |
title_full_unstemmed | Validating Automated Sentiment Analysis of Online Cognitive Behavioral Therapy Patient Texts: An Exploratory Study |
title_short | Validating Automated Sentiment Analysis of Online Cognitive Behavioral Therapy Patient Texts: An Exploratory Study |
title_sort | validating automated sentiment analysis of online cognitive behavioral therapy patient texts: an exploratory study |
topic | Psychology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6530336/ https://www.ncbi.nlm.nih.gov/pubmed/31156504 http://dx.doi.org/10.3389/fpsyg.2019.01065 |
work_keys_str_mv | AT provoostsimon validatingautomatedsentimentanalysisofonlinecognitivebehavioraltherapypatienttextsanexploratorystudy AT ruwaardjeroen validatingautomatedsentimentanalysisofonlinecognitivebehavioraltherapypatienttextsanexploratorystudy AT vanbredaward validatingautomatedsentimentanalysisofonlinecognitivebehavioraltherapypatienttextsanexploratorystudy AT riperheleen validatingautomatedsentimentanalysisofonlinecognitivebehavioraltherapypatienttextsanexploratorystudy AT bossetibor validatingautomatedsentimentanalysisofonlinecognitivebehavioraltherapypatienttextsanexploratorystudy |