Cargando…

Extraction of Explicit and Implicit Cause-Effect Relationships in Patient-Reported Diabetes-Related Tweets From 2017 to 2021: Deep Learning Approach

BACKGROUND: Intervening in and preventing diabetes distress requires an understanding of its causes and, in particular, from a patient’s perspective. Social media data provide direct access to how patients see and understand their disease and consequently show the causes of diabetes distress. OBJECT...

Descripción completa

Detalles Bibliográficos
Autores principales: Ahne, Adrian, Khetan, Vivek, Tannier, Xavier, Rizvi, Md Imbesat Hassan, Czernichow, Thomas, Orchard, Francisco, Bour, Charline, Fano, Andrew, Fagherazzi, Guy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9346561/
https://www.ncbi.nlm.nih.gov/pubmed/35852829
http://dx.doi.org/10.2196/37201
_version_ 1784761676502925312
author Ahne, Adrian
Khetan, Vivek
Tannier, Xavier
Rizvi, Md Imbesat Hassan
Czernichow, Thomas
Orchard, Francisco
Bour, Charline
Fano, Andrew
Fagherazzi, Guy
author_facet Ahne, Adrian
Khetan, Vivek
Tannier, Xavier
Rizvi, Md Imbesat Hassan
Czernichow, Thomas
Orchard, Francisco
Bour, Charline
Fano, Andrew
Fagherazzi, Guy
author_sort Ahne, Adrian
collection PubMed
description BACKGROUND: Intervening in and preventing diabetes distress requires an understanding of its causes and, in particular, from a patient’s perspective. Social media data provide direct access to how patients see and understand their disease and consequently show the causes of diabetes distress. OBJECTIVE: Leveraging machine learning methods, we aim to extract both explicit and implicit cause-effect relationships in patient-reported diabetes-related tweets and provide a methodology to better understand the opinions, feelings, and observations shared within the diabetes online community from a causality perspective. METHODS: More than 30 million diabetes-related tweets in English were collected between April 2017 and January 2021. Deep learning and natural language processing methods were applied to focus on tweets with personal and emotional content. A cause-effect tweet data set was manually labeled and used to train (1) a fine-tuned BERTweet model to detect causal sentences containing a causal relation and (2) a conditional random field model with Bidirectional Encoder Representations from Transformers (BERT)-based features to extract possible cause-effect associations. Causes and effects were clustered in a semisupervised approach and visualized in an interactive cause-effect network. RESULTS: Causal sentences were detected with a recall of 68% in an imbalanced data set. A conditional random field model with BERT-based features outperformed a fine-tuned BERT model for cause-effect detection with a macro recall of 68%. This led to 96,676 sentences with cause-effect relationships. “Diabetes” was identified as the central cluster followed by “death” and “insulin.” Insulin pricing–related causes were frequently associated with death. CONCLUSIONS: A novel methodology was developed to detect causal sentences and identify both explicit and implicit, single and multiword cause, and the corresponding effect, as expressed in diabetes-related tweets leveraging BERT-based architectures and visualized as cause-effect network. Extracting causal associations in real life, patient-reported outcomes in social media data provide a useful complementary source of information in diabetes research.
format Online
Article
Text
id pubmed-9346561
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-93465612022-08-04 Extraction of Explicit and Implicit Cause-Effect Relationships in Patient-Reported Diabetes-Related Tweets From 2017 to 2021: Deep Learning Approach Ahne, Adrian Khetan, Vivek Tannier, Xavier Rizvi, Md Imbesat Hassan Czernichow, Thomas Orchard, Francisco Bour, Charline Fano, Andrew Fagherazzi, Guy JMIR Med Inform Original Paper BACKGROUND: Intervening in and preventing diabetes distress requires an understanding of its causes and, in particular, from a patient’s perspective. Social media data provide direct access to how patients see and understand their disease and consequently show the causes of diabetes distress. OBJECTIVE: Leveraging machine learning methods, we aim to extract both explicit and implicit cause-effect relationships in patient-reported diabetes-related tweets and provide a methodology to better understand the opinions, feelings, and observations shared within the diabetes online community from a causality perspective. METHODS: More than 30 million diabetes-related tweets in English were collected between April 2017 and January 2021. Deep learning and natural language processing methods were applied to focus on tweets with personal and emotional content. A cause-effect tweet data set was manually labeled and used to train (1) a fine-tuned BERTweet model to detect causal sentences containing a causal relation and (2) a conditional random field model with Bidirectional Encoder Representations from Transformers (BERT)-based features to extract possible cause-effect associations. Causes and effects were clustered in a semisupervised approach and visualized in an interactive cause-effect network. RESULTS: Causal sentences were detected with a recall of 68% in an imbalanced data set. A conditional random field model with BERT-based features outperformed a fine-tuned BERT model for cause-effect detection with a macro recall of 68%. This led to 96,676 sentences with cause-effect relationships. “Diabetes” was identified as the central cluster followed by “death” and “insulin.” Insulin pricing–related causes were frequently associated with death. CONCLUSIONS: A novel methodology was developed to detect causal sentences and identify both explicit and implicit, single and multiword cause, and the corresponding effect, as expressed in diabetes-related tweets leveraging BERT-based architectures and visualized as cause-effect network. Extracting causal associations in real life, patient-reported outcomes in social media data provide a useful complementary source of information in diabetes research. JMIR Publications 2022-07-19 /pmc/articles/PMC9346561/ /pubmed/35852829 http://dx.doi.org/10.2196/37201 Text en ©Adrian Ahne, Vivek Khetan, Xavier Tannier, Md Imbesat Hassan Rizvi, Thomas Czernichow, Francisco Orchard, Charline Bour, Andrew Fano, Guy Fagherazzi. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 19.07.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Ahne, Adrian
Khetan, Vivek
Tannier, Xavier
Rizvi, Md Imbesat Hassan
Czernichow, Thomas
Orchard, Francisco
Bour, Charline
Fano, Andrew
Fagherazzi, Guy
Extraction of Explicit and Implicit Cause-Effect Relationships in Patient-Reported Diabetes-Related Tweets From 2017 to 2021: Deep Learning Approach
title Extraction of Explicit and Implicit Cause-Effect Relationships in Patient-Reported Diabetes-Related Tweets From 2017 to 2021: Deep Learning Approach
title_full Extraction of Explicit and Implicit Cause-Effect Relationships in Patient-Reported Diabetes-Related Tweets From 2017 to 2021: Deep Learning Approach
title_fullStr Extraction of Explicit and Implicit Cause-Effect Relationships in Patient-Reported Diabetes-Related Tweets From 2017 to 2021: Deep Learning Approach
title_full_unstemmed Extraction of Explicit and Implicit Cause-Effect Relationships in Patient-Reported Diabetes-Related Tweets From 2017 to 2021: Deep Learning Approach
title_short Extraction of Explicit and Implicit Cause-Effect Relationships in Patient-Reported Diabetes-Related Tweets From 2017 to 2021: Deep Learning Approach
title_sort extraction of explicit and implicit cause-effect relationships in patient-reported diabetes-related tweets from 2017 to 2021: deep learning approach
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9346561/
https://www.ncbi.nlm.nih.gov/pubmed/35852829
http://dx.doi.org/10.2196/37201
work_keys_str_mv AT ahneadrian extractionofexplicitandimplicitcauseeffectrelationshipsinpatientreporteddiabetesrelatedtweetsfrom2017to2021deeplearningapproach
AT khetanvivek extractionofexplicitandimplicitcauseeffectrelationshipsinpatientreporteddiabetesrelatedtweetsfrom2017to2021deeplearningapproach
AT tannierxavier extractionofexplicitandimplicitcauseeffectrelationshipsinpatientreporteddiabetesrelatedtweetsfrom2017to2021deeplearningapproach
AT rizvimdimbesathassan extractionofexplicitandimplicitcauseeffectrelationshipsinpatientreporteddiabetesrelatedtweetsfrom2017to2021deeplearningapproach
AT czernichowthomas extractionofexplicitandimplicitcauseeffectrelationshipsinpatientreporteddiabetesrelatedtweetsfrom2017to2021deeplearningapproach
AT orchardfrancisco extractionofexplicitandimplicitcauseeffectrelationshipsinpatientreporteddiabetesrelatedtweetsfrom2017to2021deeplearningapproach
AT bourcharline extractionofexplicitandimplicitcauseeffectrelationshipsinpatientreporteddiabetesrelatedtweetsfrom2017to2021deeplearningapproach
AT fanoandrew extractionofexplicitandimplicitcauseeffectrelationshipsinpatientreporteddiabetesrelatedtweetsfrom2017to2021deeplearningapproach
AT fagherazziguy extractionofexplicitandimplicitcauseeffectrelationshipsinpatientreporteddiabetesrelatedtweetsfrom2017to2021deeplearningapproach