Cargando…

Cross-Platform Detection of Psychiatric Hospitalization via Social Media Data: Comparison Study

BACKGROUND: Previous research has shown the feasibility of using machine learning models trained on social media data from a single platform (eg, Facebook or Twitter) to distinguish individuals either with a diagnosis of mental illness or experiencing an adverse outcome from healthy controls. Howeve...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Viet Cuong, Lu, Nathaniel, Kane, John M, Birnbaum, Michael L, De Choudhury, Munmun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9840099/
https://www.ncbi.nlm.nih.gov/pubmed/36583932
http://dx.doi.org/10.2196/39747
_version_ 1784869574087278592
author Nguyen, Viet Cuong
Lu, Nathaniel
Kane, John M
Birnbaum, Michael L
De Choudhury, Munmun
author_facet Nguyen, Viet Cuong
Lu, Nathaniel
Kane, John M
Birnbaum, Michael L
De Choudhury, Munmun
author_sort Nguyen, Viet Cuong
collection PubMed
description BACKGROUND: Previous research has shown the feasibility of using machine learning models trained on social media data from a single platform (eg, Facebook or Twitter) to distinguish individuals either with a diagnosis of mental illness or experiencing an adverse outcome from healthy controls. However, the performance of such models on data from novel social media platforms unseen in the training data (eg, Instagram and TikTok) has not been investigated in previous literature. OBJECTIVE: Our study examined the feasibility of building machine learning classifiers that can effectively predict an upcoming psychiatric hospitalization given social media data from platforms unseen in the classifiers’ training data despite the preliminary evidence on identity fragmentation on the investigated social media platforms. METHODS: Windowed timeline data of patients with a diagnosis of schizophrenia spectrum disorder before a known hospitalization event and healthy controls were gathered from 3 platforms: Facebook (254/268, 94.8% of participants), Twitter (51/268, 19% of participants), and Instagram (134/268, 50% of participants). We then used a 3 × 3 combinatorial binary classification design to train machine learning classifiers and evaluate their performance on testing data from all available platforms. We further compared results from models in intraplatform experiments (ie, training and testing data belonging to the same platform) to those from models in interplatform experiments (ie, training and testing data belonging to different platforms). Finally, we used Shapley Additive Explanation values to extract the top predictive features to explain and compare the underlying constructs that predict hospitalization on each platform. RESULTS: We found that models in intraplatform experiments on average achieved an F(1)-score of 0.72 (SD 0.07) in predicting a psychiatric hospitalization because of schizophrenia spectrum disorder, which is 68% higher than the average of models in interplatform experiments at an F(1)-score of 0.428 (SD 0.11). When investigating the key drivers for divergence in construct validities between models, an analysis of top features for the intraplatform models showed both low predictive feature overlap between the platforms and low pairwise rank correlation (<0.1) between the platforms’ top feature rankings. Furthermore, low average cosine similarity of data between platforms within participants in comparison with the same measurement on data within platforms between participants points to evidence of identity fragmentation of participants between platforms. CONCLUSIONS: We demonstrated that models built on one platform’s data to predict critical mental health treatment outcomes such as hospitalization do not generalize to another platform. In our case, this is because different social media platforms consistently reflect different segments of participants’ identities. With the changing ecosystem of social media use among different demographic groups and as web-based identities continue to become fragmented across platforms, further research on holistic approaches to harnessing these diverse data sources is required.
format Online
Article
Text
id pubmed-9840099
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-98400992023-01-15 Cross-Platform Detection of Psychiatric Hospitalization via Social Media Data: Comparison Study Nguyen, Viet Cuong Lu, Nathaniel Kane, John M Birnbaum, Michael L De Choudhury, Munmun JMIR Ment Health Original Paper BACKGROUND: Previous research has shown the feasibility of using machine learning models trained on social media data from a single platform (eg, Facebook or Twitter) to distinguish individuals either with a diagnosis of mental illness or experiencing an adverse outcome from healthy controls. However, the performance of such models on data from novel social media platforms unseen in the training data (eg, Instagram and TikTok) has not been investigated in previous literature. OBJECTIVE: Our study examined the feasibility of building machine learning classifiers that can effectively predict an upcoming psychiatric hospitalization given social media data from platforms unseen in the classifiers’ training data despite the preliminary evidence on identity fragmentation on the investigated social media platforms. METHODS: Windowed timeline data of patients with a diagnosis of schizophrenia spectrum disorder before a known hospitalization event and healthy controls were gathered from 3 platforms: Facebook (254/268, 94.8% of participants), Twitter (51/268, 19% of participants), and Instagram (134/268, 50% of participants). We then used a 3 × 3 combinatorial binary classification design to train machine learning classifiers and evaluate their performance on testing data from all available platforms. We further compared results from models in intraplatform experiments (ie, training and testing data belonging to the same platform) to those from models in interplatform experiments (ie, training and testing data belonging to different platforms). Finally, we used Shapley Additive Explanation values to extract the top predictive features to explain and compare the underlying constructs that predict hospitalization on each platform. RESULTS: We found that models in intraplatform experiments on average achieved an F(1)-score of 0.72 (SD 0.07) in predicting a psychiatric hospitalization because of schizophrenia spectrum disorder, which is 68% higher than the average of models in interplatform experiments at an F(1)-score of 0.428 (SD 0.11). When investigating the key drivers for divergence in construct validities between models, an analysis of top features for the intraplatform models showed both low predictive feature overlap between the platforms and low pairwise rank correlation (<0.1) between the platforms’ top feature rankings. Furthermore, low average cosine similarity of data between platforms within participants in comparison with the same measurement on data within platforms between participants points to evidence of identity fragmentation of participants between platforms. CONCLUSIONS: We demonstrated that models built on one platform’s data to predict critical mental health treatment outcomes such as hospitalization do not generalize to another platform. In our case, this is because different social media platforms consistently reflect different segments of participants’ identities. With the changing ecosystem of social media use among different demographic groups and as web-based identities continue to become fragmented across platforms, further research on holistic approaches to harnessing these diverse data sources is required. JMIR Publications 2022-12-30 /pmc/articles/PMC9840099/ /pubmed/36583932 http://dx.doi.org/10.2196/39747 Text en ©Viet Cuong Nguyen, Nathaniel Lu, John M Kane, Michael L Birnbaum, Munmun De Choudhury. Originally published in JMIR Mental Health (https://mental.jmir.org), 30.12.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Mental Health, is properly cited. The complete bibliographic information, a link to the original publication on https://mental.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Nguyen, Viet Cuong
Lu, Nathaniel
Kane, John M
Birnbaum, Michael L
De Choudhury, Munmun
Cross-Platform Detection of Psychiatric Hospitalization via Social Media Data: Comparison Study
title Cross-Platform Detection of Psychiatric Hospitalization via Social Media Data: Comparison Study
title_full Cross-Platform Detection of Psychiatric Hospitalization via Social Media Data: Comparison Study
title_fullStr Cross-Platform Detection of Psychiatric Hospitalization via Social Media Data: Comparison Study
title_full_unstemmed Cross-Platform Detection of Psychiatric Hospitalization via Social Media Data: Comparison Study
title_short Cross-Platform Detection of Psychiatric Hospitalization via Social Media Data: Comparison Study
title_sort cross-platform detection of psychiatric hospitalization via social media data: comparison study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9840099/
https://www.ncbi.nlm.nih.gov/pubmed/36583932
http://dx.doi.org/10.2196/39747
work_keys_str_mv AT nguyenvietcuong crossplatformdetectionofpsychiatrichospitalizationviasocialmediadatacomparisonstudy
AT lunathaniel crossplatformdetectionofpsychiatrichospitalizationviasocialmediadatacomparisonstudy
AT kanejohnm crossplatformdetectionofpsychiatrichospitalizationviasocialmediadatacomparisonstudy
AT birnbaummichaell crossplatformdetectionofpsychiatrichospitalizationviasocialmediadatacomparisonstudy
AT dechoudhurymunmun crossplatformdetectionofpsychiatrichospitalizationviasocialmediadatacomparisonstudy