Cargando…

Exploring the Utility of Community-Generated Social Media Content for Detecting Depression: An Analytical Study on Instagram

BACKGROUND: The content produced by individuals on various social media platforms has been successfully used to identify mental illness, including depression. However, most of the previous work in this area has focused on user-generated content, that is, content created by the individual, such as an...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ricard, Benjamin J, Marsch, Lisa A, Crosier, Benjamin, Hassanpour, Saeed
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2018
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6302231/ https://www.ncbi.nlm.nih.gov/pubmed/30522991 http://dx.doi.org/10.2196/11817

_version_	1783381942178152448
author	Ricard, Benjamin J Marsch, Lisa A Crosier, Benjamin Hassanpour, Saeed
author_facet	Ricard, Benjamin J Marsch, Lisa A Crosier, Benjamin Hassanpour, Saeed
author_sort	Ricard, Benjamin J
collection	PubMed
description	BACKGROUND: The content produced by individuals on various social media platforms has been successfully used to identify mental illness, including depression. However, most of the previous work in this area has focused on user-generated content, that is, content created by the individual, such as an individual’s posts and pictures. In this study, we explored the predictive capability of community-generated content, that is, the data generated by a community of friends or followers, rather than by a sole individual, to identify depression among social media users. OBJECTIVE: The objective of this research was to evaluate the utility of community-generated content on social media, such as comments on an individual’s posts, to predict depression as defined by the clinically validated Patient Health Questionnaire-8 (PHQ-8) assessment questionnaire. We hypothesized that the results of this research may provide new insights into next generation of population-level mental illness risk assessment and intervention delivery. METHODS: We created a Web-based survey on a crowdsourcing platform through which participants granted access to their Instagram profiles as well as provided their responses to PHQ-8 as a reference standard for depression status. After data quality assurance and postprocessing, the study analyzed the data of 749 participants. To build our predictive model, linguistic features were extracted from Instagram post captions and comments, including multiple sentiment scores, emoji sentiment analysis results, and meta-variables such as the number of likes and average comment length. In this study, 10.4% (78/749) of the data were held out as a test set. The remaining 89.6% (671/749) of the data were used to train an elastic-net regularized linear regression model to predict PHQ-8 scores. We compared different versions of this model (ie, a model trained on only user-generated data, a model trained on only community-generated data, and a model trained on the combination of both types of data) on a test set to explore the utility of community-generated data in our predictive analysis. RESULTS: The 2 models, the first trained on only community-generated data (area under curve [AUC]=0.71) and the second trained on a combination of user-generated and community-generated data (AUC=0.72), had statistically significant performances for predicting depression based on the Mann-Whitney U test (P=.03 and P=.02, respectively). The model trained on only user-generated data (AUC=0.63; P=.11) did not achieve statistically significant results. The coefficients of the models revealed that our combined data classifier effectively amalgamated both user-generated and community-generated data and that the 2 feature sets were complementary and contained nonoverlapping information in our predictive analysis. CONCLUSIONS: The results presented in this study indicate that leveraging community-generated data from social media, in addition to user-generated data, can be informative for predicting depression among social media users.
format	Online Article Text
id	pubmed-6302231
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-63022312019-01-16 Exploring the Utility of Community-Generated Social Media Content for Detecting Depression: An Analytical Study on Instagram Ricard, Benjamin J Marsch, Lisa A Crosier, Benjamin Hassanpour, Saeed J Med Internet Res Original Paper BACKGROUND: The content produced by individuals on various social media platforms has been successfully used to identify mental illness, including depression. However, most of the previous work in this area has focused on user-generated content, that is, content created by the individual, such as an individual’s posts and pictures. In this study, we explored the predictive capability of community-generated content, that is, the data generated by a community of friends or followers, rather than by a sole individual, to identify depression among social media users. OBJECTIVE: The objective of this research was to evaluate the utility of community-generated content on social media, such as comments on an individual’s posts, to predict depression as defined by the clinically validated Patient Health Questionnaire-8 (PHQ-8) assessment questionnaire. We hypothesized that the results of this research may provide new insights into next generation of population-level mental illness risk assessment and intervention delivery. METHODS: We created a Web-based survey on a crowdsourcing platform through which participants granted access to their Instagram profiles as well as provided their responses to PHQ-8 as a reference standard for depression status. After data quality assurance and postprocessing, the study analyzed the data of 749 participants. To build our predictive model, linguistic features were extracted from Instagram post captions and comments, including multiple sentiment scores, emoji sentiment analysis results, and meta-variables such as the number of likes and average comment length. In this study, 10.4% (78/749) of the data were held out as a test set. The remaining 89.6% (671/749) of the data were used to train an elastic-net regularized linear regression model to predict PHQ-8 scores. We compared different versions of this model (ie, a model trained on only user-generated data, a model trained on only community-generated data, and a model trained on the combination of both types of data) on a test set to explore the utility of community-generated data in our predictive analysis. RESULTS: The 2 models, the first trained on only community-generated data (area under curve [AUC]=0.71) and the second trained on a combination of user-generated and community-generated data (AUC=0.72), had statistically significant performances for predicting depression based on the Mann-Whitney U test (P=.03 and P=.02, respectively). The model trained on only user-generated data (AUC=0.63; P=.11) did not achieve statistically significant results. The coefficients of the models revealed that our combined data classifier effectively amalgamated both user-generated and community-generated data and that the 2 feature sets were complementary and contained nonoverlapping information in our predictive analysis. CONCLUSIONS: The results presented in this study indicate that leveraging community-generated data from social media, in addition to user-generated data, can be informative for predicting depression among social media users. JMIR Publications 2018-12-06 /pmc/articles/PMC6302231/ /pubmed/30522991 http://dx.doi.org/10.2196/11817 Text en ©Benjamin J Ricard, Lisa A Marsch, Benjamin Crosier, Saeed Hassanpour. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 06.12.2018. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle	Original Paper Ricard, Benjamin J Marsch, Lisa A Crosier, Benjamin Hassanpour, Saeed Exploring the Utility of Community-Generated Social Media Content for Detecting Depression: An Analytical Study on Instagram
title	Exploring the Utility of Community-Generated Social Media Content for Detecting Depression: An Analytical Study on Instagram
title_full	Exploring the Utility of Community-Generated Social Media Content for Detecting Depression: An Analytical Study on Instagram
title_fullStr	Exploring the Utility of Community-Generated Social Media Content for Detecting Depression: An Analytical Study on Instagram
title_full_unstemmed	Exploring the Utility of Community-Generated Social Media Content for Detecting Depression: An Analytical Study on Instagram
title_short	Exploring the Utility of Community-Generated Social Media Content for Detecting Depression: An Analytical Study on Instagram
title_sort	exploring the utility of community-generated social media content for detecting depression: an analytical study on instagram
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6302231/ https://www.ncbi.nlm.nih.gov/pubmed/30522991 http://dx.doi.org/10.2196/11817
work_keys_str_mv	AT ricardbenjaminj exploringtheutilityofcommunitygeneratedsocialmediacontentfordetectingdepressionananalyticalstudyoninstagram AT marschlisaa exploringtheutilityofcommunitygeneratedsocialmediacontentfordetectingdepressionananalyticalstudyoninstagram AT crosierbenjamin exploringtheutilityofcommunitygeneratedsocialmediacontentfordetectingdepressionananalyticalstudyoninstagram AT hassanpoursaeed exploringtheutilityofcommunitygeneratedsocialmediacontentfordetectingdepressionananalyticalstudyoninstagram

Exploring the Utility of Community-Generated Social Media Content for Detecting Depression: An Analytical Study on Instagram

Ejemplares similares