Cargando…

Monitoring Depression Trends on Twitter During the COVID-19 Pandemic: Observational Study

BACKGROUND: The COVID-19 pandemic has affected people’s daily lives and has caused economic loss worldwide. Anecdotal evidence suggests that the pandemic has increased depression levels among the population. However, systematic studies of depression detection and monitoring during the pandemic are l...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yipeng, Lyu, Hanjia, Liu, Yubao, Zhang, Xiyang, Wang, Yu, Luo, Jiebo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8330892/
https://www.ncbi.nlm.nih.gov/pubmed/34458682
http://dx.doi.org/10.2196/26769
_version_ 1783732816033349632
author Zhang, Yipeng
Lyu, Hanjia
Liu, Yubao
Zhang, Xiyang
Wang, Yu
Luo, Jiebo
author_facet Zhang, Yipeng
Lyu, Hanjia
Liu, Yubao
Zhang, Xiyang
Wang, Yu
Luo, Jiebo
author_sort Zhang, Yipeng
collection PubMed
description BACKGROUND: The COVID-19 pandemic has affected people’s daily lives and has caused economic loss worldwide. Anecdotal evidence suggests that the pandemic has increased depression levels among the population. However, systematic studies of depression detection and monitoring during the pandemic are lacking. OBJECTIVE: This study aims to develop a method to create a large-scale depression user data set in an automatic fashion so that the method is scalable and can be adapted to future events; verify the effectiveness of transformer-based deep learning language models in identifying depression users from their everyday language; examine psychological text features’ importance when used in depression classification; and, finally, use the model for monitoring the fluctuation of depression levels of different groups as the disease propagates. METHODS: To study this subject, we designed an effective regular expression-based search method and created the largest English Twitter depression data set containing 2575 distinct identified users with depression and their past tweets. To examine the effect of depression on people’s Twitter language, we trained three transformer-based depression classification models on the data set, evaluated their performance with progressively increased training sizes, and compared the model’s tweet chunk-level and user-level performances. Furthermore, inspired by psychological studies, we created a fusion classifier that combines deep learning model scores with psychological text features and users’ demographic information, and investigated these features’ relations to depression signals. Finally, we demonstrated our model’s capability of monitoring both group-level and population-level depression trends by presenting two of its applications during the COVID-19 pandemic. RESULTS: Our fusion model demonstrated an accuracy of 78.9% on a test set containing 446 people, half of which were identified as having depression. Conscientiousness, neuroticism, appearance of first person pronouns, talking about biological processes such as eat and sleep, talking about power, and exhibiting sadness were shown to be important features in depression classification. Further, when used for monitoring the depression trend, our model showed that depressive users, in general, responded to the pandemic later than the control group based on their tweets (n=500). It was also shown that three US states—New York, California, and Florida—shared a similar depression trend as the whole US population (n=9050). When compared to New York and California, people in Florida demonstrated a substantially lower level of depression. CONCLUSIONS: This study proposes an efficient method that can be used to analyze the depression level of different groups of people on Twitter. We hope this study can raise awareness among researchers and the public of COVID-19’s impact on people’s mental health. The noninvasive monitoring system can also be readily adapted to other big events besides COVID-19 and can be useful during future outbreaks.
format Online
Article
Text
id pubmed-8330892
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-83308922021-08-24 Monitoring Depression Trends on Twitter During the COVID-19 Pandemic: Observational Study Zhang, Yipeng Lyu, Hanjia Liu, Yubao Zhang, Xiyang Wang, Yu Luo, Jiebo JMIR Infodemiology Original Paper BACKGROUND: The COVID-19 pandemic has affected people’s daily lives and has caused economic loss worldwide. Anecdotal evidence suggests that the pandemic has increased depression levels among the population. However, systematic studies of depression detection and monitoring during the pandemic are lacking. OBJECTIVE: This study aims to develop a method to create a large-scale depression user data set in an automatic fashion so that the method is scalable and can be adapted to future events; verify the effectiveness of transformer-based deep learning language models in identifying depression users from their everyday language; examine psychological text features’ importance when used in depression classification; and, finally, use the model for monitoring the fluctuation of depression levels of different groups as the disease propagates. METHODS: To study this subject, we designed an effective regular expression-based search method and created the largest English Twitter depression data set containing 2575 distinct identified users with depression and their past tweets. To examine the effect of depression on people’s Twitter language, we trained three transformer-based depression classification models on the data set, evaluated their performance with progressively increased training sizes, and compared the model’s tweet chunk-level and user-level performances. Furthermore, inspired by psychological studies, we created a fusion classifier that combines deep learning model scores with psychological text features and users’ demographic information, and investigated these features’ relations to depression signals. Finally, we demonstrated our model’s capability of monitoring both group-level and population-level depression trends by presenting two of its applications during the COVID-19 pandemic. RESULTS: Our fusion model demonstrated an accuracy of 78.9% on a test set containing 446 people, half of which were identified as having depression. Conscientiousness, neuroticism, appearance of first person pronouns, talking about biological processes such as eat and sleep, talking about power, and exhibiting sadness were shown to be important features in depression classification. Further, when used for monitoring the depression trend, our model showed that depressive users, in general, responded to the pandemic later than the control group based on their tweets (n=500). It was also shown that three US states—New York, California, and Florida—shared a similar depression trend as the whole US population (n=9050). When compared to New York and California, people in Florida demonstrated a substantially lower level of depression. CONCLUSIONS: This study proposes an efficient method that can be used to analyze the depression level of different groups of people on Twitter. We hope this study can raise awareness among researchers and the public of COVID-19’s impact on people’s mental health. The noninvasive monitoring system can also be readily adapted to other big events besides COVID-19 and can be useful during future outbreaks. JMIR Publications 2021-07-18 /pmc/articles/PMC8330892/ /pubmed/34458682 http://dx.doi.org/10.2196/26769 Text en ©Yipeng Zhang, Hanjia Lyu, Yubao Liu, Xiyang Zhang, Yu Wang, Jiebo Luo. Originally published in the JMIR Infodemiology (https://infodemiology.jmir.org), 07.04.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the JMIR Infodemiology, is properly cited. The complete bibliographic information, a link to the original publication on https://infodemiology.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Zhang, Yipeng
Lyu, Hanjia
Liu, Yubao
Zhang, Xiyang
Wang, Yu
Luo, Jiebo
Monitoring Depression Trends on Twitter During the COVID-19 Pandemic: Observational Study
title Monitoring Depression Trends on Twitter During the COVID-19 Pandemic: Observational Study
title_full Monitoring Depression Trends on Twitter During the COVID-19 Pandemic: Observational Study
title_fullStr Monitoring Depression Trends on Twitter During the COVID-19 Pandemic: Observational Study
title_full_unstemmed Monitoring Depression Trends on Twitter During the COVID-19 Pandemic: Observational Study
title_short Monitoring Depression Trends on Twitter During the COVID-19 Pandemic: Observational Study
title_sort monitoring depression trends on twitter during the covid-19 pandemic: observational study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8330892/
https://www.ncbi.nlm.nih.gov/pubmed/34458682
http://dx.doi.org/10.2196/26769
work_keys_str_mv AT zhangyipeng monitoringdepressiontrendsontwitterduringthecovid19pandemicobservationalstudy
AT lyuhanjia monitoringdepressiontrendsontwitterduringthecovid19pandemicobservationalstudy
AT liuyubao monitoringdepressiontrendsontwitterduringthecovid19pandemicobservationalstudy
AT zhangxiyang monitoringdepressiontrendsontwitterduringthecovid19pandemicobservationalstudy
AT wangyu monitoringdepressiontrendsontwitterduringthecovid19pandemicobservationalstudy
AT luojiebo monitoringdepressiontrendsontwitterduringthecovid19pandemicobservationalstudy