Cargando…
Monitoring Depression Trends on Twitter During the COVID-19 Pandemic: Observational Study
BACKGROUND: The COVID-19 pandemic has affected people’s daily lives and has caused economic loss worldwide. Anecdotal evidence suggests that the pandemic has increased depression levels among the population. However, systematic studies of depression detection and monitoring during the pandemic are l...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8330892/ https://www.ncbi.nlm.nih.gov/pubmed/34458682 http://dx.doi.org/10.2196/26769 |
_version_ | 1783732816033349632 |
---|---|
author | Zhang, Yipeng Lyu, Hanjia Liu, Yubao Zhang, Xiyang Wang, Yu Luo, Jiebo |
author_facet | Zhang, Yipeng Lyu, Hanjia Liu, Yubao Zhang, Xiyang Wang, Yu Luo, Jiebo |
author_sort | Zhang, Yipeng |
collection | PubMed |
description | BACKGROUND: The COVID-19 pandemic has affected people’s daily lives and has caused economic loss worldwide. Anecdotal evidence suggests that the pandemic has increased depression levels among the population. However, systematic studies of depression detection and monitoring during the pandemic are lacking. OBJECTIVE: This study aims to develop a method to create a large-scale depression user data set in an automatic fashion so that the method is scalable and can be adapted to future events; verify the effectiveness of transformer-based deep learning language models in identifying depression users from their everyday language; examine psychological text features’ importance when used in depression classification; and, finally, use the model for monitoring the fluctuation of depression levels of different groups as the disease propagates. METHODS: To study this subject, we designed an effective regular expression-based search method and created the largest English Twitter depression data set containing 2575 distinct identified users with depression and their past tweets. To examine the effect of depression on people’s Twitter language, we trained three transformer-based depression classification models on the data set, evaluated their performance with progressively increased training sizes, and compared the model’s tweet chunk-level and user-level performances. Furthermore, inspired by psychological studies, we created a fusion classifier that combines deep learning model scores with psychological text features and users’ demographic information, and investigated these features’ relations to depression signals. Finally, we demonstrated our model’s capability of monitoring both group-level and population-level depression trends by presenting two of its applications during the COVID-19 pandemic. RESULTS: Our fusion model demonstrated an accuracy of 78.9% on a test set containing 446 people, half of which were identified as having depression. Conscientiousness, neuroticism, appearance of first person pronouns, talking about biological processes such as eat and sleep, talking about power, and exhibiting sadness were shown to be important features in depression classification. Further, when used for monitoring the depression trend, our model showed that depressive users, in general, responded to the pandemic later than the control group based on their tweets (n=500). It was also shown that three US states—New York, California, and Florida—shared a similar depression trend as the whole US population (n=9050). When compared to New York and California, people in Florida demonstrated a substantially lower level of depression. CONCLUSIONS: This study proposes an efficient method that can be used to analyze the depression level of different groups of people on Twitter. We hope this study can raise awareness among researchers and the public of COVID-19’s impact on people’s mental health. The noninvasive monitoring system can also be readily adapted to other big events besides COVID-19 and can be useful during future outbreaks. |
format | Online Article Text |
id | pubmed-8330892 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-83308922021-08-24 Monitoring Depression Trends on Twitter During the COVID-19 Pandemic: Observational Study Zhang, Yipeng Lyu, Hanjia Liu, Yubao Zhang, Xiyang Wang, Yu Luo, Jiebo JMIR Infodemiology Original Paper BACKGROUND: The COVID-19 pandemic has affected people’s daily lives and has caused economic loss worldwide. Anecdotal evidence suggests that the pandemic has increased depression levels among the population. However, systematic studies of depression detection and monitoring during the pandemic are lacking. OBJECTIVE: This study aims to develop a method to create a large-scale depression user data set in an automatic fashion so that the method is scalable and can be adapted to future events; verify the effectiveness of transformer-based deep learning language models in identifying depression users from their everyday language; examine psychological text features’ importance when used in depression classification; and, finally, use the model for monitoring the fluctuation of depression levels of different groups as the disease propagates. METHODS: To study this subject, we designed an effective regular expression-based search method and created the largest English Twitter depression data set containing 2575 distinct identified users with depression and their past tweets. To examine the effect of depression on people’s Twitter language, we trained three transformer-based depression classification models on the data set, evaluated their performance with progressively increased training sizes, and compared the model’s tweet chunk-level and user-level performances. Furthermore, inspired by psychological studies, we created a fusion classifier that combines deep learning model scores with psychological text features and users’ demographic information, and investigated these features’ relations to depression signals. Finally, we demonstrated our model’s capability of monitoring both group-level and population-level depression trends by presenting two of its applications during the COVID-19 pandemic. RESULTS: Our fusion model demonstrated an accuracy of 78.9% on a test set containing 446 people, half of which were identified as having depression. Conscientiousness, neuroticism, appearance of first person pronouns, talking about biological processes such as eat and sleep, talking about power, and exhibiting sadness were shown to be important features in depression classification. Further, when used for monitoring the depression trend, our model showed that depressive users, in general, responded to the pandemic later than the control group based on their tweets (n=500). It was also shown that three US states—New York, California, and Florida—shared a similar depression trend as the whole US population (n=9050). When compared to New York and California, people in Florida demonstrated a substantially lower level of depression. CONCLUSIONS: This study proposes an efficient method that can be used to analyze the depression level of different groups of people on Twitter. We hope this study can raise awareness among researchers and the public of COVID-19’s impact on people’s mental health. The noninvasive monitoring system can also be readily adapted to other big events besides COVID-19 and can be useful during future outbreaks. JMIR Publications 2021-07-18 /pmc/articles/PMC8330892/ /pubmed/34458682 http://dx.doi.org/10.2196/26769 Text en ©Yipeng Zhang, Hanjia Lyu, Yubao Liu, Xiyang Zhang, Yu Wang, Jiebo Luo. Originally published in the JMIR Infodemiology (https://infodemiology.jmir.org), 07.04.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the JMIR Infodemiology, is properly cited. The complete bibliographic information, a link to the original publication on https://infodemiology.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Zhang, Yipeng Lyu, Hanjia Liu, Yubao Zhang, Xiyang Wang, Yu Luo, Jiebo Monitoring Depression Trends on Twitter During the COVID-19 Pandemic: Observational Study |
title | Monitoring Depression Trends on Twitter During the COVID-19 Pandemic: Observational Study |
title_full | Monitoring Depression Trends on Twitter During the COVID-19 Pandemic: Observational Study |
title_fullStr | Monitoring Depression Trends on Twitter During the COVID-19 Pandemic: Observational Study |
title_full_unstemmed | Monitoring Depression Trends on Twitter During the COVID-19 Pandemic: Observational Study |
title_short | Monitoring Depression Trends on Twitter During the COVID-19 Pandemic: Observational Study |
title_sort | monitoring depression trends on twitter during the covid-19 pandemic: observational study |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8330892/ https://www.ncbi.nlm.nih.gov/pubmed/34458682 http://dx.doi.org/10.2196/26769 |
work_keys_str_mv | AT zhangyipeng monitoringdepressiontrendsontwitterduringthecovid19pandemicobservationalstudy AT lyuhanjia monitoringdepressiontrendsontwitterduringthecovid19pandemicobservationalstudy AT liuyubao monitoringdepressiontrendsontwitterduringthecovid19pandemicobservationalstudy AT zhangxiyang monitoringdepressiontrendsontwitterduringthecovid19pandemicobservationalstudy AT wangyu monitoringdepressiontrendsontwitterduringthecovid19pandemicobservationalstudy AT luojiebo monitoringdepressiontrendsontwitterduringthecovid19pandemicobservationalstudy |