Cargando…

Using Twitter Data to Estimate the Prevalence of Symptoms of Mental Disorders in the United States During the COVID-19 Pandemic: Ecological Cohort Study

BACKGROUND: Existing research and national surveillance data suggest an increase of the prevalence of mental disorders during the COVID-19 pandemic. Social media platforms, such as Twitter, could be a source of data for estimation owing to its real-time nature, high availability, and large geographi...

Descripción completa

Detalles Bibliográficos
Autores principales: Cai, Ruilie, Zhang, Jiajia, Li, Zhenlong, Zeng, Chengbo, Qiao, Shan, Li, Xiaoming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9770024/
https://www.ncbi.nlm.nih.gov/pubmed/36459569
http://dx.doi.org/10.2196/37582
_version_ 1784854501550718976
author Cai, Ruilie
Zhang, Jiajia
Li, Zhenlong
Zeng, Chengbo
Qiao, Shan
Li, Xiaoming
author_facet Cai, Ruilie
Zhang, Jiajia
Li, Zhenlong
Zeng, Chengbo
Qiao, Shan
Li, Xiaoming
author_sort Cai, Ruilie
collection PubMed
description BACKGROUND: Existing research and national surveillance data suggest an increase of the prevalence of mental disorders during the COVID-19 pandemic. Social media platforms, such as Twitter, could be a source of data for estimation owing to its real-time nature, high availability, and large geographical coverage. However, there is a dearth of studies validating the accuracy of the prevalence of mental disorders on Twitter compared to that reported by the Centers for Disease Control and Prevention (CDC). OBJECTIVE: This study aims to verify the feasibility of Twitter-based prevalence of mental disorders symptoms being an instrument for prevalence estimation, where feasibility is gauged via correlations between Twitter-based prevalence of mental disorder symptoms (ie, anxiety and depressive symptoms) and that based on national surveillance data. In addition, this study aims to identify how the correlations changed over time (ie, the temporal trend). METHODS: State-level prevalence of anxiety and depressive symptoms was retrieved from the national Household Pulse Survey (HPS) of the CDC from April 2020 to July 2021. Tweets were retrieved from the Twitter streaming application programming interface during the same period and were used to estimate the prevalence of symptoms of mental disorders for each state using keyword analysis. Stratified linear mixed models were used to evaluate the correlations between the Twitter-based prevalence of symptoms of mental disorders and those reported by the CDC. The magnitude and significance of model parameters were considered to evaluate the correlations. Temporal trends of correlations were tested after adding the time variable to the model. Geospatial differences were compared on the basis of random effects. RESULTS: Pearson correlation coefficients between the overall prevalence reported by the CDC and that on Twitter for anxiety and depressive symptoms were 0.587 (P<.001) and 0.368 (P<.001), respectively. Stratified by 4 phases (ie, April 2020, August 2020, October 2020, and April 2021) defined by the HPS, linear mixed models showed that Twitter-based prevalence for anxiety symptoms had a positive and significant correlation with CDC-reported prevalence in phases 2 and 3, while a significant correlation for depressive symptoms was identified in phases 1 and 3. CONCLUSIONS: Positive correlations were identified between Twitter-based and CDC-reported prevalence, and temporal trends of these correlations were found. Geospatial differences in the prevalence of symptoms of mental disorders were found between the northern and southern United States. Findings from this study could inform future investigation on leveraging social media platforms to estimate symptoms of mental disorders and the provision of immediate prevention measures to improve health outcomes.
format Online
Article
Text
id pubmed-9770024
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-97700242022-12-22 Using Twitter Data to Estimate the Prevalence of Symptoms of Mental Disorders in the United States During the COVID-19 Pandemic: Ecological Cohort Study Cai, Ruilie Zhang, Jiajia Li, Zhenlong Zeng, Chengbo Qiao, Shan Li, Xiaoming JMIR Form Res Original Paper BACKGROUND: Existing research and national surveillance data suggest an increase of the prevalence of mental disorders during the COVID-19 pandemic. Social media platforms, such as Twitter, could be a source of data for estimation owing to its real-time nature, high availability, and large geographical coverage. However, there is a dearth of studies validating the accuracy of the prevalence of mental disorders on Twitter compared to that reported by the Centers for Disease Control and Prevention (CDC). OBJECTIVE: This study aims to verify the feasibility of Twitter-based prevalence of mental disorders symptoms being an instrument for prevalence estimation, where feasibility is gauged via correlations between Twitter-based prevalence of mental disorder symptoms (ie, anxiety and depressive symptoms) and that based on national surveillance data. In addition, this study aims to identify how the correlations changed over time (ie, the temporal trend). METHODS: State-level prevalence of anxiety and depressive symptoms was retrieved from the national Household Pulse Survey (HPS) of the CDC from April 2020 to July 2021. Tweets were retrieved from the Twitter streaming application programming interface during the same period and were used to estimate the prevalence of symptoms of mental disorders for each state using keyword analysis. Stratified linear mixed models were used to evaluate the correlations between the Twitter-based prevalence of symptoms of mental disorders and those reported by the CDC. The magnitude and significance of model parameters were considered to evaluate the correlations. Temporal trends of correlations were tested after adding the time variable to the model. Geospatial differences were compared on the basis of random effects. RESULTS: Pearson correlation coefficients between the overall prevalence reported by the CDC and that on Twitter for anxiety and depressive symptoms were 0.587 (P<.001) and 0.368 (P<.001), respectively. Stratified by 4 phases (ie, April 2020, August 2020, October 2020, and April 2021) defined by the HPS, linear mixed models showed that Twitter-based prevalence for anxiety symptoms had a positive and significant correlation with CDC-reported prevalence in phases 2 and 3, while a significant correlation for depressive symptoms was identified in phases 1 and 3. CONCLUSIONS: Positive correlations were identified between Twitter-based and CDC-reported prevalence, and temporal trends of these correlations were found. Geospatial differences in the prevalence of symptoms of mental disorders were found between the northern and southern United States. Findings from this study could inform future investigation on leveraging social media platforms to estimate symptoms of mental disorders and the provision of immediate prevention measures to improve health outcomes. JMIR Publications 2022-12-20 /pmc/articles/PMC9770024/ /pubmed/36459569 http://dx.doi.org/10.2196/37582 Text en ©Ruilie Cai, Jiajia Zhang, Zhenlong Li, Chengbo Zeng, Shan Qiao, Xiaoming Li. Originally published in JMIR Formative Research (https://formative.jmir.org), 20.12.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Formative Research, is properly cited. The complete bibliographic information, a link to the original publication on https://formative.jmir.org, as well as this copyright and license information must be included.
spellingShingle Original Paper
Cai, Ruilie
Zhang, Jiajia
Li, Zhenlong
Zeng, Chengbo
Qiao, Shan
Li, Xiaoming
Using Twitter Data to Estimate the Prevalence of Symptoms of Mental Disorders in the United States During the COVID-19 Pandemic: Ecological Cohort Study
title Using Twitter Data to Estimate the Prevalence of Symptoms of Mental Disorders in the United States During the COVID-19 Pandemic: Ecological Cohort Study
title_full Using Twitter Data to Estimate the Prevalence of Symptoms of Mental Disorders in the United States During the COVID-19 Pandemic: Ecological Cohort Study
title_fullStr Using Twitter Data to Estimate the Prevalence of Symptoms of Mental Disorders in the United States During the COVID-19 Pandemic: Ecological Cohort Study
title_full_unstemmed Using Twitter Data to Estimate the Prevalence of Symptoms of Mental Disorders in the United States During the COVID-19 Pandemic: Ecological Cohort Study
title_short Using Twitter Data to Estimate the Prevalence of Symptoms of Mental Disorders in the United States During the COVID-19 Pandemic: Ecological Cohort Study
title_sort using twitter data to estimate the prevalence of symptoms of mental disorders in the united states during the covid-19 pandemic: ecological cohort study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9770024/
https://www.ncbi.nlm.nih.gov/pubmed/36459569
http://dx.doi.org/10.2196/37582
work_keys_str_mv AT cairuilie usingtwitterdatatoestimatetheprevalenceofsymptomsofmentaldisordersintheunitedstatesduringthecovid19pandemicecologicalcohortstudy
AT zhangjiajia usingtwitterdatatoestimatetheprevalenceofsymptomsofmentaldisordersintheunitedstatesduringthecovid19pandemicecologicalcohortstudy
AT lizhenlong usingtwitterdatatoestimatetheprevalenceofsymptomsofmentaldisordersintheunitedstatesduringthecovid19pandemicecologicalcohortstudy
AT zengchengbo usingtwitterdatatoestimatetheprevalenceofsymptomsofmentaldisordersintheunitedstatesduringthecovid19pandemicecologicalcohortstudy
AT qiaoshan usingtwitterdatatoestimatetheprevalenceofsymptomsofmentaldisordersintheunitedstatesduringthecovid19pandemicecologicalcohortstudy
AT lixiaoming usingtwitterdatatoestimatetheprevalenceofsymptomsofmentaldisordersintheunitedstatesduringthecovid19pandemicecologicalcohortstudy