Cargando…

Quantifying depression-related language on social media during the COVID-19 pandemic

INTRODUCTION: The COVID-19 pandemic had clear impacts on mental health. Social media presents an opportunity for assessing mental health at the population level. OBJECTIVES: 1) Identify and describe language used on social media that is associated with discourse about depression. 2) Describe the ass...

Descripción completa

Detalles Bibliográficos
Autores principales: Davis, Brent D., McKnight, Dawn Estes, Teodorescu, Daniela, Quan-Haase, Anabel, Chunara, Rumi, Fyshe, Alona, Lizotte, Daniel J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Swansea University 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9052361/
https://www.ncbi.nlm.nih.gov/pubmed/35516163
http://dx.doi.org/10.23889/ijpds.v5i4.1716
_version_ 1784696766136844288
author Davis, Brent D.
McKnight, Dawn Estes
Teodorescu, Daniela
Quan-Haase, Anabel
Chunara, Rumi
Fyshe, Alona
Lizotte, Daniel J.
author_facet Davis, Brent D.
McKnight, Dawn Estes
Teodorescu, Daniela
Quan-Haase, Anabel
Chunara, Rumi
Fyshe, Alona
Lizotte, Daniel J.
author_sort Davis, Brent D.
collection PubMed
description INTRODUCTION: The COVID-19 pandemic had clear impacts on mental health. Social media presents an opportunity for assessing mental health at the population level. OBJECTIVES: 1) Identify and describe language used on social media that is associated with discourse about depression. 2) Describe the associations between identified language and COVID-19 incidence over time across several geographies. METHODS: We create a word embedding based on the posts in Reddit’s /r/Depression and use this word embedding to train representations of active authors. We contrast these authors against a control group and extract keywords that capture differences between the two groups. We filter these keywords for face validity and to match character limits of an information retrieval system, Elasticsearch. We retrieve all geo-tagged posts on Twitter from April 2019 to June 2021 from Seattle, Sydney, Mumbai, and Toronto. The tweets are scored with BM25 using the keywords. We call this score rDD. We compare changes in average score over time with case counts from the pandemic’s beginning through June 2021. RESULTS: We observe a pattern in rDD across all cities analyzed: There is an increase in rDD near the start of the pandemic which levels off over time. However, in Mumbai we also see an increase aligned with a second wave of cases. CONCLUSIONS: Our results are concordant with other studies which indicate that the impact of the pandemic on mental health was highest initially and was followed by recovery, largely unchanged by subsequent waves. However, in the Mumbai data we observed a substantial rise in rDD with a large second wave. Our results indicate possible un-captured heterogeneity across geographies, and point to a need for a better understanding of this differential impact on mental health.
format Online
Article
Text
id pubmed-9052361
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Swansea University
record_format MEDLINE/PubMed
spelling pubmed-90523612022-05-04 Quantifying depression-related language on social media during the COVID-19 pandemic Davis, Brent D. McKnight, Dawn Estes Teodorescu, Daniela Quan-Haase, Anabel Chunara, Rumi Fyshe, Alona Lizotte, Daniel J. Int J Popul Data Sci Population Data Science INTRODUCTION: The COVID-19 pandemic had clear impacts on mental health. Social media presents an opportunity for assessing mental health at the population level. OBJECTIVES: 1) Identify and describe language used on social media that is associated with discourse about depression. 2) Describe the associations between identified language and COVID-19 incidence over time across several geographies. METHODS: We create a word embedding based on the posts in Reddit’s /r/Depression and use this word embedding to train representations of active authors. We contrast these authors against a control group and extract keywords that capture differences between the two groups. We filter these keywords for face validity and to match character limits of an information retrieval system, Elasticsearch. We retrieve all geo-tagged posts on Twitter from April 2019 to June 2021 from Seattle, Sydney, Mumbai, and Toronto. The tweets are scored with BM25 using the keywords. We call this score rDD. We compare changes in average score over time with case counts from the pandemic’s beginning through June 2021. RESULTS: We observe a pattern in rDD across all cities analyzed: There is an increase in rDD near the start of the pandemic which levels off over time. However, in Mumbai we also see an increase aligned with a second wave of cases. CONCLUSIONS: Our results are concordant with other studies which indicate that the impact of the pandemic on mental health was highest initially and was followed by recovery, largely unchanged by subsequent waves. However, in the Mumbai data we observed a substantial rise in rDD with a large second wave. Our results indicate possible un-captured heterogeneity across geographies, and point to a need for a better understanding of this differential impact on mental health. Swansea University 2022-03-30 /pmc/articles/PMC9052361/ /pubmed/35516163 http://dx.doi.org/10.23889/ijpds.v5i4.1716 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
spellingShingle Population Data Science
Davis, Brent D.
McKnight, Dawn Estes
Teodorescu, Daniela
Quan-Haase, Anabel
Chunara, Rumi
Fyshe, Alona
Lizotte, Daniel J.
Quantifying depression-related language on social media during the COVID-19 pandemic
title Quantifying depression-related language on social media during the COVID-19 pandemic
title_full Quantifying depression-related language on social media during the COVID-19 pandemic
title_fullStr Quantifying depression-related language on social media during the COVID-19 pandemic
title_full_unstemmed Quantifying depression-related language on social media during the COVID-19 pandemic
title_short Quantifying depression-related language on social media during the COVID-19 pandemic
title_sort quantifying depression-related language on social media during the covid-19 pandemic
topic Population Data Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9052361/
https://www.ncbi.nlm.nih.gov/pubmed/35516163
http://dx.doi.org/10.23889/ijpds.v5i4.1716
work_keys_str_mv AT davisbrentd quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic
AT mcknightdawnestes quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic
AT teodorescudaniela quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic
AT quanhaaseanabel quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic
AT chunararumi quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic
AT fyshealona quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic
AT lizottedanielj quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic