Cargando…
Quantifying depression-related language on social media during the COVID-19 pandemic
INTRODUCTION: The COVID-19 pandemic had clear impacts on mental health. Social media presents an opportunity for assessing mental health at the population level. OBJECTIVES: 1) Identify and describe language used on social media that is associated with discourse about depression. 2) Describe the ass...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Swansea University
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9052361/ https://www.ncbi.nlm.nih.gov/pubmed/35516163 http://dx.doi.org/10.23889/ijpds.v5i4.1716 |
_version_ | 1784696766136844288 |
---|---|
author | Davis, Brent D. McKnight, Dawn Estes Teodorescu, Daniela Quan-Haase, Anabel Chunara, Rumi Fyshe, Alona Lizotte, Daniel J. |
author_facet | Davis, Brent D. McKnight, Dawn Estes Teodorescu, Daniela Quan-Haase, Anabel Chunara, Rumi Fyshe, Alona Lizotte, Daniel J. |
author_sort | Davis, Brent D. |
collection | PubMed |
description | INTRODUCTION: The COVID-19 pandemic had clear impacts on mental health. Social media presents an opportunity for assessing mental health at the population level. OBJECTIVES: 1) Identify and describe language used on social media that is associated with discourse about depression. 2) Describe the associations between identified language and COVID-19 incidence over time across several geographies. METHODS: We create a word embedding based on the posts in Reddit’s /r/Depression and use this word embedding to train representations of active authors. We contrast these authors against a control group and extract keywords that capture differences between the two groups. We filter these keywords for face validity and to match character limits of an information retrieval system, Elasticsearch. We retrieve all geo-tagged posts on Twitter from April 2019 to June 2021 from Seattle, Sydney, Mumbai, and Toronto. The tweets are scored with BM25 using the keywords. We call this score rDD. We compare changes in average score over time with case counts from the pandemic’s beginning through June 2021. RESULTS: We observe a pattern in rDD across all cities analyzed: There is an increase in rDD near the start of the pandemic which levels off over time. However, in Mumbai we also see an increase aligned with a second wave of cases. CONCLUSIONS: Our results are concordant with other studies which indicate that the impact of the pandemic on mental health was highest initially and was followed by recovery, largely unchanged by subsequent waves. However, in the Mumbai data we observed a substantial rise in rDD with a large second wave. Our results indicate possible un-captured heterogeneity across geographies, and point to a need for a better understanding of this differential impact on mental health. |
format | Online Article Text |
id | pubmed-9052361 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Swansea University |
record_format | MEDLINE/PubMed |
spelling | pubmed-90523612022-05-04 Quantifying depression-related language on social media during the COVID-19 pandemic Davis, Brent D. McKnight, Dawn Estes Teodorescu, Daniela Quan-Haase, Anabel Chunara, Rumi Fyshe, Alona Lizotte, Daniel J. Int J Popul Data Sci Population Data Science INTRODUCTION: The COVID-19 pandemic had clear impacts on mental health. Social media presents an opportunity for assessing mental health at the population level. OBJECTIVES: 1) Identify and describe language used on social media that is associated with discourse about depression. 2) Describe the associations between identified language and COVID-19 incidence over time across several geographies. METHODS: We create a word embedding based on the posts in Reddit’s /r/Depression and use this word embedding to train representations of active authors. We contrast these authors against a control group and extract keywords that capture differences between the two groups. We filter these keywords for face validity and to match character limits of an information retrieval system, Elasticsearch. We retrieve all geo-tagged posts on Twitter from April 2019 to June 2021 from Seattle, Sydney, Mumbai, and Toronto. The tweets are scored with BM25 using the keywords. We call this score rDD. We compare changes in average score over time with case counts from the pandemic’s beginning through June 2021. RESULTS: We observe a pattern in rDD across all cities analyzed: There is an increase in rDD near the start of the pandemic which levels off over time. However, in Mumbai we also see an increase aligned with a second wave of cases. CONCLUSIONS: Our results are concordant with other studies which indicate that the impact of the pandemic on mental health was highest initially and was followed by recovery, largely unchanged by subsequent waves. However, in the Mumbai data we observed a substantial rise in rDD with a large second wave. Our results indicate possible un-captured heterogeneity across geographies, and point to a need for a better understanding of this differential impact on mental health. Swansea University 2022-03-30 /pmc/articles/PMC9052361/ /pubmed/35516163 http://dx.doi.org/10.23889/ijpds.v5i4.1716 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. |
spellingShingle | Population Data Science Davis, Brent D. McKnight, Dawn Estes Teodorescu, Daniela Quan-Haase, Anabel Chunara, Rumi Fyshe, Alona Lizotte, Daniel J. Quantifying depression-related language on social media during the COVID-19 pandemic |
title | Quantifying depression-related language on social media during the COVID-19 pandemic |
title_full | Quantifying depression-related language on social media during the COVID-19 pandemic |
title_fullStr | Quantifying depression-related language on social media during the COVID-19 pandemic |
title_full_unstemmed | Quantifying depression-related language on social media during the COVID-19 pandemic |
title_short | Quantifying depression-related language on social media during the COVID-19 pandemic |
title_sort | quantifying depression-related language on social media during the covid-19 pandemic |
topic | Population Data Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9052361/ https://www.ncbi.nlm.nih.gov/pubmed/35516163 http://dx.doi.org/10.23889/ijpds.v5i4.1716 |
work_keys_str_mv | AT davisbrentd quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic AT mcknightdawnestes quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic AT teodorescudaniela quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic AT quanhaaseanabel quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic AT chunararumi quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic AT fyshealona quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic AT lizottedanielj quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic |