Cargando…

Quantifying depression-related language on social media during the COVID-19 pandemic

INTRODUCTION: The COVID-19 pandemic had clear impacts on mental health. Social media presents an opportunity for assessing mental health at the population level. OBJECTIVES: 1) Identify and describe language used on social media that is associated with discourse about depression. 2) Describe the ass...

Descripción completa

Detalles Bibliográficos
Autores principales:	Davis, Brent D., McKnight, Dawn Estes, Teodorescu, Daniela, Quan-Haase, Anabel, Chunara, Rumi, Fyshe, Alona, Lizotte, Daniel J.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Swansea University 2022
Materias:	Population Data Science
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9052361/ https://www.ncbi.nlm.nih.gov/pubmed/35516163 http://dx.doi.org/10.23889/ijpds.v5i4.1716

_version_	1784696766136844288
author	Davis, Brent D. McKnight, Dawn Estes Teodorescu, Daniela Quan-Haase, Anabel Chunara, Rumi Fyshe, Alona Lizotte, Daniel J.
author_facet	Davis, Brent D. McKnight, Dawn Estes Teodorescu, Daniela Quan-Haase, Anabel Chunara, Rumi Fyshe, Alona Lizotte, Daniel J.
author_sort	Davis, Brent D.
collection	PubMed
description	INTRODUCTION: The COVID-19 pandemic had clear impacts on mental health. Social media presents an opportunity for assessing mental health at the population level. OBJECTIVES: 1) Identify and describe language used on social media that is associated with discourse about depression. 2) Describe the associations between identified language and COVID-19 incidence over time across several geographies. METHODS: We create a word embedding based on the posts in Reddit’s /r/Depression and use this word embedding to train representations of active authors. We contrast these authors against a control group and extract keywords that capture differences between the two groups. We filter these keywords for face validity and to match character limits of an information retrieval system, Elasticsearch. We retrieve all geo-tagged posts on Twitter from April 2019 to June 2021 from Seattle, Sydney, Mumbai, and Toronto. The tweets are scored with BM25 using the keywords. We call this score rDD. We compare changes in average score over time with case counts from the pandemic’s beginning through June 2021. RESULTS: We observe a pattern in rDD across all cities analyzed: There is an increase in rDD near the start of the pandemic which levels off over time. However, in Mumbai we also see an increase aligned with a second wave of cases. CONCLUSIONS: Our results are concordant with other studies which indicate that the impact of the pandemic on mental health was highest initially and was followed by recovery, largely unchanged by subsequent waves. However, in the Mumbai data we observed a substantial rise in rDD with a large second wave. Our results indicate possible un-captured heterogeneity across geographies, and point to a need for a better understanding of this differential impact on mental health.
format	Online Article Text
id	pubmed-9052361
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Swansea University
record_format	MEDLINE/PubMed
spelling	pubmed-90523612022-05-04 Quantifying depression-related language on social media during the COVID-19 pandemic Davis, Brent D. McKnight, Dawn Estes Teodorescu, Daniela Quan-Haase, Anabel Chunara, Rumi Fyshe, Alona Lizotte, Daniel J. Int J Popul Data Sci Population Data Science INTRODUCTION: The COVID-19 pandemic had clear impacts on mental health. Social media presents an opportunity for assessing mental health at the population level. OBJECTIVES: 1) Identify and describe language used on social media that is associated with discourse about depression. 2) Describe the associations between identified language and COVID-19 incidence over time across several geographies. METHODS: We create a word embedding based on the posts in Reddit’s /r/Depression and use this word embedding to train representations of active authors. We contrast these authors against a control group and extract keywords that capture differences between the two groups. We filter these keywords for face validity and to match character limits of an information retrieval system, Elasticsearch. We retrieve all geo-tagged posts on Twitter from April 2019 to June 2021 from Seattle, Sydney, Mumbai, and Toronto. The tweets are scored with BM25 using the keywords. We call this score rDD. We compare changes in average score over time with case counts from the pandemic’s beginning through June 2021. RESULTS: We observe a pattern in rDD across all cities analyzed: There is an increase in rDD near the start of the pandemic which levels off over time. However, in Mumbai we also see an increase aligned with a second wave of cases. CONCLUSIONS: Our results are concordant with other studies which indicate that the impact of the pandemic on mental health was highest initially and was followed by recovery, largely unchanged by subsequent waves. However, in the Mumbai data we observed a substantial rise in rDD with a large second wave. Our results indicate possible un-captured heterogeneity across geographies, and point to a need for a better understanding of this differential impact on mental health. Swansea University 2022-03-30 /pmc/articles/PMC9052361/ /pubmed/35516163 http://dx.doi.org/10.23889/ijpds.v5i4.1716 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
spellingShingle	Population Data Science Davis, Brent D. McKnight, Dawn Estes Teodorescu, Daniela Quan-Haase, Anabel Chunara, Rumi Fyshe, Alona Lizotte, Daniel J. Quantifying depression-related language on social media during the COVID-19 pandemic
title	Quantifying depression-related language on social media during the COVID-19 pandemic
title_full	Quantifying depression-related language on social media during the COVID-19 pandemic
title_fullStr	Quantifying depression-related language on social media during the COVID-19 pandemic
title_full_unstemmed	Quantifying depression-related language on social media during the COVID-19 pandemic
title_short	Quantifying depression-related language on social media during the COVID-19 pandemic
title_sort	quantifying depression-related language on social media during the covid-19 pandemic
topic	Population Data Science
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9052361/ https://www.ncbi.nlm.nih.gov/pubmed/35516163 http://dx.doi.org/10.23889/ijpds.v5i4.1716
work_keys_str_mv	AT davisbrentd quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic AT mcknightdawnestes quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic AT teodorescudaniela quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic AT quanhaaseanabel quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic AT chunararumi quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic AT fyshealona quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic AT lizottedanielj quantifyingdepressionrelatedlanguageonsocialmediaduringthecovid19pandemic

Quantifying depression-related language on social media during the COVID-19 pandemic

Ejemplares similares