
Mining long-COVID symptoms from Reddit: characterizing post-COVID syndrome from patient reports


Bibliographic Details
Main authors: Sarker, Abeed; Ge, Yao
Format: Online Article Text
Language: English
Published: Oxford University Press 2021
Subjects: Brief Communications
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8411371/
https://www.ncbi.nlm.nih.gov/pubmed/34485849
http://dx.doi.org/10.1093/jamiaopen/ooab075
author Sarker, Abeed
Ge, Yao
collection PubMed
description Our objective was to mine Reddit to discover long-COVID symptoms self-reported by users, compare symptom distributions across studies, and create a symptom lexicon. We retrieved posts from the /r/covidlonghaulers subreddit and extracted symptoms via approximate matching using an expanded meta-lexicon. We mapped the extracted symptoms to standard concept IDs, compared their distributions with those reported in recent literature and analyzed their distributions over time. From 42 995 posts by 4249 users, we identified 1744 users who expressed at least 1 symptom. The most frequently reported long-COVID symptoms were mental health-related symptoms (55.2%), fatigue (51.2%), general ache/pain (48.4%), brain fog/confusion (32.8%), and dyspnea (28.9%) among users reporting at least 1 symptom. Comparison with recent literature revealed a large variance in reported symptoms across studies. Temporal analysis showed several persistent symptoms up to 15 months after infection. The spectrum of symptoms identified from Reddit may provide early insights about long-COVID.
format Online
Article
Text
id pubmed-8411371
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-8411371
2021-09-03
Mining long-COVID symptoms from Reddit: characterizing post-COVID syndrome from patient reports
Sarker, Abeed
Ge, Yao
JAMIA Open
Brief Communications
Oxford University Press 2021-09-02
/pmc/articles/PMC8411371/
/pubmed/34485849
http://dx.doi.org/10.1093/jamiaopen/ooab075
Text en
© The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association.
https://creativecommons.org/licenses/by-nc/4.0/
This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
title Mining long-COVID symptoms from Reddit: characterizing post-COVID syndrome from patient reports
topic Brief Communications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8411371/
https://www.ncbi.nlm.nih.gov/pubmed/34485849
http://dx.doi.org/10.1093/jamiaopen/ooab075