Cargando…
Mining long-COVID symptoms from Reddit: characterizing post-COVID syndrome from patient reports
Our objective was to mine Reddit to discover long-COVID symptoms self-reported by users, compare symptom distributions across studies, and create a symptom lexicon. We retrieved posts from the /r/covidlonghaulers subreddit and extracted symptoms via approximate matching using an expanded meta-lexico...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8411371/ https://www.ncbi.nlm.nih.gov/pubmed/34485849 http://dx.doi.org/10.1093/jamiaopen/ooab075 |
_version_ | 1783747285995225088 |
---|---|
author | Sarker, Abeed Ge, Yao |
author_facet | Sarker, Abeed Ge, Yao |
author_sort | Sarker, Abeed |
collection | PubMed |
description | Our objective was to mine Reddit to discover long-COVID symptoms self-reported by users, compare symptom distributions across studies, and create a symptom lexicon. We retrieved posts from the /r/covidlonghaulers subreddit and extracted symptoms via approximate matching using an expanded meta-lexicon. We mapped the extracted symptoms to standard concept IDs, compared their distributions with those reported in recent literature and analyzed their distributions over time. From 42 995 posts by 4249 users, we identified 1744 users who expressed at least 1 symptom. The most frequently reported long-COVID symptoms were mental health-related symptoms (55.2%), fatigue (51.2%), general ache/pain (48.4%), brain fog/confusion (32.8%), and dyspnea (28.9%) among users reporting at least 1 symptom. Comparison with recent literature revealed a large variance in reported symptoms across studies. Temporal analysis showed several persistent symptoms up to 15 months after infection. The spectrum of symptoms identified from Reddit may provide early insights about long-COVID. |
format | Online Article Text |
id | pubmed-8411371 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-84113712021-09-03 Mining long-COVID symptoms from Reddit: characterizing post-COVID syndrome from patient reports Sarker, Abeed Ge, Yao JAMIA Open Brief Communications Our objective was to mine Reddit to discover long-COVID symptoms self-reported by users, compare symptom distributions across studies, and create a symptom lexicon. We retrieved posts from the /r/covidlonghaulers subreddit and extracted symptoms via approximate matching using an expanded meta-lexicon. We mapped the extracted symptoms to standard concept IDs, compared their distributions with those reported in recent literature and analyzed their distributions over time. From 42 995 posts by 4249 users, we identified 1744 users who expressed at least 1 symptom. The most frequently reported long-COVID symptoms were mental health-related symptoms (55.2%), fatigue (51.2%), general ache/pain (48.4%), brain fog/confusion (32.8%), and dyspnea (28.9%) among users reporting at least 1 symptom. Comparison with recent literature revealed a large variance in reported symptoms across studies. Temporal analysis showed several persistent symptoms up to 15 months after infection. The spectrum of symptoms identified from Reddit may provide early insights about long-COVID. Oxford University Press 2021-09-02 /pmc/articles/PMC8411371/ /pubmed/34485849 http://dx.doi.org/10.1093/jamiaopen/ooab075 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Brief Communications Sarker, Abeed Ge, Yao Mining long-COVID symptoms from Reddit: characterizing post-COVID syndrome from patient reports |
title | Mining long-COVID symptoms from Reddit: characterizing post-COVID syndrome from patient reports |
title_full | Mining long-COVID symptoms from Reddit: characterizing post-COVID syndrome from patient reports |
title_fullStr | Mining long-COVID symptoms from Reddit: characterizing post-COVID syndrome from patient reports |
title_full_unstemmed | Mining long-COVID symptoms from Reddit: characterizing post-COVID syndrome from patient reports |
title_short | Mining long-COVID symptoms from Reddit: characterizing post-COVID syndrome from patient reports |
title_sort | mining long-covid symptoms from reddit: characterizing post-covid syndrome from patient reports |
topic | Brief Communications |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8411371/ https://www.ncbi.nlm.nih.gov/pubmed/34485849 http://dx.doi.org/10.1093/jamiaopen/ooab075 |
work_keys_str_mv | AT sarkerabeed mininglongcovidsymptomsfromredditcharacterizingpostcovidsyndromefrompatientreports AT geyao mininglongcovidsymptomsfromredditcharacterizingpostcovidsyndromefrompatientreports |