Identification of Risk Factors and Symptoms of COVID-19: Analysis of Biomedical Literature and Social Media Data

Bibliographic Details
Main authors: Jeon, Jouhyun, Baruah, Gaurav, Sarabadani, Sarah, Palanica, Adam
Format: Online Article Text
Language: English
Published: JMIR Publications 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7537723/
https://www.ncbi.nlm.nih.gov/pubmed/32936770
http://dx.doi.org/10.2196/20509
_version_ 1783590722428993536
author Jeon, Jouhyun
Baruah, Gaurav
Sarabadani, Sarah
Palanica, Adam
author_facet Jeon, Jouhyun
Baruah, Gaurav
Sarabadani, Sarah
Palanica, Adam
author_sort Jeon, Jouhyun
collection PubMed
description BACKGROUND: In December 2019, the COVID-19 outbreak started in China and rapidly spread around the world. Lack of a vaccine or optimized intervention raised the importance of characterizing risk factors and symptoms for the early identification and successful treatment of patients with COVID-19. OBJECTIVE: This study aims to investigate and analyze biomedical literature and public social media data to understand the association of risk factors and symptoms with the various outcomes observed in patients with COVID-19. METHODS: Through semantic analysis, we collected 45 retrospective cohort studies, which evaluated 303 clinical and demographic variables across 13 different outcomes of patients with COVID-19, and 84,140 Twitter posts from 1036 COVID-19–positive users. Machine learning tools to extract biomedical information were introduced to identify mentions of uncommon or novel symptoms in tweets. We then examined and compared two data sets to expand our landscape of risk factors and symptoms related to COVID-19. RESULTS: From the biomedical literature, approximately 90% of clinical and demographic variables showed inconsistent associations with COVID-19 outcomes. Consensus analysis identified 72 risk factors that were specifically associated with individual outcomes. From the social media data, 51 symptoms were characterized and analyzed. By comparing social media data with biomedical literature, we identified 25 novel symptoms that were specifically mentioned in tweets but have not previously been well characterized. Furthermore, there were certain combinations of symptoms that were frequently mentioned together in social media. CONCLUSIONS: Identified outcome-specific risk factors, symptoms, and combinations of symptoms may serve as surrogate indicators to identify patients with COVID-19 and predict their clinical outcomes in order to provide appropriate treatments.
format Online
Article
Text
id pubmed-7537723
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-7537723 2020-10-20 Identification of Risk Factors and Symptoms of COVID-19: Analysis of Biomedical Literature and Social Media Data Jeon, Jouhyun Baruah, Gaurav Sarabadani, Sarah Palanica, Adam J Med Internet Res Original Paper BACKGROUND: In December 2019, the COVID-19 outbreak started in China and rapidly spread around the world. Lack of a vaccine or optimized intervention raised the importance of characterizing risk factors and symptoms for the early identification and successful treatment of patients with COVID-19. OBJECTIVE: This study aims to investigate and analyze biomedical literature and public social media data to understand the association of risk factors and symptoms with the various outcomes observed in patients with COVID-19. METHODS: Through semantic analysis, we collected 45 retrospective cohort studies, which evaluated 303 clinical and demographic variables across 13 different outcomes of patients with COVID-19, and 84,140 Twitter posts from 1036 COVID-19–positive users. Machine learning tools to extract biomedical information were introduced to identify mentions of uncommon or novel symptoms in tweets. We then examined and compared two data sets to expand our landscape of risk factors and symptoms related to COVID-19. RESULTS: From the biomedical literature, approximately 90% of clinical and demographic variables showed inconsistent associations with COVID-19 outcomes. Consensus analysis identified 72 risk factors that were specifically associated with individual outcomes. From the social media data, 51 symptoms were characterized and analyzed. By comparing social media data with biomedical literature, we identified 25 novel symptoms that were specifically mentioned in tweets but have not previously been well characterized. Furthermore, there were certain combinations of symptoms that were frequently mentioned together in social media. CONCLUSIONS: Identified outcome-specific risk factors, symptoms, and combinations of symptoms may serve as surrogate indicators to identify patients with COVID-19 and predict their clinical outcomes in order to provide appropriate treatments. JMIR Publications 2020-10-02 /pmc/articles/PMC7537723/ /pubmed/32936770 http://dx.doi.org/10.2196/20509 Text en ©Jouhyun Jeon, Gaurav Baruah, Sarah Sarabadani, Adam Palanica. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 02.10.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Jeon, Jouhyun
Baruah, Gaurav
Sarabadani, Sarah
Palanica, Adam
Identification of Risk Factors and Symptoms of COVID-19: Analysis of Biomedical Literature and Social Media Data
title Identification of Risk Factors and Symptoms of COVID-19: Analysis of Biomedical Literature and Social Media Data
title_full Identification of Risk Factors and Symptoms of COVID-19: Analysis of Biomedical Literature and Social Media Data
title_fullStr Identification of Risk Factors and Symptoms of COVID-19: Analysis of Biomedical Literature and Social Media Data
title_full_unstemmed Identification of Risk Factors and Symptoms of COVID-19: Analysis of Biomedical Literature and Social Media Data
title_short Identification of Risk Factors and Symptoms of COVID-19: Analysis of Biomedical Literature and Social Media Data
title_sort identification of risk factors and symptoms of covid-19: analysis of biomedical literature and social media data
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7537723/
https://www.ncbi.nlm.nih.gov/pubmed/32936770
http://dx.doi.org/10.2196/20509
work_keys_str_mv AT jeonjouhyun identificationofriskfactorsandsymptomsofcovid19analysisofbiomedicalliteratureandsocialmediadata
AT baruahgaurav identificationofriskfactorsandsymptomsofcovid19analysisofbiomedicalliteratureandsocialmediadata
AT sarabadanisarah identificationofriskfactorsandsymptomsofcovid19analysisofbiomedicalliteratureandsocialmediadata
AT palanicaadam identificationofriskfactorsandsymptomsofcovid19analysisofbiomedicalliteratureandsocialmediadata