458. A Machine Learning Approach Identifies Distinct Early-Symptom Cluster Phenotypes Which Correlate with Severe SARS-CoV-2 Outcomes
Main authors: | |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press, 2021 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8644530/ http://dx.doi.org/10.1093/ofid/ofab466.657 |
Summary: | BACKGROUND: The novel coronavirus disease 2019 (COVID-19) pandemic remains a global challenge, and accurate COVID-19 prognosis remains an important aspect of clinical management. While many prognostic systems have been proposed, most are derived from analyses of individual symptoms or biomarkers. Here, we take a machine learning approach to first identify discrete clusters of early-stage symptoms that may delineate groups with distinct symptom phenotypes. We then assess whether these groups correlate with subsequent disease severity. METHODS: The Epidemiology, Immunology, and Clinical Characteristics of Emerging Infectious Diseases with Pandemic Potential (EPICC) study is a longitudinal cohort study with data and biospecimens collected from nine military treatment facilities over 1 year of follow-up. Demographic and clinical characteristics were measured with interviews and electronic medical record review. Early symptoms by organ domain were measured by FLU-PRO-plus surveys collected for 14 days post-enrollment, with surveys completed a median of 14.5 days (interquartile range [IQR] = 13) after symptom onset. Using these FLU-PRO-plus responses, we applied principal component analysis followed by k-means, an unsupervised machine learning algorithm, to identify groups with distinct clusters of symptoms. We then fit multivariable logistic regression models to determine how these early-symptom clusters correlated with hospitalization risk after controlling for age, sex, race, and obesity. RESULTS: Using SARS-CoV-2 positive participants (n = 1137) from the EPICC cohort (Figure 1), we transformed reported symptoms into domains and identified three groups of participants with distinct clusters of symptoms. 
Logistic regression demonstrated that cluster-2 was associated with approximately three-fold increased odds of hospitalization [odds ratio 3.01 (95% CI: 2-4.52); P < 0.001], an association that remained significant after controlling for other factors [adjusted odds ratio 2.97 (95% CI: 1.88-4.69); P < 0.001]. Figure 1 [Image: see text]: (A) Baseline characteristics of SARS-CoV-2 positive participants. (B) Heatmap comparing FLU-PRO response in each participant. (C) Principal component analysis followed by k-means clustering identified three groups of participants. (D) Crude and adjusted association of identified cluster with hospitalization. CONCLUSION: We identified three distinct groups with early-symptom phenotypes. With further validation of the clusters' significance, this tool could be used to improve COVID-19 prognosis in a precision medicine framework and may assist in patient triaging and clinical decision-making. DISCLAIMER: [Image: see text] DISCLOSURES: David A. Lindholm, MD, American Board of Internal Medicine (Individual(s) Involved: Self): Member of Auxiliary R&D Infectious Disease Item-Writer Task Force. No financial support received. No exam questions will be disclosed. (Other Financial or Material Support); Ryan C. Maves, MD, EMD Serono (Advisor or Review Panel member), Heron Therapeutics (Advisor or Review Panel member); Simon Pollett, MBBS, AstraZeneca (Other Financial or Material Support: HJF, in support of USU IDCRP, funded under a CRADA to augment the conduct of an unrelated Phase III COVID-19 vaccine trial sponsored by AstraZeneca as part of USG response (unrelated work)) |
---|
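The odds ratios and confidence intervals reported in the summary can be sanity-checked with a few lines of arithmetic. The sketch below assumes the reported 95% intervals are Wald intervals, which the abstract does not state; under that assumption the interval is symmetric on the log scale, so the geometric mean of the bounds should reproduce the point estimate and the standard error can be recovered from the interval width. The function name `wald_summary` and its return keys are illustrative and do not come from the study.

```python
import math

# Two-sided 95% quantile of the standard normal distribution.
Z_95 = 1.959963984540054


def wald_summary(odds_ratio, ci_low, ci_high):
    """Recover log-scale quantities from a reported odds ratio and 95% CI.

    ASSUMPTION: the interval is a Wald interval, i.e. exp(log(OR) +/- Z_95 * SE).
    The abstract does not say how its intervals were computed.
    """
    log_or = math.log(odds_ratio)
    # Width of the interval on the log scale is 2 * Z_95 * SE.
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * Z_95)
    # Midpoint on the log scale, transformed back: the geometric mean of the bounds.
    implied_or = math.sqrt(ci_low * ci_high)
    z = log_or / se
    # Two-sided p-value from the standard normal: 2 * (1 - Phi(|z|)).
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return {"se": se, "implied_or": implied_or, "z": z, "p_value": p_value}


# Figures taken directly from the summary's RESULTS section.
crude = wald_summary(3.01, 2.0, 4.52)
adjusted = wald_summary(2.97, 1.88, 4.69)

for label, result in (("crude", crude), ("adjusted", adjusted)):
    print(label, {key: round(value, 4) for key, value in result.items()})
```

For both the crude and adjusted estimates, the geometric mean of the bounds lands within rounding of the reported odds ratio (about 3.01 and 2.97), and the implied two-sided p-values fall below 0.001, which agrees with the reported figures under the Wald assumption.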