Cargando…

458. A Machine Learning Approach Identifies Distinct Early-Symptom Cluster Phenotypes Which Correlate with Severe SARS-CoV-2 Outcomes

BACKGROUND: The novel coronavirus disease 2019 (COVID-19) pandemic remains a global challenge. Accurate COVID-19 prognosis remains an important aspect of clinical management. While many prognostic systems have been proposed, most are derived from analyses of individual symptoms or biomarkers. Here,...

Descripción completa

Detalles Bibliográficos
Autores principales: Epsi, Nusrat J, Powers, John H, Lindholm, David A, Helfrich, Alison, Huprikar, Nikhil, Ganesan, Anuradha, Lalani, Tahaniyat, Mody, Rupal, Madar, Cristian, Bazan, Samantha, Colombo, Rhonda E, Larson, Derek, Maves, Ryan C, Utz, Gregory, Tribble, David, Agan, Brian, Burgess, Timothy, Malloy, Allison, Pollett, Simon, Richard, Stephanie A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8644530/
http://dx.doi.org/10.1093/ofid/ofab466.657
_version_ 1784610107428962304
author Epsi, Nusrat J
Powers, John H
Lindholm, David A
Lindholm, David A
Helfrich, Alison
Huprikar, Nikhil
Ganesan, Anuradha
Lalani, Tahaniyat
Mody, Rupal
Madar, Cristian
Bazan, Samantha
Colombo, Rhonda E
Larson, Derek
Maves, Ryan C
Maves, Ryan C
Utz, Gregory
Tribble, David
Agan, Brian
Burgess, Timothy
Malloy, Allison
Pollett, Simon
Richard, Stephanie A
author_facet Epsi, Nusrat J
Powers, John H
Lindholm, David A
Lindholm, David A
Helfrich, Alison
Huprikar, Nikhil
Ganesan, Anuradha
Lalani, Tahaniyat
Mody, Rupal
Madar, Cristian
Bazan, Samantha
Colombo, Rhonda E
Larson, Derek
Maves, Ryan C
Maves, Ryan C
Utz, Gregory
Tribble, David
Agan, Brian
Burgess, Timothy
Malloy, Allison
Pollett, Simon
Richard, Stephanie A
author_sort Epsi, Nusrat J
collection PubMed
description BACKGROUND: The novel coronavirus disease 2019 (COVID-19) pandemic remains a global challenge. Accurate COVID-19 prognosis remains an important aspect of clinical management. While many prognostic systems have been proposed, most are derived from analyses of individual symptoms or biomarkers. Here, we take a machine learning approach to first identify discrete clusters of early stage-symptoms which may delineate groups with distinct symptom phenotypes. We then sought to identify whether these groups correlate with subsequent disease severity. METHODS: The Epidemiology, Immunology, and Clinical Characteristics of Emerging Infectious Diseases with Pandemic Potential (EPICC) study is a longitudinal cohort study with data and biospecimens collected from nine military treatment facilities over 1 year of follow-up. Demographic and clinical characteristics were measured with interviews and electronic medical record review. Early symptoms by organ-domain were measured by FLU-PRO-plus surveys collected for 14 days post-enrollment, with surveys completed a median 14.5 (Interquartile Range, IQR = 13) days post-symptom onset. Using these FLU-PRO-plus responses, we applied principal component analysis followed by unsupervised machine learning algorithm k-means to identify groups with distinct clusters of symptoms. We then fit multivariate logistic regression models to determine how these early-symptom clusters correlated with hospitalization risk after controlling for age, sex, race, and obesity. RESULTS: Using SARS-CoV-2 positive participants (n = 1137) from the EPICC cohort (Figure 1), we transformed reported symptoms into domains and identified three groups of participants with distinct clusters of symptoms. Logistic regression demonstrated that cluster-2 was associated with an approximately three-fold increased odds [3.01 (95% CI: 2-4.52); P < 0.001] of hospitalization which remained significant after controlling for other factors [2.97 (95% CI: 1.88-4.69); P < 0.001]. [Image: see text] (A) Baseline characteristics of SARS-CoV-2 positive participants. (B) Heatmap comparing FLU-PRO response in each participant. (C) Principal component analysis followed by k-means clustering identified three groups of participants. (D) Crude and adjusted association of identified cluster with hospitalization. CONCLUSION: Our findings have identified three distinct groups with early-symptom phenotypes. With further validation of the clusters’ significance, this tool could be used to improve COVID-19 prognosis in a precision medicine framework and may assist in patient triaging and clinical decision-making. DISCLAIMER: [Image: see text] DISCLOSURES: David A. Lindholm, MD, American Board of Internal Medicine (Individual(s) Involved: Self): Member of Auxiliary R&D Infectious Disease Item-Writer Task Force. No financial support received. No exam questions will be disclosed ., Other Financial or Material Support Ryan C. Maves, MD, EMD Serono (Advisor or Review Panel member)Heron Therapeutics (Advisor or Review Panel member) Simon Pollett, MBBS, Astra Zeneca (Other Financial or Material Support, HJF, in support of USU IDCRP, funded under a CRADA to augment the conduct of an unrelated Phase III COVID-19 vaccine trial sponsored by AstraZeneca as part of USG response (unrelated work))
format Online
Article
Text
id pubmed-8644530
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-86445302021-12-06 458. A Machine Learning Approach Identifies Distinct Early-Symptom Cluster Phenotypes Which Correlate with Severe SARS-CoV-2 Outcomes Epsi, Nusrat J Powers, John H Lindholm, David A Lindholm, David A Helfrich, Alison Huprikar, Nikhil Ganesan, Anuradha Lalani, Tahaniyat Mody, Rupal Madar, Cristian Bazan, Samantha Colombo, Rhonda E Larson, Derek Maves, Ryan C Maves, Ryan C Utz, Gregory Tribble, David Agan, Brian Burgess, Timothy Malloy, Allison Pollett, Simon Richard, Stephanie A Open Forum Infect Dis Poster Abstracts BACKGROUND: The novel coronavirus disease 2019 (COVID-19) pandemic remains a global challenge. Accurate COVID-19 prognosis remains an important aspect of clinical management. While many prognostic systems have been proposed, most are derived from analyses of individual symptoms or biomarkers. Here, we take a machine learning approach to first identify discrete clusters of early stage-symptoms which may delineate groups with distinct symptom phenotypes. We then sought to identify whether these groups correlate with subsequent disease severity. METHODS: The Epidemiology, Immunology, and Clinical Characteristics of Emerging Infectious Diseases with Pandemic Potential (EPICC) study is a longitudinal cohort study with data and biospecimens collected from nine military treatment facilities over 1 year of follow-up. Demographic and clinical characteristics were measured with interviews and electronic medical record review. Early symptoms by organ-domain were measured by FLU-PRO-plus surveys collected for 14 days post-enrollment, with surveys completed a median 14.5 (Interquartile Range, IQR = 13) days post-symptom onset. Using these FLU-PRO-plus responses, we applied principal component analysis followed by unsupervised machine learning algorithm k-means to identify groups with distinct clusters of symptoms. We then fit multivariate logistic regression models to determine how these early-symptom clusters correlated with hospitalization risk after controlling for age, sex, race, and obesity. RESULTS: Using SARS-CoV-2 positive participants (n = 1137) from the EPICC cohort (Figure 1), we transformed reported symptoms into domains and identified three groups of participants with distinct clusters of symptoms. Logistic regression demonstrated that cluster-2 was associated with an approximately three-fold increased odds [3.01 (95% CI: 2-4.52); P < 0.001] of hospitalization which remained significant after controlling for other factors [2.97 (95% CI: 1.88-4.69); P < 0.001]. [Image: see text] (A) Baseline characteristics of SARS-CoV-2 positive participants. (B) Heatmap comparing FLU-PRO response in each participant. (C) Principal component analysis followed by k-means clustering identified three groups of participants. (D) Crude and adjusted association of identified cluster with hospitalization. CONCLUSION: Our findings have identified three distinct groups with early-symptom phenotypes. With further validation of the clusters’ significance, this tool could be used to improve COVID-19 prognosis in a precision medicine framework and may assist in patient triaging and clinical decision-making. DISCLAIMER: [Image: see text] DISCLOSURES: David A. Lindholm, MD, American Board of Internal Medicine (Individual(s) Involved: Self): Member of Auxiliary R&D Infectious Disease Item-Writer Task Force. No financial support received. No exam questions will be disclosed ., Other Financial or Material Support Ryan C. Maves, MD, EMD Serono (Advisor or Review Panel member)Heron Therapeutics (Advisor or Review Panel member) Simon Pollett, MBBS, Astra Zeneca (Other Financial or Material Support, HJF, in support of USU IDCRP, funded under a CRADA to augment the conduct of an unrelated Phase III COVID-19 vaccine trial sponsored by AstraZeneca as part of USG response (unrelated work)) Oxford University Press 2021-12-04 /pmc/articles/PMC8644530/ http://dx.doi.org/10.1093/ofid/ofab466.657 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Infectious Diseases Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Poster Abstracts
Epsi, Nusrat J
Powers, John H
Lindholm, David A
Lindholm, David A
Helfrich, Alison
Huprikar, Nikhil
Ganesan, Anuradha
Lalani, Tahaniyat
Mody, Rupal
Madar, Cristian
Bazan, Samantha
Colombo, Rhonda E
Larson, Derek
Maves, Ryan C
Maves, Ryan C
Utz, Gregory
Tribble, David
Agan, Brian
Burgess, Timothy
Malloy, Allison
Pollett, Simon
Richard, Stephanie A
458. A Machine Learning Approach Identifies Distinct Early-Symptom Cluster Phenotypes Which Correlate with Severe SARS-CoV-2 Outcomes
title 458. A Machine Learning Approach Identifies Distinct Early-Symptom Cluster Phenotypes Which Correlate with Severe SARS-CoV-2 Outcomes
title_full 458. A Machine Learning Approach Identifies Distinct Early-Symptom Cluster Phenotypes Which Correlate with Severe SARS-CoV-2 Outcomes
title_fullStr 458. A Machine Learning Approach Identifies Distinct Early-Symptom Cluster Phenotypes Which Correlate with Severe SARS-CoV-2 Outcomes
title_full_unstemmed 458. A Machine Learning Approach Identifies Distinct Early-Symptom Cluster Phenotypes Which Correlate with Severe SARS-CoV-2 Outcomes
title_short 458. A Machine Learning Approach Identifies Distinct Early-Symptom Cluster Phenotypes Which Correlate with Severe SARS-CoV-2 Outcomes
title_sort 458. a machine learning approach identifies distinct early-symptom cluster phenotypes which correlate with severe sars-cov-2 outcomes
topic Poster Abstracts
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8644530/
http://dx.doi.org/10.1093/ofid/ofab466.657
work_keys_str_mv AT epsinusratj 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT powersjohnh 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT lindholmdavida 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT lindholmdavida 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT helfrichalison 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT huprikarnikhil 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT ganesananuradha 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT lalanitahaniyat 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT modyrupal 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT madarcristian 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT bazansamantha 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT colomborhondae 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT larsonderek 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT mavesryanc 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT mavesryanc 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT utzgregory 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT tribbledavid 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT aganbrian 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT burgesstimothy 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT malloyallison 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT pollettsimon 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes
AT richardstephaniea 458amachinelearningapproachidentifiesdistinctearlysymptomclusterphenotypeswhichcorrelatewithseveresarscov2outcomes