Cargando…
Machine learning of COVID-19 clinical data identifies population structures with therapeutic potential
Clinical outcomes for patients with COVID-19 are heterogeneous and there is interest in defining subgroups for prognostic modeling and development of treatment algorithms. We obtained 28 demographic and laboratory variables in patients admitted to hospital with COVID-19. These comprised a training c...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9153184/ https://www.ncbi.nlm.nih.gov/pubmed/35665240 http://dx.doi.org/10.1016/j.isci.2022.104480 |
_version_ | 1784717797312430080 |
---|---|
author | Greenwood, David Taverner, Thomas Adderley, Nicola J. Price, Malcolm James Gokhale, Krishna Sainsbury, Christopher Gallier, Suzy Welch, Carly Sapey, Elizabeth Murray, Duncan Fanning, Hilary Ball, Simon Nirantharakumar, Krishnarajah Croft, Wayne Moss, Paul |
author_facet | Greenwood, David Taverner, Thomas Adderley, Nicola J. Price, Malcolm James Gokhale, Krishna Sainsbury, Christopher Gallier, Suzy Welch, Carly Sapey, Elizabeth Murray, Duncan Fanning, Hilary Ball, Simon Nirantharakumar, Krishnarajah Croft, Wayne Moss, Paul |
author_sort | Greenwood, David |
collection | PubMed |
description | Clinical outcomes for patients with COVID-19 are heterogeneous and there is interest in defining subgroups for prognostic modeling and development of treatment algorithms. We obtained 28 demographic and laboratory variables in patients admitted to hospital with COVID-19. These comprised a training cohort (n = 6099) and two validation cohorts during the first and second waves of the pandemic (n = 996; n = 1011). Uniform manifold approximation and projection (UMAP) dimension reduction and Gaussian mixture model (GMM) analysis was used to define patient clusters. 29 clusters were defined in the training cohort and associated with markedly different mortality rates, which were predictive within confirmation datasets. Deconvolution of clinical features within clusters identified unexpected relationships between variables. Integration of large datasets using UMAP-assisted clustering can therefore identify patient subgroups with prognostic information and uncovers unexpected interactions between clinical variables. This application of machine learning represents a powerful approach for delineating disease pathogenesis and potential therapeutic interventions. |
format | Online Article Text |
id | pubmed-9153184 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-91531842022-05-31 Machine learning of COVID-19 clinical data identifies population structures with therapeutic potential Greenwood, David Taverner, Thomas Adderley, Nicola J. Price, Malcolm James Gokhale, Krishna Sainsbury, Christopher Gallier, Suzy Welch, Carly Sapey, Elizabeth Murray, Duncan Fanning, Hilary Ball, Simon Nirantharakumar, Krishnarajah Croft, Wayne Moss, Paul iScience Article Clinical outcomes for patients with COVID-19 are heterogeneous and there is interest in defining subgroups for prognostic modeling and development of treatment algorithms. We obtained 28 demographic and laboratory variables in patients admitted to hospital with COVID-19. These comprised a training cohort (n = 6099) and two validation cohorts during the first and second waves of the pandemic (n = 996; n = 1011). Uniform manifold approximation and projection (UMAP) dimension reduction and Gaussian mixture model (GMM) analysis was used to define patient clusters. 29 clusters were defined in the training cohort and associated with markedly different mortality rates, which were predictive within confirmation datasets. Deconvolution of clinical features within clusters identified unexpected relationships between variables. Integration of large datasets using UMAP-assisted clustering can therefore identify patient subgroups with prognostic information and uncovers unexpected interactions between clinical variables. This application of machine learning represents a powerful approach for delineating disease pathogenesis and potential therapeutic interventions. Elsevier 2022-05-31 /pmc/articles/PMC9153184/ /pubmed/35665240 http://dx.doi.org/10.1016/j.isci.2022.104480 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Greenwood, David Taverner, Thomas Adderley, Nicola J. Price, Malcolm James Gokhale, Krishna Sainsbury, Christopher Gallier, Suzy Welch, Carly Sapey, Elizabeth Murray, Duncan Fanning, Hilary Ball, Simon Nirantharakumar, Krishnarajah Croft, Wayne Moss, Paul Machine learning of COVID-19 clinical data identifies population structures with therapeutic potential |
title | Machine learning of COVID-19 clinical data identifies population structures with therapeutic potential |
title_full | Machine learning of COVID-19 clinical data identifies population structures with therapeutic potential |
title_fullStr | Machine learning of COVID-19 clinical data identifies population structures with therapeutic potential |
title_full_unstemmed | Machine learning of COVID-19 clinical data identifies population structures with therapeutic potential |
title_short | Machine learning of COVID-19 clinical data identifies population structures with therapeutic potential |
title_sort | machine learning of covid-19 clinical data identifies population structures with therapeutic potential |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9153184/ https://www.ncbi.nlm.nih.gov/pubmed/35665240 http://dx.doi.org/10.1016/j.isci.2022.104480 |
work_keys_str_mv | AT greenwooddavid machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential AT tavernerthomas machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential AT adderleynicolaj machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential AT pricemalcolmjames machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential AT gokhalekrishna machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential AT sainsburychristopher machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential AT galliersuzy machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential AT welchcarly machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential AT sapeyelizabeth machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential AT murrayduncan machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential AT fanninghilary machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential AT ballsimon machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential AT nirantharakumarkrishnarajah machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential AT croftwayne machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential AT mosspaul machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential |