Cargando…

Machine learning of COVID-19 clinical data identifies population structures with therapeutic potential

Clinical outcomes for patients with COVID-19 are heterogeneous and there is interest in defining subgroups for prognostic modeling and development of treatment algorithms. We obtained 28 demographic and laboratory variables in patients admitted to hospital with COVID-19. These comprised a training c...

Descripción completa

Detalles Bibliográficos
Autores principales: Greenwood, David, Taverner, Thomas, Adderley, Nicola J., Price, Malcolm James, Gokhale, Krishna, Sainsbury, Christopher, Gallier, Suzy, Welch, Carly, Sapey, Elizabeth, Murray, Duncan, Fanning, Hilary, Ball, Simon, Nirantharakumar, Krishnarajah, Croft, Wayne, Moss, Paul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9153184/
https://www.ncbi.nlm.nih.gov/pubmed/35665240
http://dx.doi.org/10.1016/j.isci.2022.104480
_version_ 1784717797312430080
author Greenwood, David
Taverner, Thomas
Adderley, Nicola J.
Price, Malcolm James
Gokhale, Krishna
Sainsbury, Christopher
Gallier, Suzy
Welch, Carly
Sapey, Elizabeth
Murray, Duncan
Fanning, Hilary
Ball, Simon
Nirantharakumar, Krishnarajah
Croft, Wayne
Moss, Paul
author_facet Greenwood, David
Taverner, Thomas
Adderley, Nicola J.
Price, Malcolm James
Gokhale, Krishna
Sainsbury, Christopher
Gallier, Suzy
Welch, Carly
Sapey, Elizabeth
Murray, Duncan
Fanning, Hilary
Ball, Simon
Nirantharakumar, Krishnarajah
Croft, Wayne
Moss, Paul
author_sort Greenwood, David
collection PubMed
description Clinical outcomes for patients with COVID-19 are heterogeneous and there is interest in defining subgroups for prognostic modeling and development of treatment algorithms. We obtained 28 demographic and laboratory variables in patients admitted to hospital with COVID-19. These comprised a training cohort (n = 6099) and two validation cohorts during the first and second waves of the pandemic (n = 996; n = 1011). Uniform manifold approximation and projection (UMAP) dimension reduction and Gaussian mixture model (GMM) analysis was used to define patient clusters. 29 clusters were defined in the training cohort and associated with markedly different mortality rates, which were predictive within confirmation datasets. Deconvolution of clinical features within clusters identified unexpected relationships between variables. Integration of large datasets using UMAP-assisted clustering can therefore identify patient subgroups with prognostic information and uncovers unexpected interactions between clinical variables. This application of machine learning represents a powerful approach for delineating disease pathogenesis and potential therapeutic interventions.
format Online
Article
Text
id pubmed-9153184
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-91531842022-05-31 Machine learning of COVID-19 clinical data identifies population structures with therapeutic potential Greenwood, David Taverner, Thomas Adderley, Nicola J. Price, Malcolm James Gokhale, Krishna Sainsbury, Christopher Gallier, Suzy Welch, Carly Sapey, Elizabeth Murray, Duncan Fanning, Hilary Ball, Simon Nirantharakumar, Krishnarajah Croft, Wayne Moss, Paul iScience Article Clinical outcomes for patients with COVID-19 are heterogeneous and there is interest in defining subgroups for prognostic modeling and development of treatment algorithms. We obtained 28 demographic and laboratory variables in patients admitted to hospital with COVID-19. These comprised a training cohort (n = 6099) and two validation cohorts during the first and second waves of the pandemic (n = 996; n = 1011). Uniform manifold approximation and projection (UMAP) dimension reduction and Gaussian mixture model (GMM) analysis was used to define patient clusters. 29 clusters were defined in the training cohort and associated with markedly different mortality rates, which were predictive within confirmation datasets. Deconvolution of clinical features within clusters identified unexpected relationships between variables. Integration of large datasets using UMAP-assisted clustering can therefore identify patient subgroups with prognostic information and uncovers unexpected interactions between clinical variables. This application of machine learning represents a powerful approach for delineating disease pathogenesis and potential therapeutic interventions. Elsevier 2022-05-31 /pmc/articles/PMC9153184/ /pubmed/35665240 http://dx.doi.org/10.1016/j.isci.2022.104480 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Greenwood, David
Taverner, Thomas
Adderley, Nicola J.
Price, Malcolm James
Gokhale, Krishna
Sainsbury, Christopher
Gallier, Suzy
Welch, Carly
Sapey, Elizabeth
Murray, Duncan
Fanning, Hilary
Ball, Simon
Nirantharakumar, Krishnarajah
Croft, Wayne
Moss, Paul
Machine learning of COVID-19 clinical data identifies population structures with therapeutic potential
title Machine learning of COVID-19 clinical data identifies population structures with therapeutic potential
title_full Machine learning of COVID-19 clinical data identifies population structures with therapeutic potential
title_fullStr Machine learning of COVID-19 clinical data identifies population structures with therapeutic potential
title_full_unstemmed Machine learning of COVID-19 clinical data identifies population structures with therapeutic potential
title_short Machine learning of COVID-19 clinical data identifies population structures with therapeutic potential
title_sort machine learning of covid-19 clinical data identifies population structures with therapeutic potential
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9153184/
https://www.ncbi.nlm.nih.gov/pubmed/35665240
http://dx.doi.org/10.1016/j.isci.2022.104480
work_keys_str_mv AT greenwooddavid machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential
AT tavernerthomas machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential
AT adderleynicolaj machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential
AT pricemalcolmjames machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential
AT gokhalekrishna machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential
AT sainsburychristopher machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential
AT galliersuzy machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential
AT welchcarly machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential
AT sapeyelizabeth machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential
AT murrayduncan machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential
AT fanninghilary machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential
AT ballsimon machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential
AT nirantharakumarkrishnarajah machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential
AT croftwayne machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential
AT mosspaul machinelearningofcovid19clinicaldataidentifiespopulationstructureswiththerapeuticpotential