
2 A CTS Team Approach to Topological Data Analysis of Electronic Health Records for Subtyping and Clinical Outcomes Prediction in Patients with COVID-19

OBJECTIVES/GOALS: Analysis and modeling of large, complex clinical data remain challenging despite modern advances in biomedical informatics. We aim to explore the potential of topological data analysis (TDA) to address such challenges in the context of COVID-19 outcomes using electronic health reco...


Bibliographic Details
Main authors: Skaf, Yara, Dasa, Osama, Brunson, Jason Cory, Pearson, Thomas, Laubenbacher, Reinhard
Format: Online Article Text
Language: English
Published: Cambridge University Press 2023
Subjects: Biostatistics, Epidemiology, and Research Design
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10129655/
http://dx.doi.org/10.1017/cts.2023.105
_version_ 1785030795885281280
author Skaf, Yara
Dasa, Osama
Brunson, Jason Cory
Pearson, Thomas
Laubenbacher, Reinhard
author_facet Skaf, Yara
Dasa, Osama
Brunson, Jason Cory
Pearson, Thomas
Laubenbacher, Reinhard
author_sort Skaf, Yara
collection PubMed
description OBJECTIVES/GOALS: Analysis and modeling of large, complex clinical data remain challenging despite modern advances in biomedical informatics. We aim to explore the potential of topological data analysis (TDA) to address such challenges in the context of COVID-19 outcomes using electronic health records (EHRs). METHODS/STUDY POPULATION: In this work, we develop TDA approaches to characterize subtypes and predict outcomes in patients with COVID-19 infection. First, data for >70,000 COVID-19 patients were extracted from the OneFlorida EHR database. Next, enhancements to the TDA algorithm Mapper were designed and implemented to adapt the technique to this type of data. Clinical variables, including patient demographics, vital signs, and lab values, were then used as input to conduct a population-level exploratory analysis with an emphasis on identifying phenotypic subtypes at increased risk of adverse outcomes such as major adverse cardiovascular events (MACE), mechanical ventilation, and death. RESULTS/ANTICIPATED RESULTS: Preliminary Mapper experiments have produced visual representations of the COVID-19 patient population that are well suited to exploratory analysis. Such visualizations facilitate easy identification of phenotypic subnetworks that differ from the general population in terms of baseline variables or clinical outcomes. In this and subsequent work, we aim to fully characterize and quantify differences between these subnetworks to identify factors that may confer increased risk of (or protection from) adverse outcomes. We also plan to validate and rigorously compare the efficacy of this TDA-based approach to common alternatives such as clustering, principal component analysis, and machine learning. DISCUSSION/SIGNIFICANCE: This work demonstrates the potential utility of TDA for the characterization of complex biomedical data. Mapper provides a novel means of exploring EHR data, which are otherwise difficult to visualize, and can aid in identifying or characterizing patient subtypes in diseases such as COVID-19.
format Online
Article
Text
id pubmed-10129655
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cambridge University Press
record_format MEDLINE/PubMed
spelling pubmed-10129655 2023-04-26 2 A CTS Team Approach to Topological Data Analysis of Electronic Health Records for Subtyping and Clinical Outcomes Prediction in Patients with COVID-19 Skaf, Yara Dasa, Osama Brunson, Jason Cory Pearson, Thomas Laubenbacher, Reinhard J Clin Transl Sci Biostatistics, Epidemiology, and Research Design Cambridge University Press 2023-04-24 /pmc/articles/PMC10129655/ http://dx.doi.org/10.1017/cts.2023.105 Text en © The Association for Clinical and Translational Science 2023 https://creativecommons.org/licenses/by-nc-nd/4.0/ This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is unaltered and is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use or in order to create a derivative work.
spellingShingle Biostatistics, Epidemiology, and Research Design
Skaf, Yara
Dasa, Osama
Brunson, Jason Cory
Pearson, Thomas
Laubenbacher, Reinhard
2 A CTS Team Approach to Topological Data Analysis of Electronic Health Records for Subtyping and Clinical Outcomes Prediction in Patients with COVID-19
title 2 A CTS Team Approach to Topological Data Analysis of Electronic Health Records for Subtyping and Clinical Outcomes Prediction in Patients with COVID-19
topic Biostatistics, Epidemiology, and Research Design
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10129655/
http://dx.doi.org/10.1017/cts.2023.105