Cargando…
2 A CTS Team Approach to Topological Data Analysis of Electronic Health Records for Subtyping and Clinical Outcomes Prediction in Patients with COVID-19
OBJECTIVES/GOALS: Analysis and modeling of large, complex clinical data remain challenging despite modern advances in biomedical informatics. We aim to explore the potential of topological data analysis (TDA) to address such challenges in the context of COVID-19 outcomes using electronic health reco...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cambridge University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10129655/ http://dx.doi.org/10.1017/cts.2023.105 |
_version_ | 1785030795885281280 |
---|---|
author | Skaf, Yara Dasa, Osama Brunson, Jason Cory Pearson, Thomas Laubenbacher, Reinhard |
author_facet | Skaf, Yara Dasa, Osama Brunson, Jason Cory Pearson, Thomas Laubenbacher, Reinhard |
author_sort | Skaf, Yara |
collection | PubMed |
description | OBJECTIVES/GOALS: Analysis and modeling of large, complex clinical data remain challenging despite modern advances in biomedical informatics. We aim to explore the potential of topological data analysis (TDA) to address such challenges in the context of COVID-19 outcomes using electronic health records (EHRs). METHODS/STUDY POPULATION: In this work, we develop TDA approaches to characterize subtypes and predict outcomes in patients with COVID-19 infection. First, data for >70,000 COVID-19 patients were extracted from the OneFlorida EHR database. Next, enhancements to the TDA algorithm Mapper were designed and implemented to adapt the technique to this type of data. Clinical variables, including patient demographics, vital signs, and lab values, were then used as input to conduct a population-level exploratory analysis with an emphasis on identifying phenotypic subtypes at increased risk of adverse outcomes such as major adverse cardiovascular events (MACE), mechanical ventilation, and death. RESULTS/ANTICIPATED RESULTS: Preliminary Mapper experiments have produced visual representations of the COVID-19 patient population that are well-suited to exploratory analysis. Such visualizations facilitate easy identification of phenotypic subnetworks that differ from the general population in terms of baseline variables or clinical outcomes. In this and subsequent work, we aim to fully characterize and quantify differences between these subnetworks to identify factors that may confer increased risk (or protection from) adverse outcomes. We also plan to validate and rigorously compare the efficacy of this TDA-based approach to common alternatives such as clustering, principal component analysis, and machine learning. DISCUSSION/SIGNIFICANCE: This work demonstrates the potential utility of TDA for the characterization of complex biomedical data. Mapper provides a novel means of exploring EHR data, which are otherwise difficult to visualize and can aid in identifying or characterizing patient subtypes in diseases such as COVID-19. |
format | Online Article Text |
id | pubmed-10129655 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cambridge University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-101296552023-04-26 2 A CTS Team Approach to Topological Data Analysis of Electronic Health Records for Subtyping and Clinical Outcomes Prediction in Patients with COVID-19 Skaf, Yara Dasa, Osama Brunson, Jason Cory Pearson, Thomas Laubenbacher, Reinhard J Clin Transl Sci Biostatistics, Epidemiology, and Research Design OBJECTIVES/GOALS: Analysis and modeling of large, complex clinical data remain challenging despite modern advances in biomedical informatics. We aim to explore the potential of topological data analysis (TDA) to address such challenges in the context of COVID-19 outcomes using electronic health records (EHRs). METHODS/STUDY POPULATION: In this work, we develop TDA approaches to characterize subtypes and predict outcomes in patients with COVID-19 infection. First, data for >70,000 COVID-19 patients were extracted from the OneFlorida EHR database. Next, enhancements to the TDA algorithm Mapper were designed and implemented to adapt the technique to this type of data. Clinical variables, including patient demographics, vital signs, and lab values, were then used as input to conduct a population-level exploratory analysis with an emphasis on identifying phenotypic subtypes at increased risk of adverse outcomes such as major adverse cardiovascular events (MACE), mechanical ventilation, and death. RESULTS/ANTICIPATED RESULTS: Preliminary Mapper experiments have produced visual representations of the COVID-19 patient population that are well-suited to exploratory analysis. Such visualizations facilitate easy identification of phenotypic subnetworks that differ from the general population in terms of baseline variables or clinical outcomes. In this and subsequent work, we aim to fully characterize and quantify differences between these subnetworks to identify factors that may confer increased risk (or protection from) adverse outcomes. We also plan to validate and rigorously compare the efficacy of this TDA-based approach to common alternatives such as clustering, principal component analysis, and machine learning. DISCUSSION/SIGNIFICANCE: This work demonstrates the potential utility of TDA for the characterization of complex biomedical data. Mapper provides a novel means of exploring EHR data, which are otherwise difficult to visualize and can aid in identifying or characterizing patient subtypes in diseases such as COVID-19. Cambridge University Press 2023-04-24 /pmc/articles/PMC10129655/ http://dx.doi.org/10.1017/cts.2023.105 Text en © The Association for Clinical and Translational Science 2023 https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is unaltered and is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use or in order to create a derivative work. |
spellingShingle | Biostatistics, Epidemiology, and Research Design Skaf, Yara Dasa, Osama Brunson, Jason Cory Pearson, Thomas Laubenbacher, Reinhard 2 A CTS Team Approach to Topological Data Analysis of Electronic Health Records for Subtyping and Clinical Outcomes Prediction in Patients with COVID-19 |
title | 2 A CTS Team Approach to Topological Data Analysis of Electronic Health Records for Subtyping and Clinical Outcomes Prediction in Patients with COVID-19 |
title_full | 2 A CTS Team Approach to Topological Data Analysis of Electronic Health Records for Subtyping and Clinical Outcomes Prediction in Patients with COVID-19 |
title_fullStr | 2 A CTS Team Approach to Topological Data Analysis of Electronic Health Records for Subtyping and Clinical Outcomes Prediction in Patients with COVID-19 |
title_full_unstemmed | 2 A CTS Team Approach to Topological Data Analysis of Electronic Health Records for Subtyping and Clinical Outcomes Prediction in Patients with COVID-19 |
title_short | 2 A CTS Team Approach to Topological Data Analysis of Electronic Health Records for Subtyping and Clinical Outcomes Prediction in Patients with COVID-19 |
title_sort | 2 a cts team approach to topological data analysis of electronic health records for subtyping and clinical outcomes prediction in patients with covid-19 |
topic | Biostatistics, Epidemiology, and Research Design |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10129655/ http://dx.doi.org/10.1017/cts.2023.105 |
work_keys_str_mv | AT skafyara 2actsteamapproachtotopologicaldataanalysisofelectronichealthrecordsforsubtypingandclinicaloutcomespredictioninpatientswithcovid19 AT dasaosama 2actsteamapproachtotopologicaldataanalysisofelectronichealthrecordsforsubtypingandclinicaloutcomespredictioninpatientswithcovid19 AT brunsonjasoncory 2actsteamapproachtotopologicaldataanalysisofelectronichealthrecordsforsubtypingandclinicaloutcomespredictioninpatientswithcovid19 AT pearsonthomas 2actsteamapproachtotopologicaldataanalysisofelectronichealthrecordsforsubtypingandclinicaloutcomespredictioninpatientswithcovid19 AT laubenbacherreinhard 2actsteamapproachtotopologicaldataanalysisofelectronichealthrecordsforsubtypingandclinicaloutcomespredictioninpatientswithcovid19 |