Cargando…
Clinical and temporal characterization of COVID-19 subgroups using patient vector embeddings of electronic health records
OBJECTIVE: To identify and characterize clinical subgroups of hospitalized Coronavirus Disease 2019 (COVID-19) patients. MATERIALS AND METHODS: Electronic health records of hospitalized COVID-19 patients at NewYork-Presbyterian/Columbia University Irving Medical Center were temporally sequenced and...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9620768/ https://www.ncbi.nlm.nih.gov/pubmed/36255273 http://dx.doi.org/10.1093/jamia/ocac208 |
_version_ | 1784821392093478912 |
---|---|
author | Ta, Casey N Zucker, Jason E Chiu, Po-Hsiang Fang, Yilu Natarajan, Karthik Weng, Chunhua |
author_facet | Ta, Casey N Zucker, Jason E Chiu, Po-Hsiang Fang, Yilu Natarajan, Karthik Weng, Chunhua |
author_sort | Ta, Casey N |
collection | PubMed |
description | OBJECTIVE: To identify and characterize clinical subgroups of hospitalized Coronavirus Disease 2019 (COVID-19) patients. MATERIALS AND METHODS: Electronic health records of hospitalized COVID-19 patients at NewYork-Presbyterian/Columbia University Irving Medical Center were temporally sequenced and transformed into patient vector representations using Paragraph Vector models. K-means clustering was performed to identify subgroups. RESULTS: A diverse cohort of 11 313 patients with COVID-19 and hospitalizations between March 2, 2020 and December 1, 2021 were identified; median [IQR] age: 61.2 [40.3–74.3]; 51.5% female. Twenty subgroups of hospitalized COVID-19 patients, labeled by increasing severity, were characterized by their demographics, conditions, outcomes, and severity (mild-moderate/severe/critical). Subgroup temporal patterns were characterized by the durations in each subgroup, transitions between subgroups, and the complete paths throughout the course of hospitalization. DISCUSSION: Several subgroups had mild-moderate severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections but were hospitalized for underlying conditions (pregnancy, cardiovascular disease [CVD], etc.). Subgroup 7 included solid organ transplant recipients who mostly developed mild-moderate or severe disease. Subgroup 9 had a history of type-2 diabetes, kidney and CVD, and suffered the highest rates of heart failure (45.2%) and end-stage renal disease (80.6%). Subgroup 13 was the oldest (median: 82.7 years) and had mixed severity but high mortality (33.3%). Subgroup 17 had critical disease and the highest mortality (64.6%), with age (median: 68.1 years) being the only notable risk factor. Subgroups 18–20 had critical disease with high complication rates and long hospitalizations (median: 40+ days). All subgroups are detailed in the full text. A chord diagram depicts the most common transitions, and paths with the highest prevalence, longest hospitalizations, lowest and highest mortalities are presented. Understanding these subgroups and their pathways may aid clinicians in their decisions for better management and earlier intervention for patients. |
format | Online Article Text |
id | pubmed-9620768 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-96207682022-11-04 Clinical and temporal characterization of COVID-19 subgroups using patient vector embeddings of electronic health records Ta, Casey N Zucker, Jason E Chiu, Po-Hsiang Fang, Yilu Natarajan, Karthik Weng, Chunhua J Am Med Inform Assoc Research and Applications OBJECTIVE: To identify and characterize clinical subgroups of hospitalized Coronavirus Disease 2019 (COVID-19) patients. MATERIALS AND METHODS: Electronic health records of hospitalized COVID-19 patients at NewYork-Presbyterian/Columbia University Irving Medical Center were temporally sequenced and transformed into patient vector representations using Paragraph Vector models. K-means clustering was performed to identify subgroups. RESULTS: A diverse cohort of 11 313 patients with COVID-19 and hospitalizations between March 2, 2020 and December 1, 2021 were identified; median [IQR] age: 61.2 [40.3–74.3]; 51.5% female. Twenty subgroups of hospitalized COVID-19 patients, labeled by increasing severity, were characterized by their demographics, conditions, outcomes, and severity (mild-moderate/severe/critical). Subgroup temporal patterns were characterized by the durations in each subgroup, transitions between subgroups, and the complete paths throughout the course of hospitalization. DISCUSSION: Several subgroups had mild-moderate severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections but were hospitalized for underlying conditions (pregnancy, cardiovascular disease [CVD], etc.). Subgroup 7 included solid organ transplant recipients who mostly developed mild-moderate or severe disease. Subgroup 9 had a history of type-2 diabetes, kidney and CVD, and suffered the highest rates of heart failure (45.2%) and end-stage renal disease (80.6%). Subgroup 13 was the oldest (median: 82.7 years) and had mixed severity but high mortality (33.3%). Subgroup 17 had critical disease and the highest mortality (64.6%), with age (median: 68.1 years) being the only notable risk factor. Subgroups 18–20 had critical disease with high complication rates and long hospitalizations (median: 40+ days). All subgroups are detailed in the full text. A chord diagram depicts the most common transitions, and paths with the highest prevalence, longest hospitalizations, lowest and highest mortalities are presented. Understanding these subgroups and their pathways may aid clinicians in their decisions for better management and earlier intervention for patients. Oxford University Press 2022-10-18 /pmc/articles/PMC9620768/ /pubmed/36255273 http://dx.doi.org/10.1093/jamia/ocac208 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com https://academic.oup.com/pages/standard-publication-reuse-rightsThis article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/pages/standard-publication-reuse-rights) |
spellingShingle | Research and Applications Ta, Casey N Zucker, Jason E Chiu, Po-Hsiang Fang, Yilu Natarajan, Karthik Weng, Chunhua Clinical and temporal characterization of COVID-19 subgroups using patient vector embeddings of electronic health records |
title | Clinical and temporal characterization of COVID-19 subgroups using patient vector embeddings of electronic health records |
title_full | Clinical and temporal characterization of COVID-19 subgroups using patient vector embeddings of electronic health records |
title_fullStr | Clinical and temporal characterization of COVID-19 subgroups using patient vector embeddings of electronic health records |
title_full_unstemmed | Clinical and temporal characterization of COVID-19 subgroups using patient vector embeddings of electronic health records |
title_short | Clinical and temporal characterization of COVID-19 subgroups using patient vector embeddings of electronic health records |
title_sort | clinical and temporal characterization of covid-19 subgroups using patient vector embeddings of electronic health records |
topic | Research and Applications |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9620768/ https://www.ncbi.nlm.nih.gov/pubmed/36255273 http://dx.doi.org/10.1093/jamia/ocac208 |
work_keys_str_mv | AT tacaseyn clinicalandtemporalcharacterizationofcovid19subgroupsusingpatientvectorembeddingsofelectronichealthrecords AT zuckerjasone clinicalandtemporalcharacterizationofcovid19subgroupsusingpatientvectorembeddingsofelectronichealthrecords AT chiupohsiang clinicalandtemporalcharacterizationofcovid19subgroupsusingpatientvectorembeddingsofelectronichealthrecords AT fangyilu clinicalandtemporalcharacterizationofcovid19subgroupsusingpatientvectorembeddingsofelectronichealthrecords AT natarajankarthik clinicalandtemporalcharacterizationofcovid19subgroupsusingpatientvectorembeddingsofelectronichealthrecords AT wengchunhua clinicalandtemporalcharacterizationofcovid19subgroupsusingpatientvectorembeddingsofelectronichealthrecords |