Cargando…

Clinical and temporal characterization of COVID-19 subgroups using patient vector embeddings of electronic health records

OBJECTIVE: To identify and characterize clinical subgroups of hospitalized Coronavirus Disease 2019 (COVID-19) patients. MATERIALS AND METHODS: Electronic health records of hospitalized COVID-19 patients at NewYork-Presbyterian/Columbia University Irving Medical Center were temporally sequenced and...

Descripción completa

Detalles Bibliográficos
Autores principales: Ta, Casey N, Zucker, Jason E, Chiu, Po-Hsiang, Fang, Yilu, Natarajan, Karthik, Weng, Chunhua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9620768/
https://www.ncbi.nlm.nih.gov/pubmed/36255273
http://dx.doi.org/10.1093/jamia/ocac208
_version_ 1784821392093478912
author Ta, Casey N
Zucker, Jason E
Chiu, Po-Hsiang
Fang, Yilu
Natarajan, Karthik
Weng, Chunhua
author_facet Ta, Casey N
Zucker, Jason E
Chiu, Po-Hsiang
Fang, Yilu
Natarajan, Karthik
Weng, Chunhua
author_sort Ta, Casey N
collection PubMed
description OBJECTIVE: To identify and characterize clinical subgroups of hospitalized Coronavirus Disease 2019 (COVID-19) patients. MATERIALS AND METHODS: Electronic health records of hospitalized COVID-19 patients at NewYork-Presbyterian/Columbia University Irving Medical Center were temporally sequenced and transformed into patient vector representations using Paragraph Vector models. K-means clustering was performed to identify subgroups. RESULTS: A diverse cohort of 11 313 patients with COVID-19 and hospitalizations between March 2, 2020 and December 1, 2021 were identified; median [IQR] age: 61.2 [40.3–74.3]; 51.5% female. Twenty subgroups of hospitalized COVID-19 patients, labeled by increasing severity, were characterized by their demographics, conditions, outcomes, and severity (mild-moderate/severe/critical). Subgroup temporal patterns were characterized by the durations in each subgroup, transitions between subgroups, and the complete paths throughout the course of hospitalization. DISCUSSION: Several subgroups had mild-moderate severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections but were hospitalized for underlying conditions (pregnancy, cardiovascular disease [CVD], etc.). Subgroup 7 included solid organ transplant recipients who mostly developed mild-moderate or severe disease. Subgroup 9 had a history of type-2 diabetes, kidney and CVD, and suffered the highest rates of heart failure (45.2%) and end-stage renal disease (80.6%). Subgroup 13 was the oldest (median: 82.7 years) and had mixed severity but high mortality (33.3%). Subgroup 17 had critical disease and the highest mortality (64.6%), with age (median: 68.1 years) being the only notable risk factor. Subgroups 18–20 had critical disease with high complication rates and long hospitalizations (median: 40+ days). All subgroups are detailed in the full text. A chord diagram depicts the most common transitions, and paths with the highest prevalence, longest hospitalizations, lowest and highest mortalities are presented. Understanding these subgroups and their pathways may aid clinicians in their decisions for better management and earlier intervention for patients.
format Online
Article
Text
id pubmed-9620768
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-96207682022-11-04 Clinical and temporal characterization of COVID-19 subgroups using patient vector embeddings of electronic health records Ta, Casey N Zucker, Jason E Chiu, Po-Hsiang Fang, Yilu Natarajan, Karthik Weng, Chunhua J Am Med Inform Assoc Research and Applications OBJECTIVE: To identify and characterize clinical subgroups of hospitalized Coronavirus Disease 2019 (COVID-19) patients. MATERIALS AND METHODS: Electronic health records of hospitalized COVID-19 patients at NewYork-Presbyterian/Columbia University Irving Medical Center were temporally sequenced and transformed into patient vector representations using Paragraph Vector models. K-means clustering was performed to identify subgroups. RESULTS: A diverse cohort of 11 313 patients with COVID-19 and hospitalizations between March 2, 2020 and December 1, 2021 were identified; median [IQR] age: 61.2 [40.3–74.3]; 51.5% female. Twenty subgroups of hospitalized COVID-19 patients, labeled by increasing severity, were characterized by their demographics, conditions, outcomes, and severity (mild-moderate/severe/critical). Subgroup temporal patterns were characterized by the durations in each subgroup, transitions between subgroups, and the complete paths throughout the course of hospitalization. DISCUSSION: Several subgroups had mild-moderate severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections but were hospitalized for underlying conditions (pregnancy, cardiovascular disease [CVD], etc.). Subgroup 7 included solid organ transplant recipients who mostly developed mild-moderate or severe disease. Subgroup 9 had a history of type-2 diabetes, kidney and CVD, and suffered the highest rates of heart failure (45.2%) and end-stage renal disease (80.6%). Subgroup 13 was the oldest (median: 82.7 years) and had mixed severity but high mortality (33.3%). Subgroup 17 had critical disease and the highest mortality (64.6%), with age (median: 68.1 years) being the only notable risk factor. Subgroups 18–20 had critical disease with high complication rates and long hospitalizations (median: 40+ days). All subgroups are detailed in the full text. A chord diagram depicts the most common transitions, and paths with the highest prevalence, longest hospitalizations, lowest and highest mortalities are presented. Understanding these subgroups and their pathways may aid clinicians in their decisions for better management and earlier intervention for patients. Oxford University Press 2022-10-18 /pmc/articles/PMC9620768/ /pubmed/36255273 http://dx.doi.org/10.1093/jamia/ocac208 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com https://academic.oup.com/pages/standard-publication-reuse-rightsThis article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/pages/standard-publication-reuse-rights)
spellingShingle Research and Applications
Ta, Casey N
Zucker, Jason E
Chiu, Po-Hsiang
Fang, Yilu
Natarajan, Karthik
Weng, Chunhua
Clinical and temporal characterization of COVID-19 subgroups using patient vector embeddings of electronic health records
title Clinical and temporal characterization of COVID-19 subgroups using patient vector embeddings of electronic health records
title_full Clinical and temporal characterization of COVID-19 subgroups using patient vector embeddings of electronic health records
title_fullStr Clinical and temporal characterization of COVID-19 subgroups using patient vector embeddings of electronic health records
title_full_unstemmed Clinical and temporal characterization of COVID-19 subgroups using patient vector embeddings of electronic health records
title_short Clinical and temporal characterization of COVID-19 subgroups using patient vector embeddings of electronic health records
title_sort clinical and temporal characterization of covid-19 subgroups using patient vector embeddings of electronic health records
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9620768/
https://www.ncbi.nlm.nih.gov/pubmed/36255273
http://dx.doi.org/10.1093/jamia/ocac208
work_keys_str_mv AT tacaseyn clinicalandtemporalcharacterizationofcovid19subgroupsusingpatientvectorembeddingsofelectronichealthrecords
AT zuckerjasone clinicalandtemporalcharacterizationofcovid19subgroupsusingpatientvectorembeddingsofelectronichealthrecords
AT chiupohsiang clinicalandtemporalcharacterizationofcovid19subgroupsusingpatientvectorembeddingsofelectronichealthrecords
AT fangyilu clinicalandtemporalcharacterizationofcovid19subgroupsusingpatientvectorembeddingsofelectronichealthrecords
AT natarajankarthik clinicalandtemporalcharacterizationofcovid19subgroupsusingpatientvectorembeddingsofelectronichealthrecords
AT wengchunhua clinicalandtemporalcharacterizationofcovid19subgroupsusingpatientvectorembeddingsofelectronichealthrecords