Cargando…

Characterization of long COVID temporal sub-phenotypes by distributed representation learning from electronic health record data: a cohort study

BACKGROUND: Characterizing Post-Acute Sequelae of COVID (SARS-CoV-2 Infection), or PASC has been challenging due to the multitude of sub-phenotypes, temporal attributes, and definitions. Scalable characterization of PASC sub-phenotypes can enhance screening capacities, disease management, and treatm...

Descripción completa

Detalles Bibliográficos
Autores principales: Dagliati, Arianna, Strasser, Zachary H., Hossein Abad, Zahra Shakeri, Klann, Jeffrey G., Wagholikar, Kavishwar B., Mesa, Rebecca, Visweswaran, Shyam, Morris, Michele, Luo, Yuan, Henderson, Darren W., Samayamuthu, Malarkodi Jebathilagam, Tan, Bryce W.Q., Verdy, Guillame, Omenn, Gilbert S., Xia, Zongqi, Bellazzi, Riccardo, Murphy, Shawn N., Holmes, John H., Estiri, Hossein
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10511779/
https://www.ncbi.nlm.nih.gov/pubmed/37745021
http://dx.doi.org/10.1016/j.eclinm.2023.102210
_version_ 1785108217555058688
author Dagliati, Arianna
Strasser, Zachary H.
Hossein Abad, Zahra Shakeri
Klann, Jeffrey G.
Wagholikar, Kavishwar B.
Mesa, Rebecca
Visweswaran, Shyam
Morris, Michele
Luo, Yuan
Henderson, Darren W.
Samayamuthu, Malarkodi Jebathilagam
Tan, Bryce W.Q.
Verdy, Guillame
Omenn, Gilbert S.
Xia, Zongqi
Bellazzi, Riccardo
Murphy, Shawn N.
Holmes, John H.
Estiri, Hossein
author_facet Dagliati, Arianna
Strasser, Zachary H.
Hossein Abad, Zahra Shakeri
Klann, Jeffrey G.
Wagholikar, Kavishwar B.
Mesa, Rebecca
Visweswaran, Shyam
Morris, Michele
Luo, Yuan
Henderson, Darren W.
Samayamuthu, Malarkodi Jebathilagam
Tan, Bryce W.Q.
Verdy, Guillame
Omenn, Gilbert S.
Xia, Zongqi
Bellazzi, Riccardo
Murphy, Shawn N.
Holmes, John H.
Estiri, Hossein
author_sort Dagliati, Arianna
collection PubMed
description BACKGROUND: Characterizing Post-Acute Sequelae of COVID (SARS-CoV-2 Infection), or PASC has been challenging due to the multitude of sub-phenotypes, temporal attributes, and definitions. Scalable characterization of PASC sub-phenotypes can enhance screening capacities, disease management, and treatment planning. METHODS: We conducted a retrospective multi-centre observational cohort study, leveraging longitudinal electronic health record (EHR) data of 30,422 patients from three healthcare systems in the Consortium for the Clinical Characterization of COVID-19 by EHR (4CE). From the total cohort, we applied a deductive approach on 12,424 individuals with follow-up data and developed a distributed representation learning process for providing augmented definitions for PASC sub-phenotypes. FINDINGS: Our framework characterized seven PASC sub-phenotypes. We estimated that on average 15.7% of the hospitalized COVID-19 patients were likely to suffer from at least one PASC symptom and almost 5.98%, on average, had multiple symptoms. Joint pain and dyspnea had the highest prevalence, with an average prevalence of 5.45% and 4.53%, respectively. INTERPRETATION: We provided a scalable framework to every participating healthcare system for estimating PASC sub-phenotypes prevalence and temporal attributes, thus developing a unified model that characterizes augmented sub-phenotypes across the different systems. FUNDING: Authors are supported by 10.13039/100000060National Institute of Allergy and Infectious Diseases, 10.13039/100000049National Institute on Aging, 10.13039/100006108National Center for Advancing Translational Sciences, 10.13039/501100001349National Medical Research Council, 10.13039/100000065National Institute of Neurological Disorders and Stroke, 10.13039/501100000780European Union, 10.13039/100000002National Institutes of Health, 10.13039/100006108National Center for Advancing Translational Sciences.
format Online
Article
Text
id pubmed-10511779
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-105117792023-09-22 Characterization of long COVID temporal sub-phenotypes by distributed representation learning from electronic health record data: a cohort study Dagliati, Arianna Strasser, Zachary H. Hossein Abad, Zahra Shakeri Klann, Jeffrey G. Wagholikar, Kavishwar B. Mesa, Rebecca Visweswaran, Shyam Morris, Michele Luo, Yuan Henderson, Darren W. Samayamuthu, Malarkodi Jebathilagam Tan, Bryce W.Q. Verdy, Guillame Omenn, Gilbert S. Xia, Zongqi Bellazzi, Riccardo Murphy, Shawn N. Holmes, John H. Estiri, Hossein eClinicalMedicine Articles BACKGROUND: Characterizing Post-Acute Sequelae of COVID (SARS-CoV-2 Infection), or PASC has been challenging due to the multitude of sub-phenotypes, temporal attributes, and definitions. Scalable characterization of PASC sub-phenotypes can enhance screening capacities, disease management, and treatment planning. METHODS: We conducted a retrospective multi-centre observational cohort study, leveraging longitudinal electronic health record (EHR) data of 30,422 patients from three healthcare systems in the Consortium for the Clinical Characterization of COVID-19 by EHR (4CE). From the total cohort, we applied a deductive approach on 12,424 individuals with follow-up data and developed a distributed representation learning process for providing augmented definitions for PASC sub-phenotypes. FINDINGS: Our framework characterized seven PASC sub-phenotypes. We estimated that on average 15.7% of the hospitalized COVID-19 patients were likely to suffer from at least one PASC symptom and almost 5.98%, on average, had multiple symptoms. Joint pain and dyspnea had the highest prevalence, with an average prevalence of 5.45% and 4.53%, respectively. INTERPRETATION: We provided a scalable framework to every participating healthcare system for estimating PASC sub-phenotypes prevalence and temporal attributes, thus developing a unified model that characterizes augmented sub-phenotypes across the different systems. FUNDING: Authors are supported by 10.13039/100000060National Institute of Allergy and Infectious Diseases, 10.13039/100000049National Institute on Aging, 10.13039/100006108National Center for Advancing Translational Sciences, 10.13039/501100001349National Medical Research Council, 10.13039/100000065National Institute of Neurological Disorders and Stroke, 10.13039/501100000780European Union, 10.13039/100000002National Institutes of Health, 10.13039/100006108National Center for Advancing Translational Sciences. Elsevier 2023-09-14 /pmc/articles/PMC10511779/ /pubmed/37745021 http://dx.doi.org/10.1016/j.eclinm.2023.102210 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Articles
Dagliati, Arianna
Strasser, Zachary H.
Hossein Abad, Zahra Shakeri
Klann, Jeffrey G.
Wagholikar, Kavishwar B.
Mesa, Rebecca
Visweswaran, Shyam
Morris, Michele
Luo, Yuan
Henderson, Darren W.
Samayamuthu, Malarkodi Jebathilagam
Tan, Bryce W.Q.
Verdy, Guillame
Omenn, Gilbert S.
Xia, Zongqi
Bellazzi, Riccardo
Murphy, Shawn N.
Holmes, John H.
Estiri, Hossein
Characterization of long COVID temporal sub-phenotypes by distributed representation learning from electronic health record data: a cohort study
title Characterization of long COVID temporal sub-phenotypes by distributed representation learning from electronic health record data: a cohort study
title_full Characterization of long COVID temporal sub-phenotypes by distributed representation learning from electronic health record data: a cohort study
title_fullStr Characterization of long COVID temporal sub-phenotypes by distributed representation learning from electronic health record data: a cohort study
title_full_unstemmed Characterization of long COVID temporal sub-phenotypes by distributed representation learning from electronic health record data: a cohort study
title_short Characterization of long COVID temporal sub-phenotypes by distributed representation learning from electronic health record data: a cohort study
title_sort characterization of long covid temporal sub-phenotypes by distributed representation learning from electronic health record data: a cohort study
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10511779/
https://www.ncbi.nlm.nih.gov/pubmed/37745021
http://dx.doi.org/10.1016/j.eclinm.2023.102210
work_keys_str_mv AT dagliatiarianna characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT strasserzacharyh characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT hosseinabadzahrashakeri characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT klannjeffreyg characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT wagholikarkavishwarb characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT mesarebecca characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT visweswaranshyam characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT morrismichele characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT luoyuan characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT hendersondarrenw characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT samayamuthumalarkodijebathilagam characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT tanbrycewq characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT verdyguillame characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT omenngilberts characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT xiazongqi characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT bellazziriccardo characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT murphyshawnn characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT holmesjohnh characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy
AT estirihossein characterizationoflongcovidtemporalsubphenotypesbydistributedrepresentationlearningfromelectronichealthrecorddataacohortstudy