Cargando…

Temporal Events Detector for Pregnancy Care (TED-PC): A rule-based algorithm to infer gestational age and delivery date from electronic health records of pregnant women with and without COVID-19

OBJECTIVE: Identifying the time of SARS-CoV-2 viral infection relative to specific gestational weeks is critical for delineating the role of viral infection timing in adverse pregnancy outcomes. However, this task is difficult when it comes to Electronic Health Records (EHR). In combating the COVID-...

Descripción completa

Detalles Bibliográficos
Autores principales: Lyu, Tianchu, Liang, Chen, Liu, Jihong, Campbell, Berry, Hung, Peiyin, Shih, Yi-Wen, Ghumman, Nadia, Li, Xiaoming
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9621451/
https://www.ncbi.nlm.nih.gov/pubmed/36315520
http://dx.doi.org/10.1371/journal.pone.0276923
_version_ 1784821557623783424
author Lyu, Tianchu
Liang, Chen
Liu, Jihong
Campbell, Berry
Hung, Peiyin
Shih, Yi-Wen
Ghumman, Nadia
Li, Xiaoming
author_facet Lyu, Tianchu
Liang, Chen
Liu, Jihong
Campbell, Berry
Hung, Peiyin
Shih, Yi-Wen
Ghumman, Nadia
Li, Xiaoming
author_sort Lyu, Tianchu
collection PubMed
description OBJECTIVE: Identifying the time of SARS-CoV-2 viral infection relative to specific gestational weeks is critical for delineating the role of viral infection timing in adverse pregnancy outcomes. However, this task is difficult when it comes to Electronic Health Records (EHR). In combating the COVID-19 pandemic for maternal health, we sought to develop and validate a clinical information extraction algorithm to detect the time of clinical events relative to gestational weeks. MATERIALS AND METHODS: We used EHR from the National COVID Cohort Collaborative (N3C), in which the EHR are normalized by the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). We performed EHR phenotyping, resulting in 270,897 pregnant women (June 1(st), 2018 to May 31(st), 2021). We developed a rule-based algorithm and performed a multi-level evaluation to test content validity and clinical validity, and extreme length of gestation (<150 or >300). RESULTS: The algorithm identified 296,194 pregnancies (16,659 COVID-19, 174,744 without COVID-19) in 270,897 pregnant women. For inferring gestational age, 95% cases (n = 40) have moderate-high accuracy (Cohen’s Kappa = 0.62); 100% cases (n = 40) have moderate-high granularity of temporal information (Cohen’s Kappa = 1). For inferring delivery dates, the accuracy is 100% (Cohen’s Kappa = 1). The accuracy of gestational age detection for the extreme length of gestation is 93.3% (Cohen’s Kappa = 1). Mothers with COVID-19 showed higher prevalence in obesity or overweight (35.1% vs. 29.5%), diabetes (17.8% vs. 17.0%), chronic obstructive pulmonary disease (0.2% vs. 0.1%), respiratory distress syndrome or acute respiratory failure (1.8% vs. 0.2%). DISCUSSION: We explored the characteristics of pregnant women by different gestational weeks of SARS-CoV-2 infection with our algorithm. TED-PC is the first to infer the exact gestational week linked with every clinical event from EHR and detect the timing of SARS-CoV-2 infection in pregnant women. CONCLUSION: The algorithm shows excellent clinical validity in inferring gestational age and delivery dates, which supports multiple EHR cohorts on N3C studying the impact of COVID-19 on pregnancy.
format Online
Article
Text
id pubmed-9621451
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-96214512022-11-01 Temporal Events Detector for Pregnancy Care (TED-PC): A rule-based algorithm to infer gestational age and delivery date from electronic health records of pregnant women with and without COVID-19 Lyu, Tianchu Liang, Chen Liu, Jihong Campbell, Berry Hung, Peiyin Shih, Yi-Wen Ghumman, Nadia Li, Xiaoming PLoS One Research Article OBJECTIVE: Identifying the time of SARS-CoV-2 viral infection relative to specific gestational weeks is critical for delineating the role of viral infection timing in adverse pregnancy outcomes. However, this task is difficult when it comes to Electronic Health Records (EHR). In combating the COVID-19 pandemic for maternal health, we sought to develop and validate a clinical information extraction algorithm to detect the time of clinical events relative to gestational weeks. MATERIALS AND METHODS: We used EHR from the National COVID Cohort Collaborative (N3C), in which the EHR are normalized by the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). We performed EHR phenotyping, resulting in 270,897 pregnant women (June 1(st), 2018 to May 31(st), 2021). We developed a rule-based algorithm and performed a multi-level evaluation to test content validity and clinical validity, and extreme length of gestation (<150 or >300). RESULTS: The algorithm identified 296,194 pregnancies (16,659 COVID-19, 174,744 without COVID-19) in 270,897 pregnant women. For inferring gestational age, 95% cases (n = 40) have moderate-high accuracy (Cohen’s Kappa = 0.62); 100% cases (n = 40) have moderate-high granularity of temporal information (Cohen’s Kappa = 1). For inferring delivery dates, the accuracy is 100% (Cohen’s Kappa = 1). The accuracy of gestational age detection for the extreme length of gestation is 93.3% (Cohen’s Kappa = 1). Mothers with COVID-19 showed higher prevalence in obesity or overweight (35.1% vs. 29.5%), diabetes (17.8% vs. 17.0%), chronic obstructive pulmonary disease (0.2% vs. 0.1%), respiratory distress syndrome or acute respiratory failure (1.8% vs. 0.2%). DISCUSSION: We explored the characteristics of pregnant women by different gestational weeks of SARS-CoV-2 infection with our algorithm. TED-PC is the first to infer the exact gestational week linked with every clinical event from EHR and detect the timing of SARS-CoV-2 infection in pregnant women. CONCLUSION: The algorithm shows excellent clinical validity in inferring gestational age and delivery dates, which supports multiple EHR cohorts on N3C studying the impact of COVID-19 on pregnancy. Public Library of Science 2022-10-31 /pmc/articles/PMC9621451/ /pubmed/36315520 http://dx.doi.org/10.1371/journal.pone.0276923 Text en © 2022 Lyu et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lyu, Tianchu
Liang, Chen
Liu, Jihong
Campbell, Berry
Hung, Peiyin
Shih, Yi-Wen
Ghumman, Nadia
Li, Xiaoming
Temporal Events Detector for Pregnancy Care (TED-PC): A rule-based algorithm to infer gestational age and delivery date from electronic health records of pregnant women with and without COVID-19
title Temporal Events Detector for Pregnancy Care (TED-PC): A rule-based algorithm to infer gestational age and delivery date from electronic health records of pregnant women with and without COVID-19
title_full Temporal Events Detector for Pregnancy Care (TED-PC): A rule-based algorithm to infer gestational age and delivery date from electronic health records of pregnant women with and without COVID-19
title_fullStr Temporal Events Detector for Pregnancy Care (TED-PC): A rule-based algorithm to infer gestational age and delivery date from electronic health records of pregnant women with and without COVID-19
title_full_unstemmed Temporal Events Detector for Pregnancy Care (TED-PC): A rule-based algorithm to infer gestational age and delivery date from electronic health records of pregnant women with and without COVID-19
title_short Temporal Events Detector for Pregnancy Care (TED-PC): A rule-based algorithm to infer gestational age and delivery date from electronic health records of pregnant women with and without COVID-19
title_sort temporal events detector for pregnancy care (ted-pc): a rule-based algorithm to infer gestational age and delivery date from electronic health records of pregnant women with and without covid-19
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9621451/
https://www.ncbi.nlm.nih.gov/pubmed/36315520
http://dx.doi.org/10.1371/journal.pone.0276923
work_keys_str_mv AT lyutianchu temporaleventsdetectorforpregnancycaretedpcarulebasedalgorithmtoinfergestationalageanddeliverydatefromelectronichealthrecordsofpregnantwomenwithandwithoutcovid19
AT liangchen temporaleventsdetectorforpregnancycaretedpcarulebasedalgorithmtoinfergestationalageanddeliverydatefromelectronichealthrecordsofpregnantwomenwithandwithoutcovid19
AT liujihong temporaleventsdetectorforpregnancycaretedpcarulebasedalgorithmtoinfergestationalageanddeliverydatefromelectronichealthrecordsofpregnantwomenwithandwithoutcovid19
AT campbellberry temporaleventsdetectorforpregnancycaretedpcarulebasedalgorithmtoinfergestationalageanddeliverydatefromelectronichealthrecordsofpregnantwomenwithandwithoutcovid19
AT hungpeiyin temporaleventsdetectorforpregnancycaretedpcarulebasedalgorithmtoinfergestationalageanddeliverydatefromelectronichealthrecordsofpregnantwomenwithandwithoutcovid19
AT shihyiwen temporaleventsdetectorforpregnancycaretedpcarulebasedalgorithmtoinfergestationalageanddeliverydatefromelectronichealthrecordsofpregnantwomenwithandwithoutcovid19
AT ghummannadia temporaleventsdetectorforpregnancycaretedpcarulebasedalgorithmtoinfergestationalageanddeliverydatefromelectronichealthrecordsofpregnantwomenwithandwithoutcovid19
AT lixiaoming temporaleventsdetectorforpregnancycaretedpcarulebasedalgorithmtoinfergestationalageanddeliverydatefromelectronichealthrecordsofpregnantwomenwithandwithoutcovid19
AT temporaleventsdetectorforpregnancycaretedpcarulebasedalgorithmtoinfergestationalageanddeliverydatefromelectronichealthrecordsofpregnantwomenwithandwithoutcovid19