Cargando…
Who is pregnant? defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C)
OBJECTIVE: To define pregnancy episodes and estimate gestational aging within electronic health record (EHR) data from the National COVID Cohort Collaborative (N3C). MATERIALS AND METHODS: We developed a comprehensive approach, named Hierarchy and rule-based pregnancy episode Inference integrated wi...
Autores principales: | , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9387155/ https://www.ncbi.nlm.nih.gov/pubmed/35982668 http://dx.doi.org/10.1101/2022.08.04.22278439 |
_version_ | 1784769963286855680 |
---|---|
author | Jones, Sara Bradwell, Katie R. Chan, Lauren E. Olson-Chen, Courtney Tarleton, Jessica Wilkins, Kenneth J. Qin, Qiuyuan Faherty, Emily Groene Lau, Yan Kwan Xie, Catherine Kao, Yu-Han Liebman, Michael N. Mariona, Federico Challa, Anup Li, Li Ratcliffe, Sarah J. McMurry, Julie A. Haendel, Melissa A. Patel, Rena C. Hill, Elaine L. |
author_facet | Jones, Sara Bradwell, Katie R. Chan, Lauren E. Olson-Chen, Courtney Tarleton, Jessica Wilkins, Kenneth J. Qin, Qiuyuan Faherty, Emily Groene Lau, Yan Kwan Xie, Catherine Kao, Yu-Han Liebman, Michael N. Mariona, Federico Challa, Anup Li, Li Ratcliffe, Sarah J. McMurry, Julie A. Haendel, Melissa A. Patel, Rena C. Hill, Elaine L. |
author_sort | Jones, Sara |
collection | PubMed |
description | OBJECTIVE: To define pregnancy episodes and estimate gestational aging within electronic health record (EHR) data from the National COVID Cohort Collaborative (N3C). MATERIALS AND METHODS: We developed a comprehensive approach, named Hierarchy and rule-based pregnancy episode Inference integrated with Pregnancy Progression Signatures (HIPPS) and applied it to EHR data in the N3C from 1 January 2018 to 7 April 2022. HIPPS combines: 1) an extension of a previously published pregnancy episode algorithm, 2) a novel algorithm to detect gestational aging-specific signatures of a progressing pregnancy for further episode support, and 3) pregnancy start date inference. Clinicians performed validation of HIPPS on a subset of episodes. We then generated three types of pregnancy cohorts based on the level of precision for gestational aging and pregnancy outcomes for comparison of COVID-19 and other characteristics. RESULTS: We identified 628,165 pregnant persons with 816,471 pregnancy episodes, of which 52.3% were live births, 24.4% were other outcomes (stillbirth, ectopic pregnancy, spontaneous abortions), and 23.3% had unknown outcomes. We were able to estimate start dates within one week of precision for 431,173 (52.8%) episodes. 66,019 (8.1%) episodes had incident COVID-19 during pregnancy. Across varying COVID-19 cohorts, patient characteristics were generally similar though pregnancy outcomes differed. DISCUSSION: HIPPS provides support for pregnancy-related variables based on EHR data for researchers to define pregnancy cohorts. Our approach performed well based on clinician validation. CONCLUSION: We have developed a novel and robust approach for inferring pregnancy episodes and gestational aging that addresses data inconsistency and missingness in EHR data. |
format | Online Article Text |
id | pubmed-9387155 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-93871552022-08-19 Who is pregnant? defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C) Jones, Sara Bradwell, Katie R. Chan, Lauren E. Olson-Chen, Courtney Tarleton, Jessica Wilkins, Kenneth J. Qin, Qiuyuan Faherty, Emily Groene Lau, Yan Kwan Xie, Catherine Kao, Yu-Han Liebman, Michael N. Mariona, Federico Challa, Anup Li, Li Ratcliffe, Sarah J. McMurry, Julie A. Haendel, Melissa A. Patel, Rena C. Hill, Elaine L. medRxiv Article OBJECTIVE: To define pregnancy episodes and estimate gestational aging within electronic health record (EHR) data from the National COVID Cohort Collaborative (N3C). MATERIALS AND METHODS: We developed a comprehensive approach, named Hierarchy and rule-based pregnancy episode Inference integrated with Pregnancy Progression Signatures (HIPPS) and applied it to EHR data in the N3C from 1 January 2018 to 7 April 2022. HIPPS combines: 1) an extension of a previously published pregnancy episode algorithm, 2) a novel algorithm to detect gestational aging-specific signatures of a progressing pregnancy for further episode support, and 3) pregnancy start date inference. Clinicians performed validation of HIPPS on a subset of episodes. We then generated three types of pregnancy cohorts based on the level of precision for gestational aging and pregnancy outcomes for comparison of COVID-19 and other characteristics. RESULTS: We identified 628,165 pregnant persons with 816,471 pregnancy episodes, of which 52.3% were live births, 24.4% were other outcomes (stillbirth, ectopic pregnancy, spontaneous abortions), and 23.3% had unknown outcomes. We were able to estimate start dates within one week of precision for 431,173 (52.8%) episodes. 66,019 (8.1%) episodes had incident COVID-19 during pregnancy. Across varying COVID-19 cohorts, patient characteristics were generally similar though pregnancy outcomes differed. DISCUSSION: HIPPS provides support for pregnancy-related variables based on EHR data for researchers to define pregnancy cohorts. Our approach performed well based on clinician validation. CONCLUSION: We have developed a novel and robust approach for inferring pregnancy episodes and gestational aging that addresses data inconsistency and missingness in EHR data. Cold Spring Harbor Laboratory 2022-08-06 /pmc/articles/PMC9387155/ /pubmed/35982668 http://dx.doi.org/10.1101/2022.08.04.22278439 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Jones, Sara Bradwell, Katie R. Chan, Lauren E. Olson-Chen, Courtney Tarleton, Jessica Wilkins, Kenneth J. Qin, Qiuyuan Faherty, Emily Groene Lau, Yan Kwan Xie, Catherine Kao, Yu-Han Liebman, Michael N. Mariona, Federico Challa, Anup Li, Li Ratcliffe, Sarah J. McMurry, Julie A. Haendel, Melissa A. Patel, Rena C. Hill, Elaine L. Who is pregnant? defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C) |
title | Who is pregnant? defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C) |
title_full | Who is pregnant? defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C) |
title_fullStr | Who is pregnant? defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C) |
title_full_unstemmed | Who is pregnant? defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C) |
title_short | Who is pregnant? defining real-world data-based pregnancy episodes in the National COVID Cohort Collaborative (N3C) |
title_sort | who is pregnant? defining real-world data-based pregnancy episodes in the national covid cohort collaborative (n3c) |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9387155/ https://www.ncbi.nlm.nih.gov/pubmed/35982668 http://dx.doi.org/10.1101/2022.08.04.22278439 |
work_keys_str_mv | AT jonessara whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT bradwellkatier whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT chanlaurene whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT olsonchencourtney whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT tarletonjessica whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT wilkinskennethj whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT qinqiuyuan whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT fahertyemilygroene whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT lauyankwan whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT xiecatherine whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT kaoyuhan whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT liebmanmichaeln whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT marionafederico whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT challaanup whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT lili whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT ratcliffesarahj whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT mcmurryjuliea whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT haendelmelissaa whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT patelrenac whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c AT hillelainel whoispregnantdefiningrealworlddatabasedpregnancyepisodesinthenationalcovidcohortcollaborativen3c |