Cargando…

Standardizing registry data to the OMOP Common Data Model: experience from three pulmonary hypertension databases

BACKGROUND: The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) can be used to transform observational health data to a common format. CDM transformation allows for analysis across disparate databases for the generation of new, real-word evidence, which is especially import...

Descripción completa

Detalles Bibliográficos
Autores principales: Biedermann, Patricia, Ong, Rose, Davydov, Alexander, Orlova, Alexandra, Solovyev, Philip, Sun, Hong, Wetherill, Graham, Brand, Monika, Didden, Eva-Maria
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8565035/
https://www.ncbi.nlm.nih.gov/pubmed/34727871
http://dx.doi.org/10.1186/s12874-021-01434-3
_version_ 1784593735902822400
author Biedermann, Patricia
Ong, Rose
Davydov, Alexander
Orlova, Alexandra
Solovyev, Philip
Sun, Hong
Wetherill, Graham
Brand, Monika
Didden, Eva-Maria
author_facet Biedermann, Patricia
Ong, Rose
Davydov, Alexander
Orlova, Alexandra
Solovyev, Philip
Sun, Hong
Wetherill, Graham
Brand, Monika
Didden, Eva-Maria
author_sort Biedermann, Patricia
collection PubMed
description BACKGROUND: The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) can be used to transform observational health data to a common format. CDM transformation allows for analysis across disparate databases for the generation of new, real-word evidence, which is especially important in rare disease where data are limited. Pulmonary hypertension (PH) is a progressive, life-threatening disease, with rare subgroups such as pulmonary arterial hypertension (PAH), for which generating real-world evidence is challenging. Our objective is to document the process and outcomes of transforming registry data in PH to the OMOP CDM, and highlight challenges and our potential solutions. METHODS: Three observational studies were transformed from the Clinical Data Interchange Standards Consortium study data tabulation model (SDTM) to OMOP CDM format. OPUS was a prospective, multi-centre registry (2014–2020) and OrPHeUS was a retrospective, multi-centre chart review (2013–2017); both enrolled patients newly treated with macitentan in the US. EXPOSURE is a prospective, multi-centre cohort study (2017–ongoing) of patients newly treated with selexipag or any PAH-specific therapy in Europe and Canada. OMOP CDM version 5.3.1 with recent OMOP CDM vocabulary was used. Imputation rules were defined and applied for missing dates to avoid exclusion of data. Custom target concepts were introduced when existing concepts did not provide sufficient granularity. RESULTS: Of the 6622 patients in the three registry studies, records were mapped for 6457. Custom target concepts were introduced for PAH subgroups (by combining SNOMED concepts or creating custom concepts) and World Health Organization functional class. Per the OMOP CDM convention, records about the absence of an event, or the lack of information, were not mapped. Excluding these non-event records, 4% (OPUS), 2% (OrPHeUS) and 1% (EXPOSURE) of records were not mapped. CONCLUSIONS: SDTM data from three registries were transformed to the OMOP CDM with limited exclusion of data and deviation from the SDTM database content. Future researchers can apply our strategy and methods in different disease areas, with tailoring as necessary. Mapping registry data to the OMOP CDM facilitates more efficient collaborations between researchers and establishment of federated data networks, which is an unmet need in rare diseases. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-021-01434-3.
format Online
Article
Text
id pubmed-8565035
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-85650352021-11-04 Standardizing registry data to the OMOP Common Data Model: experience from three pulmonary hypertension databases Biedermann, Patricia Ong, Rose Davydov, Alexander Orlova, Alexandra Solovyev, Philip Sun, Hong Wetherill, Graham Brand, Monika Didden, Eva-Maria BMC Med Res Methodol Research BACKGROUND: The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) can be used to transform observational health data to a common format. CDM transformation allows for analysis across disparate databases for the generation of new, real-word evidence, which is especially important in rare disease where data are limited. Pulmonary hypertension (PH) is a progressive, life-threatening disease, with rare subgroups such as pulmonary arterial hypertension (PAH), for which generating real-world evidence is challenging. Our objective is to document the process and outcomes of transforming registry data in PH to the OMOP CDM, and highlight challenges and our potential solutions. METHODS: Three observational studies were transformed from the Clinical Data Interchange Standards Consortium study data tabulation model (SDTM) to OMOP CDM format. OPUS was a prospective, multi-centre registry (2014–2020) and OrPHeUS was a retrospective, multi-centre chart review (2013–2017); both enrolled patients newly treated with macitentan in the US. EXPOSURE is a prospective, multi-centre cohort study (2017–ongoing) of patients newly treated with selexipag or any PAH-specific therapy in Europe and Canada. OMOP CDM version 5.3.1 with recent OMOP CDM vocabulary was used. Imputation rules were defined and applied for missing dates to avoid exclusion of data. Custom target concepts were introduced when existing concepts did not provide sufficient granularity. RESULTS: Of the 6622 patients in the three registry studies, records were mapped for 6457. Custom target concepts were introduced for PAH subgroups (by combining SNOMED concepts or creating custom concepts) and World Health Organization functional class. Per the OMOP CDM convention, records about the absence of an event, or the lack of information, were not mapped. Excluding these non-event records, 4% (OPUS), 2% (OrPHeUS) and 1% (EXPOSURE) of records were not mapped. CONCLUSIONS: SDTM data from three registries were transformed to the OMOP CDM with limited exclusion of data and deviation from the SDTM database content. Future researchers can apply our strategy and methods in different disease areas, with tailoring as necessary. Mapping registry data to the OMOP CDM facilitates more efficient collaborations between researchers and establishment of federated data networks, which is an unmet need in rare diseases. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-021-01434-3. BioMed Central 2021-11-02 /pmc/articles/PMC8565035/ /pubmed/34727871 http://dx.doi.org/10.1186/s12874-021-01434-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Biedermann, Patricia
Ong, Rose
Davydov, Alexander
Orlova, Alexandra
Solovyev, Philip
Sun, Hong
Wetherill, Graham
Brand, Monika
Didden, Eva-Maria
Standardizing registry data to the OMOP Common Data Model: experience from three pulmonary hypertension databases
title Standardizing registry data to the OMOP Common Data Model: experience from three pulmonary hypertension databases
title_full Standardizing registry data to the OMOP Common Data Model: experience from three pulmonary hypertension databases
title_fullStr Standardizing registry data to the OMOP Common Data Model: experience from three pulmonary hypertension databases
title_full_unstemmed Standardizing registry data to the OMOP Common Data Model: experience from three pulmonary hypertension databases
title_short Standardizing registry data to the OMOP Common Data Model: experience from three pulmonary hypertension databases
title_sort standardizing registry data to the omop common data model: experience from three pulmonary hypertension databases
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8565035/
https://www.ncbi.nlm.nih.gov/pubmed/34727871
http://dx.doi.org/10.1186/s12874-021-01434-3
work_keys_str_mv AT biedermannpatricia standardizingregistrydatatotheomopcommondatamodelexperiencefromthreepulmonaryhypertensiondatabases
AT ongrose standardizingregistrydatatotheomopcommondatamodelexperiencefromthreepulmonaryhypertensiondatabases
AT davydovalexander standardizingregistrydatatotheomopcommondatamodelexperiencefromthreepulmonaryhypertensiondatabases
AT orlovaalexandra standardizingregistrydatatotheomopcommondatamodelexperiencefromthreepulmonaryhypertensiondatabases
AT solovyevphilip standardizingregistrydatatotheomopcommondatamodelexperiencefromthreepulmonaryhypertensiondatabases
AT sunhong standardizingregistrydatatotheomopcommondatamodelexperiencefromthreepulmonaryhypertensiondatabases
AT wetherillgraham standardizingregistrydatatotheomopcommondatamodelexperiencefromthreepulmonaryhypertensiondatabases
AT brandmonika standardizingregistrydatatotheomopcommondatamodelexperiencefromthreepulmonaryhypertensiondatabases
AT diddenevamaria standardizingregistrydatatotheomopcommondatamodelexperiencefromthreepulmonaryhypertensiondatabases