Cargando…
Preliminary exploration of survival analysis using the OHDSI common data model: a case study of intrahepatic cholangiocarcinoma
BACKGROUND: Data heterogeneity is a common phenomenon related to the secondary use of electronic health records (EHR) data from different sources. The Observational Health Data Sciences and Informatics (OHDSI) Common Data Model (CDM) organizes healthcare data into standard data structures using conc...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6284277/ https://www.ncbi.nlm.nih.gov/pubmed/30526572 http://dx.doi.org/10.1186/s12911-018-0686-7 |
_version_ | 1783379306332815360 |
---|---|
author | Hong, Na Zhang, Ning Wu, Huawei Lu, Shanshan Yu, Yue Hou, Li Lu, Yinying Liu, Hongfang Jiang, Guoqian |
author_facet | Hong, Na Zhang, Ning Wu, Huawei Lu, Shanshan Yu, Yue Hou, Li Lu, Yinying Liu, Hongfang Jiang, Guoqian |
author_sort | Hong, Na |
collection | PubMed |
description | BACKGROUND: Data heterogeneity is a common phenomenon related to the secondary use of electronic health records (EHR) data from different sources. The Observational Health Data Sciences and Informatics (OHDSI) Common Data Model (CDM) organizes healthcare data into standard data structures using concepts that are explicitly and formally specified through standard vocabularies, thereby facilitating large-scale analysis. The objective of this study is to design, develop, and evaluate generic survival analysis routines built using the OHDSI CDM. METHODS: We used intrahepatic cholangiocarcinoma (ICC) patient data to implement CDM-based survival analysis methods. Our methods comprise the following modules: 1) Mapping local terms to standard OHDSI concepts. The analytical expression of variables and values related to demographic characteristics, medical history, smoking status, laboratory results, and tumor feature data. These data were mapped to standard OHDSI concepts through a manual analysis; 2) Loading patient data into the CDM using the concept mappings; 3) Developing an R interface that supports the portable survival analysis on top of OHDSI CDM, and comparing the CDM-based analysis results with those using traditional statistical analysis methods. RESULTS: Our dataset contained 346 patients diagnosed with ICC. The collected clinical data contains 115 variables, of which 75 variables were mapped to the OHDSI concepts. These concepts mainly belong to four domains: condition, observation, measurement, and procedure. The corresponding standard concepts are scattered in six vocabularies: ICD10CM, ICD10PCS, SNOMED, LOINC, NDFRT, and READ. We loaded a total of 25,950 patient data records into the OHDSI CDM database. However, 40 variables failed to map to the OHDSI CDM as they mostly belong to imaging data and pathological data. CONCLUSIONS: Our study demonstrates that conducting survival analysis using the OHDSI CDM is feasible and can produce reusable analysis routines. However, challenges to be overcome include 1) semantic loss caused by inaccurate mapping and value normalization; 2) incomplete OHDSI vocabularies describing imaging data, pathological data, and modular data representation. |
format | Online Article Text |
id | pubmed-6284277 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-62842772018-12-14 Preliminary exploration of survival analysis using the OHDSI common data model: a case study of intrahepatic cholangiocarcinoma Hong, Na Zhang, Ning Wu, Huawei Lu, Shanshan Yu, Yue Hou, Li Lu, Yinying Liu, Hongfang Jiang, Guoqian BMC Med Inform Decis Mak Research BACKGROUND: Data heterogeneity is a common phenomenon related to the secondary use of electronic health records (EHR) data from different sources. The Observational Health Data Sciences and Informatics (OHDSI) Common Data Model (CDM) organizes healthcare data into standard data structures using concepts that are explicitly and formally specified through standard vocabularies, thereby facilitating large-scale analysis. The objective of this study is to design, develop, and evaluate generic survival analysis routines built using the OHDSI CDM. METHODS: We used intrahepatic cholangiocarcinoma (ICC) patient data to implement CDM-based survival analysis methods. Our methods comprise the following modules: 1) Mapping local terms to standard OHDSI concepts. The analytical expression of variables and values related to demographic characteristics, medical history, smoking status, laboratory results, and tumor feature data. These data were mapped to standard OHDSI concepts through a manual analysis; 2) Loading patient data into the CDM using the concept mappings; 3) Developing an R interface that supports the portable survival analysis on top of OHDSI CDM, and comparing the CDM-based analysis results with those using traditional statistical analysis methods. RESULTS: Our dataset contained 346 patients diagnosed with ICC. The collected clinical data contains 115 variables, of which 75 variables were mapped to the OHDSI concepts. These concepts mainly belong to four domains: condition, observation, measurement, and procedure. The corresponding standard concepts are scattered in six vocabularies: ICD10CM, ICD10PCS, SNOMED, LOINC, NDFRT, and READ. We loaded a total of 25,950 patient data records into the OHDSI CDM database. However, 40 variables failed to map to the OHDSI CDM as they mostly belong to imaging data and pathological data. CONCLUSIONS: Our study demonstrates that conducting survival analysis using the OHDSI CDM is feasible and can produce reusable analysis routines. However, challenges to be overcome include 1) semantic loss caused by inaccurate mapping and value normalization; 2) incomplete OHDSI vocabularies describing imaging data, pathological data, and modular data representation. BioMed Central 2018-12-07 /pmc/articles/PMC6284277/ /pubmed/30526572 http://dx.doi.org/10.1186/s12911-018-0686-7 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Hong, Na Zhang, Ning Wu, Huawei Lu, Shanshan Yu, Yue Hou, Li Lu, Yinying Liu, Hongfang Jiang, Guoqian Preliminary exploration of survival analysis using the OHDSI common data model: a case study of intrahepatic cholangiocarcinoma |
title | Preliminary exploration of survival analysis using the OHDSI common data model: a case study of intrahepatic cholangiocarcinoma |
title_full | Preliminary exploration of survival analysis using the OHDSI common data model: a case study of intrahepatic cholangiocarcinoma |
title_fullStr | Preliminary exploration of survival analysis using the OHDSI common data model: a case study of intrahepatic cholangiocarcinoma |
title_full_unstemmed | Preliminary exploration of survival analysis using the OHDSI common data model: a case study of intrahepatic cholangiocarcinoma |
title_short | Preliminary exploration of survival analysis using the OHDSI common data model: a case study of intrahepatic cholangiocarcinoma |
title_sort | preliminary exploration of survival analysis using the ohdsi common data model: a case study of intrahepatic cholangiocarcinoma |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6284277/ https://www.ncbi.nlm.nih.gov/pubmed/30526572 http://dx.doi.org/10.1186/s12911-018-0686-7 |
work_keys_str_mv | AT hongna preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma AT zhangning preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma AT wuhuawei preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma AT lushanshan preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma AT yuyue preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma AT houli preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma AT luyinying preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma AT liuhongfang preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma AT jiangguoqian preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma |