Cargando…

Preliminary exploration of survival analysis using the OHDSI common data model: a case study of intrahepatic cholangiocarcinoma

BACKGROUND: Data heterogeneity is a common phenomenon related to the secondary use of electronic health records (EHR) data from different sources. The Observational Health Data Sciences and Informatics (OHDSI) Common Data Model (CDM) organizes healthcare data into standard data structures using conc...

Descripción completa

Detalles Bibliográficos
Autores principales: Hong, Na, Zhang, Ning, Wu, Huawei, Lu, Shanshan, Yu, Yue, Hou, Li, Lu, Yinying, Liu, Hongfang, Jiang, Guoqian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6284277/
https://www.ncbi.nlm.nih.gov/pubmed/30526572
http://dx.doi.org/10.1186/s12911-018-0686-7
_version_ 1783379306332815360
author Hong, Na
Zhang, Ning
Wu, Huawei
Lu, Shanshan
Yu, Yue
Hou, Li
Lu, Yinying
Liu, Hongfang
Jiang, Guoqian
author_facet Hong, Na
Zhang, Ning
Wu, Huawei
Lu, Shanshan
Yu, Yue
Hou, Li
Lu, Yinying
Liu, Hongfang
Jiang, Guoqian
author_sort Hong, Na
collection PubMed
description BACKGROUND: Data heterogeneity is a common phenomenon related to the secondary use of electronic health records (EHR) data from different sources. The Observational Health Data Sciences and Informatics (OHDSI) Common Data Model (CDM) organizes healthcare data into standard data structures using concepts that are explicitly and formally specified through standard vocabularies, thereby facilitating large-scale analysis. The objective of this study is to design, develop, and evaluate generic survival analysis routines built using the OHDSI CDM. METHODS: We used intrahepatic cholangiocarcinoma (ICC) patient data to implement CDM-based survival analysis methods. Our methods comprise the following modules: 1) Mapping local terms to standard OHDSI concepts. The analytical expression of variables and values related to demographic characteristics, medical history, smoking status, laboratory results, and tumor feature data. These data were mapped to standard OHDSI concepts through a manual analysis; 2) Loading patient data into the CDM using the concept mappings; 3) Developing an R interface that supports the portable survival analysis on top of OHDSI CDM, and comparing the CDM-based analysis results with those using traditional statistical analysis methods. RESULTS: Our dataset contained 346 patients diagnosed with ICC. The collected clinical data contains 115 variables, of which 75 variables were mapped to the OHDSI concepts. These concepts mainly belong to four domains: condition, observation, measurement, and procedure. The corresponding standard concepts are scattered in six vocabularies: ICD10CM, ICD10PCS, SNOMED, LOINC, NDFRT, and READ. We loaded a total of 25,950 patient data records into the OHDSI CDM database. However, 40 variables failed to map to the OHDSI CDM as they mostly belong to imaging data and pathological data. CONCLUSIONS: Our study demonstrates that conducting survival analysis using the OHDSI CDM is feasible and can produce reusable analysis routines. However, challenges to be overcome include 1) semantic loss caused by inaccurate mapping and value normalization; 2) incomplete OHDSI vocabularies describing imaging data, pathological data, and modular data representation.
format Online
Article
Text
id pubmed-6284277
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-62842772018-12-14 Preliminary exploration of survival analysis using the OHDSI common data model: a case study of intrahepatic cholangiocarcinoma Hong, Na Zhang, Ning Wu, Huawei Lu, Shanshan Yu, Yue Hou, Li Lu, Yinying Liu, Hongfang Jiang, Guoqian BMC Med Inform Decis Mak Research BACKGROUND: Data heterogeneity is a common phenomenon related to the secondary use of electronic health records (EHR) data from different sources. The Observational Health Data Sciences and Informatics (OHDSI) Common Data Model (CDM) organizes healthcare data into standard data structures using concepts that are explicitly and formally specified through standard vocabularies, thereby facilitating large-scale analysis. The objective of this study is to design, develop, and evaluate generic survival analysis routines built using the OHDSI CDM. METHODS: We used intrahepatic cholangiocarcinoma (ICC) patient data to implement CDM-based survival analysis methods. Our methods comprise the following modules: 1) Mapping local terms to standard OHDSI concepts. The analytical expression of variables and values related to demographic characteristics, medical history, smoking status, laboratory results, and tumor feature data. These data were mapped to standard OHDSI concepts through a manual analysis; 2) Loading patient data into the CDM using the concept mappings; 3) Developing an R interface that supports the portable survival analysis on top of OHDSI CDM, and comparing the CDM-based analysis results with those using traditional statistical analysis methods. RESULTS: Our dataset contained 346 patients diagnosed with ICC. The collected clinical data contains 115 variables, of which 75 variables were mapped to the OHDSI concepts. These concepts mainly belong to four domains: condition, observation, measurement, and procedure. The corresponding standard concepts are scattered in six vocabularies: ICD10CM, ICD10PCS, SNOMED, LOINC, NDFRT, and READ. We loaded a total of 25,950 patient data records into the OHDSI CDM database. However, 40 variables failed to map to the OHDSI CDM as they mostly belong to imaging data and pathological data. CONCLUSIONS: Our study demonstrates that conducting survival analysis using the OHDSI CDM is feasible and can produce reusable analysis routines. However, challenges to be overcome include 1) semantic loss caused by inaccurate mapping and value normalization; 2) incomplete OHDSI vocabularies describing imaging data, pathological data, and modular data representation. BioMed Central 2018-12-07 /pmc/articles/PMC6284277/ /pubmed/30526572 http://dx.doi.org/10.1186/s12911-018-0686-7 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Hong, Na
Zhang, Ning
Wu, Huawei
Lu, Shanshan
Yu, Yue
Hou, Li
Lu, Yinying
Liu, Hongfang
Jiang, Guoqian
Preliminary exploration of survival analysis using the OHDSI common data model: a case study of intrahepatic cholangiocarcinoma
title Preliminary exploration of survival analysis using the OHDSI common data model: a case study of intrahepatic cholangiocarcinoma
title_full Preliminary exploration of survival analysis using the OHDSI common data model: a case study of intrahepatic cholangiocarcinoma
title_fullStr Preliminary exploration of survival analysis using the OHDSI common data model: a case study of intrahepatic cholangiocarcinoma
title_full_unstemmed Preliminary exploration of survival analysis using the OHDSI common data model: a case study of intrahepatic cholangiocarcinoma
title_short Preliminary exploration of survival analysis using the OHDSI common data model: a case study of intrahepatic cholangiocarcinoma
title_sort preliminary exploration of survival analysis using the ohdsi common data model: a case study of intrahepatic cholangiocarcinoma
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6284277/
https://www.ncbi.nlm.nih.gov/pubmed/30526572
http://dx.doi.org/10.1186/s12911-018-0686-7
work_keys_str_mv AT hongna preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma
AT zhangning preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma
AT wuhuawei preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma
AT lushanshan preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma
AT yuyue preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma
AT houli preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma
AT luyinying preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma
AT liuhongfang preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma
AT jiangguoqian preliminaryexplorationofsurvivalanalysisusingtheohdsicommondatamodelacasestudyofintrahepaticcholangiocarcinoma