Cargando…

Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset

Background  Diabetes mellitus (DM) is an important public health concern in Singapore and places a massive burden on health care spending. Tackling chronic diseases such as DM requires innovative strategies to integrate patients' data from diverse sources and use scientific discovery to inform...

Descripción completa

Detalles Bibliográficos
Autores principales: Sathappan, Selva Muthu Kumaran, Jeon, Young Seok, Dang, Trung Kien, Lim, Su Chi, Shao, Yi-Ming, Tai, E Shyong, Feng, Mengling
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Georg Thieme Verlag KG 2021
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8357458/
https://www.ncbi.nlm.nih.gov/pubmed/34380168
http://dx.doi.org/10.1055/s-0041-1732301
_version_ 1783737131467800576
author Sathappan, Selva Muthu Kumaran
Jeon, Young Seok
Dang, Trung Kien
Lim, Su Chi
Shao, Yi-Ming
Tai, E Shyong
Feng, Mengling
author_facet Sathappan, Selva Muthu Kumaran
Jeon, Young Seok
Dang, Trung Kien
Lim, Su Chi
Shao, Yi-Ming
Tai, E Shyong
Feng, Mengling
author_sort Sathappan, Selva Muthu Kumaran
collection PubMed
description Background  Diabetes mellitus (DM) is an important public health concern in Singapore and places a massive burden on health care spending. Tackling chronic diseases such as DM requires innovative strategies to integrate patients' data from diverse sources and use scientific discovery to inform clinical practice that can help better manage the disease. The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) was chosen as the framework for integrating data with disparate formats. Objective  The study aimed to evaluate the feasibility of converting Singapore based data source, comprising of electronic health records (EHR), cognitive and depression assessment questionnaire data to OMOP CDM standard. Additionally, we also validate whether our OMOP CDM instance is fit for the purpose of research by executing a simple treatment pathways study using Atlas, a graphical user interface tool to conduct analysis on OMOP CDM data as a proof of concept. Methods  We used de-identified EHR, cognitive, and depression assessment questionnaires data from a tertiary care hospital in Singapore to convert it to version 5.3.1 of OMOP CDM standard. We evaluate the OMOP CDM conversion by (1) assessing the mapping coverage (that is the percentage of source terms mapped to OMOP CDM standard); (2) local raw dataset versus CDM dataset analysis; and (3) Implementing Harmonized Intrinsic Data Quality Framework using an open-source R package called Data Quality Dashboard. Results  The content coverage of OMOP CDM vocabularies is more than 90% for clinical data, but only around 11% for questionnaire data. The comparison of characteristics between source and target data returned consistent results and our transformed data did not pass 38 (1.4%) out of 2,622 quality checks. Conclusion  Adoption of OMOP CDM at our site demonstrated that EHR data are feasible for standardization with minimal information loss, whereas challenges remain for standardizing cognitive and depression assessment questionnaire data that requires further work.
format Online
Article
Text
id pubmed-8357458
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Georg Thieme Verlag KG
record_format MEDLINE/PubMed
spelling pubmed-83574582021-08-17 Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset Sathappan, Selva Muthu Kumaran Jeon, Young Seok Dang, Trung Kien Lim, Su Chi Shao, Yi-Ming Tai, E Shyong Feng, Mengling Appl Clin Inform Background  Diabetes mellitus (DM) is an important public health concern in Singapore and places a massive burden on health care spending. Tackling chronic diseases such as DM requires innovative strategies to integrate patients' data from diverse sources and use scientific discovery to inform clinical practice that can help better manage the disease. The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) was chosen as the framework for integrating data with disparate formats. Objective  The study aimed to evaluate the feasibility of converting Singapore based data source, comprising of electronic health records (EHR), cognitive and depression assessment questionnaire data to OMOP CDM standard. Additionally, we also validate whether our OMOP CDM instance is fit for the purpose of research by executing a simple treatment pathways study using Atlas, a graphical user interface tool to conduct analysis on OMOP CDM data as a proof of concept. Methods  We used de-identified EHR, cognitive, and depression assessment questionnaires data from a tertiary care hospital in Singapore to convert it to version 5.3.1 of OMOP CDM standard. We evaluate the OMOP CDM conversion by (1) assessing the mapping coverage (that is the percentage of source terms mapped to OMOP CDM standard); (2) local raw dataset versus CDM dataset analysis; and (3) Implementing Harmonized Intrinsic Data Quality Framework using an open-source R package called Data Quality Dashboard. Results  The content coverage of OMOP CDM vocabularies is more than 90% for clinical data, but only around 11% for questionnaire data. The comparison of characteristics between source and target data returned consistent results and our transformed data did not pass 38 (1.4%) out of 2,622 quality checks. Conclusion  Adoption of OMOP CDM at our site demonstrated that EHR data are feasible for standardization with minimal information loss, whereas challenges remain for standardizing cognitive and depression assessment questionnaire data that requires further work. Georg Thieme Verlag KG 2021-08 2021-08-11 /pmc/articles/PMC8357458/ /pubmed/34380168 http://dx.doi.org/10.1055/s-0041-1732301 Text en The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commercial purposes, or adapted, remixed, transformed or built upon. ( https://creativecommons.org/licenses/by-nc-nd/4.0/ ) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License, which permits unrestricted reproduction and distribution, for non-commercial purposes only; and use and reproduction, but not distribution, of adapted material for non-commercial purposes only, provided the original work is properly cited.
spellingShingle Sathappan, Selva Muthu Kumaran
Jeon, Young Seok
Dang, Trung Kien
Lim, Su Chi
Shao, Yi-Ming
Tai, E Shyong
Feng, Mengling
Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset
title Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset
title_full Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset
title_fullStr Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset
title_full_unstemmed Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset
title_short Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset
title_sort transformation of electronic health records and questionnaire data to omop cdm: a feasibility study using sg_t2dm dataset
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8357458/
https://www.ncbi.nlm.nih.gov/pubmed/34380168
http://dx.doi.org/10.1055/s-0041-1732301
work_keys_str_mv AT sathappanselvamuthukumaran transformationofelectronichealthrecordsandquestionnairedatatoomopcdmafeasibilitystudyusingsgt2dmdataset
AT jeonyoungseok transformationofelectronichealthrecordsandquestionnairedatatoomopcdmafeasibilitystudyusingsgt2dmdataset
AT dangtrungkien transformationofelectronichealthrecordsandquestionnairedatatoomopcdmafeasibilitystudyusingsgt2dmdataset
AT limsuchi transformationofelectronichealthrecordsandquestionnairedatatoomopcdmafeasibilitystudyusingsgt2dmdataset
AT shaoyiming transformationofelectronichealthrecordsandquestionnairedatatoomopcdmafeasibilitystudyusingsgt2dmdataset
AT taieshyong transformationofelectronichealthrecordsandquestionnairedatatoomopcdmafeasibilitystudyusingsgt2dmdataset
AT fengmengling transformationofelectronichealthrecordsandquestionnairedatatoomopcdmafeasibilitystudyusingsgt2dmdataset