Cargando…
Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset
Background Diabetes mellitus (DM) is an important public health concern in Singapore and places a massive burden on health care spending. Tackling chronic diseases such as DM requires innovative strategies to integrate patients' data from diverse sources and use scientific discovery to inform...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Georg Thieme Verlag KG
2021
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8357458/ https://www.ncbi.nlm.nih.gov/pubmed/34380168 http://dx.doi.org/10.1055/s-0041-1732301 |
_version_ | 1783737131467800576 |
---|---|
author | Sathappan, Selva Muthu Kumaran Jeon, Young Seok Dang, Trung Kien Lim, Su Chi Shao, Yi-Ming Tai, E Shyong Feng, Mengling |
author_facet | Sathappan, Selva Muthu Kumaran Jeon, Young Seok Dang, Trung Kien Lim, Su Chi Shao, Yi-Ming Tai, E Shyong Feng, Mengling |
author_sort | Sathappan, Selva Muthu Kumaran |
collection | PubMed |
description | Background Diabetes mellitus (DM) is an important public health concern in Singapore and places a massive burden on health care spending. Tackling chronic diseases such as DM requires innovative strategies to integrate patients' data from diverse sources and use scientific discovery to inform clinical practice that can help better manage the disease. The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) was chosen as the framework for integrating data with disparate formats. Objective The study aimed to evaluate the feasibility of converting Singapore based data source, comprising of electronic health records (EHR), cognitive and depression assessment questionnaire data to OMOP CDM standard. Additionally, we also validate whether our OMOP CDM instance is fit for the purpose of research by executing a simple treatment pathways study using Atlas, a graphical user interface tool to conduct analysis on OMOP CDM data as a proof of concept. Methods We used de-identified EHR, cognitive, and depression assessment questionnaires data from a tertiary care hospital in Singapore to convert it to version 5.3.1 of OMOP CDM standard. We evaluate the OMOP CDM conversion by (1) assessing the mapping coverage (that is the percentage of source terms mapped to OMOP CDM standard); (2) local raw dataset versus CDM dataset analysis; and (3) Implementing Harmonized Intrinsic Data Quality Framework using an open-source R package called Data Quality Dashboard. Results The content coverage of OMOP CDM vocabularies is more than 90% for clinical data, but only around 11% for questionnaire data. The comparison of characteristics between source and target data returned consistent results and our transformed data did not pass 38 (1.4%) out of 2,622 quality checks. Conclusion Adoption of OMOP CDM at our site demonstrated that EHR data are feasible for standardization with minimal information loss, whereas challenges remain for standardizing cognitive and depression assessment questionnaire data that requires further work. |
format | Online Article Text |
id | pubmed-8357458 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Georg Thieme Verlag KG |
record_format | MEDLINE/PubMed |
spelling | pubmed-83574582021-08-17 Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset Sathappan, Selva Muthu Kumaran Jeon, Young Seok Dang, Trung Kien Lim, Su Chi Shao, Yi-Ming Tai, E Shyong Feng, Mengling Appl Clin Inform Background Diabetes mellitus (DM) is an important public health concern in Singapore and places a massive burden on health care spending. Tackling chronic diseases such as DM requires innovative strategies to integrate patients' data from diverse sources and use scientific discovery to inform clinical practice that can help better manage the disease. The Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) was chosen as the framework for integrating data with disparate formats. Objective The study aimed to evaluate the feasibility of converting Singapore based data source, comprising of electronic health records (EHR), cognitive and depression assessment questionnaire data to OMOP CDM standard. Additionally, we also validate whether our OMOP CDM instance is fit for the purpose of research by executing a simple treatment pathways study using Atlas, a graphical user interface tool to conduct analysis on OMOP CDM data as a proof of concept. Methods We used de-identified EHR, cognitive, and depression assessment questionnaires data from a tertiary care hospital in Singapore to convert it to version 5.3.1 of OMOP CDM standard. We evaluate the OMOP CDM conversion by (1) assessing the mapping coverage (that is the percentage of source terms mapped to OMOP CDM standard); (2) local raw dataset versus CDM dataset analysis; and (3) Implementing Harmonized Intrinsic Data Quality Framework using an open-source R package called Data Quality Dashboard. Results The content coverage of OMOP CDM vocabularies is more than 90% for clinical data, but only around 11% for questionnaire data. The comparison of characteristics between source and target data returned consistent results and our transformed data did not pass 38 (1.4%) out of 2,622 quality checks. Conclusion Adoption of OMOP CDM at our site demonstrated that EHR data are feasible for standardization with minimal information loss, whereas challenges remain for standardizing cognitive and depression assessment questionnaire data that requires further work. Georg Thieme Verlag KG 2021-08 2021-08-11 /pmc/articles/PMC8357458/ /pubmed/34380168 http://dx.doi.org/10.1055/s-0041-1732301 Text en The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution-NonDerivative-NonCommercial License, permitting copying and reproduction so long as the original work is given appropriate credit. Contents may not be used for commercial purposes, or adapted, remixed, transformed or built upon. ( https://creativecommons.org/licenses/by-nc-nd/4.0/ ) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License, which permits unrestricted reproduction and distribution, for non-commercial purposes only; and use and reproduction, but not distribution, of adapted material for non-commercial purposes only, provided the original work is properly cited. |
spellingShingle | Sathappan, Selva Muthu Kumaran Jeon, Young Seok Dang, Trung Kien Lim, Su Chi Shao, Yi-Ming Tai, E Shyong Feng, Mengling Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset |
title | Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset |
title_full | Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset |
title_fullStr | Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset |
title_full_unstemmed | Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset |
title_short | Transformation of Electronic Health Records and Questionnaire Data to OMOP CDM: A Feasibility Study Using SG_T2DM Dataset |
title_sort | transformation of electronic health records and questionnaire data to omop cdm: a feasibility study using sg_t2dm dataset |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8357458/ https://www.ncbi.nlm.nih.gov/pubmed/34380168 http://dx.doi.org/10.1055/s-0041-1732301 |
work_keys_str_mv | AT sathappanselvamuthukumaran transformationofelectronichealthrecordsandquestionnairedatatoomopcdmafeasibilitystudyusingsgt2dmdataset AT jeonyoungseok transformationofelectronichealthrecordsandquestionnairedatatoomopcdmafeasibilitystudyusingsgt2dmdataset AT dangtrungkien transformationofelectronichealthrecordsandquestionnairedatatoomopcdmafeasibilitystudyusingsgt2dmdataset AT limsuchi transformationofelectronichealthrecordsandquestionnairedatatoomopcdmafeasibilitystudyusingsgt2dmdataset AT shaoyiming transformationofelectronichealthrecordsandquestionnairedatatoomopcdmafeasibilitystudyusingsgt2dmdataset AT taieshyong transformationofelectronichealthrecordsandquestionnairedatatoomopcdmafeasibilitystudyusingsgt2dmdataset AT fengmengling transformationofelectronichealthrecordsandquestionnairedatatoomopcdmafeasibilitystudyusingsgt2dmdataset |