A Framework for Classification of Electronic Health Data Extraction-Transformation-Loading Challenges in Data Network Participation
BACKGROUND: Contributing health data to national, regional, and local networks or registries requires data stored in local systems with local structures and codes to be extracted, transformed, and loaded into a standard format called a Common Data Model (CDM). These processes called Extract, Transfo...
Main authors: | Ong, Toan; Pradhananga, Rosina; Holve, Erin; Kahn, Michael G. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Ubiquity Press 2017 |
Subjects: | Research |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5994935/ https://www.ncbi.nlm.nih.gov/pubmed/29930958 http://dx.doi.org/10.5334/egems.222 |
_version_ | 1783330532156768256 |
---|---|
author | Ong, Toan Pradhananga, Rosina Holve, Erin Kahn, Michael G. |
author_facet | Ong, Toan Pradhananga, Rosina Holve, Erin Kahn, Michael G. |
author_sort | Ong, Toan |
collection | PubMed |
description | BACKGROUND: Contributing health data to national, regional, and local networks or registries requires data stored in local systems with local structures and codes to be extracted, transformed, and loaded into a standard format called a Common Data Model (CDM). These processes, called Extract, Transform, Load (ETL), require data partners or contributors to invest in costly technical resources with specialized skills in data models, terminologies, and programming. Given the wide range of tasks, skills, and technologies required to transform data into a CDM, a classification of ETL challenges can help identify needed resources, which in turn may encourage data partners with less-technical capabilities to participate in data-sharing networks. METHODS: We conducted key-informant interviews with data partner representatives to survey the ETL challenges faced in clinical data research networks (CDRNs) and registries. A list of ETL challenges, organized into six themes, was vetted during a one-day workshop with a wide range of network stakeholders, including data partners, researchers, and policy experts. RESULTS: We identified 24 technical ETL challenges related to the data-sharing process. All of these ETL challenges were rated as “important” or “very important” by workshop participants using a five-point Likert scale. Based on these findings, a framework for categorizing ETL challenges according to ETL phases, themes, and levels of data network participation was developed. CONCLUSIONS: Overcoming ETL technical challenges requires significant investments in a broad array of information technologies and human resources. Identifying these technical obstacles can inform optimal resource allocation to minimize the barriers and cost of entry for new data partners into extant networks, which in turn can expand data networks’ inclusiveness and diversity. This paper offers pertinent information and a guiding framework to help data partners ascertain the challenges associated with contributing data to data networks. |
format | Online Article Text |
id | pubmed-5994935 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Ubiquity Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-59949352018-06-21 A Framework for Classification of Electronic Health Data Extraction-Transformation-Loading Challenges in Data Network Participation Ong, Toan Pradhananga, Rosina Holve, Erin Kahn, Michael G. EGEMS (Wash DC) Research BACKGROUND: Contributing health data to national, regional, and local networks or registries requires data stored in local systems with local structures and codes to be extracted, transformed, and loaded into a standard format called a Common Data Model (CDM). These processes, called Extract, Transform, Load (ETL), require data partners or contributors to invest in costly technical resources with specialized skills in data models, terminologies, and programming. Given the wide range of tasks, skills, and technologies required to transform data into a CDM, a classification of ETL challenges can help identify needed resources, which in turn may encourage data partners with less-technical capabilities to participate in data-sharing networks. METHODS: We conducted key-informant interviews with data partner representatives to survey the ETL challenges faced in clinical data research networks (CDRNs) and registries. A list of ETL challenges, organized into six themes, was vetted during a one-day workshop with a wide range of network stakeholders, including data partners, researchers, and policy experts. RESULTS: We identified 24 technical ETL challenges related to the data-sharing process. All of these ETL challenges were rated as “important” or “very important” by workshop participants using a five-point Likert scale. Based on these findings, a framework for categorizing ETL challenges according to ETL phases, themes, and levels of data network participation was developed. CONCLUSIONS: Overcoming ETL technical challenges requires significant investments in a broad array of information technologies and human resources. Identifying these technical obstacles can inform optimal resource allocation to minimize the barriers and cost of entry for new data partners into extant networks, which in turn can expand data networks’ inclusiveness and diversity. This paper offers pertinent information and a guiding framework to help data partners ascertain the challenges associated with contributing data to data networks. Ubiquity Press 2017-06-13 /pmc/articles/PMC5994935/ /pubmed/29930958 http://dx.doi.org/10.5334/egems.222 Text en Copyright: © 2018 The Author(s) https://creativecommons.org/licenses/by-nc-nd/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported (CC BY-NC-ND 3.0), which permits unrestricted use and distribution, for non-commercial purposes, as long as the original material has not been modified, and provided the original author and source are credited. See https://creativecommons.org/licenses/by-nc-nd/3.0/. |
spellingShingle | Research Ong, Toan Pradhananga, Rosina Holve, Erin Kahn, Michael G. A Framework for Classification of Electronic Health Data Extraction-Transformation-Loading Challenges in Data Network Participation |
title | A Framework for Classification of Electronic Health Data Extraction-Transformation-Loading Challenges in Data Network Participation |
title_full | A Framework for Classification of Electronic Health Data Extraction-Transformation-Loading Challenges in Data Network Participation |
title_fullStr | A Framework for Classification of Electronic Health Data Extraction-Transformation-Loading Challenges in Data Network Participation |
title_full_unstemmed | A Framework for Classification of Electronic Health Data Extraction-Transformation-Loading Challenges in Data Network Participation |
title_short | A Framework for Classification of Electronic Health Data Extraction-Transformation-Loading Challenges in Data Network Participation |
title_sort | framework for classification of electronic health data extraction-transformation-loading challenges in data network participation |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5994935/ https://www.ncbi.nlm.nih.gov/pubmed/29930958 http://dx.doi.org/10.5334/egems.222 |