A Framework for Classification of Electronic Health Data Extraction-Transformation-Loading Challenges in Data Network Participation

Bibliographic Details
Main authors: Ong, Toan; Pradhananga, Rosina; Holve, Erin; Kahn, Michael G.
Format: Online Article Text
Language: English
Published: Ubiquity Press 2017
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5994935/
https://www.ncbi.nlm.nih.gov/pubmed/29930958
http://dx.doi.org/10.5334/egems.222
author Ong, Toan
Pradhananga, Rosina
Holve, Erin
Kahn, Michael G.
collection PubMed
description BACKGROUND: Contributing health data to national, regional, and local networks or registries requires data stored in local systems with local structures and codes to be extracted, transformed, and loaded into a standard format called a Common Data Model (CDM). These processes, called Extract, Transform, Load (ETL), require data partners or contributors to invest in costly technical resources with specialized skills in data models, terminologies, and programming. Given the wide range of tasks, skills, and technologies required to transform data into a CDM, a classification of ETL challenges can help identify needed resources, which in turn may encourage data partners with fewer technical capabilities to participate in data-sharing networks. METHODS: We conducted key-informant interviews with data partner representatives to survey the ETL challenges faced in clinical data research networks (CDRNs) and registries. A list of ETL challenges, organized into six themes, was vetted during a one-day workshop with a wide range of network stakeholders, including data partners, researchers, and policy experts. RESULTS: We identified 24 technical ETL challenges related to the data-sharing process. All of these ETL challenges were rated as “important” or “very important” by workshop participants using a five-point Likert scale. Based on these findings, a framework for categorizing ETL challenges according to ETL phases, themes, and levels of data network participation was developed. CONCLUSIONS: Overcoming ETL technical challenges requires significant investments in a broad array of information technologies and human resources. Identifying these technical obstacles can inform optimal resource allocation to minimize the barriers and cost of entry for new data partners into extant networks, which in turn can expand data networks’ inclusiveness and diversity. This paper offers pertinent information and a guiding framework that are relevant for data partners in ascertaining the challenges associated with contributing data to data networks.
format Online
Article
Text
id pubmed-5994935
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Ubiquity Press
record_format MEDLINE/PubMed
spelling pubmed-5994935 2018-06-21 A Framework for Classification of Electronic Health Data Extraction-Transformation-Loading Challenges in Data Network Participation Ong, Toan; Pradhananga, Rosina; Holve, Erin; Kahn, Michael G. EGEMS (Wash DC) Research Ubiquity Press 2017-06-13 /pmc/articles/PMC5994935/ /pubmed/29930958 http://dx.doi.org/10.5334/egems.222 Text en Copyright: © 2018 The Author(s) https://creativecommons.org/licenses/by-nc-nd/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported (CC BY-NC-ND 3.0), which permits unrestricted use and distribution, for non-commercial purposes, as long as the original material has not been modified, and provided the original author and source are credited. See https://creativecommons.org/licenses/by-nc-nd/3.0/.
title A Framework for Classification of Electronic Health Data Extraction-Transformation-Loading Challenges in Data Network Participation
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5994935/
https://www.ncbi.nlm.nih.gov/pubmed/29930958
http://dx.doi.org/10.5334/egems.222