
The custodian administered research extract server: “improving the pipeline” in linked data delivery systems

BACKGROUND: At Western Australia’s Data Linkage Branch (DLB) the extraction of linked data has become increasingly complex over the past decade and classical methods of data delivery are unsuited to the larger extractions which have become the norm. The Custodian Administered Research Extract Server...

Bibliographic Details
Main Authors: Eitelhuber, Tom, Davis, Geoff
Format: Online Article Text
Language: English
Published: BioMed Central 2014
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4376516/
https://www.ncbi.nlm.nih.gov/pubmed/25825670
http://dx.doi.org/10.1186/2047-2501-2-6
_version_ 1782363749408047104
author Eitelhuber, Tom
Davis, Geoff
author_facet Eitelhuber, Tom
Davis, Geoff
author_sort Eitelhuber, Tom
collection PubMed
description BACKGROUND: At Western Australia’s Data Linkage Branch (DLB) the extraction of linked data has become increasingly complex over the past decade and classical methods of data delivery are unsuited to the larger extractions which have become the norm. The Custodian Administered Research Extract Server (CARES) is a fast, accurate and predictable approach to linked data extraction. METHODS: The Data Linkage Branch (DLB) creates linkage keys within and between datasets. To comply with the separation principle, these keys are sent to applicable data collection agencies for extraction. Routing requests through multiple channels is inefficient and makes it hard to monitor work and predict delivery times. CARES was developed to address these shortcomings and involved ongoing consultation with the Custodians and staff of collections, plus challenges of hardware, programming, governance and security. RESULTS: The introduction of CARES has reduced the workload burden of linked data extractions, while improving the efficiency, stability and predictability of turnaround times. CONCLUSIONS: As the scope of a linkage system broadens, challenges in data delivery are inevitable. CARES overcomes multiple obstacles with no sacrifice to the integrity, confidentiality or security of data. CARES is a valuable component of linkage infrastructure that is operable at any scale and adaptable to many data environments.
format Online
Article
Text
id pubmed-4376516
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43765162015-03-31 The custodian administered research extract server: “improving the pipeline” in linked data delivery systems Eitelhuber, Tom Davis, Geoff Health Inf Sci Syst Research BACKGROUND: At Western Australia’s Data Linkage Branch (DLB) the extraction of linked data has become increasingly complex over the past decade and classical methods of data delivery are unsuited to the larger extractions which have become the norm. The Custodian Administered Research Extract Server (CARES) is a fast, accurate and predictable approach to linked data extraction. METHODS: The Data Linkage Branch (DLB) creates linkage keys within and between datasets. To comply with the separation principle, these keys are sent to applicable data collection agencies for extraction. Routing requests through multiple channels is inefficient and makes it hard to monitor work and predict delivery times. CARES was developed to address these shortcomings and involved ongoing consultation with the Custodians and staff of collections, plus challenges of hardware, programming, governance and security. RESULTS: The introduction of CARES has reduced the workload burden of linked data extractions, while improving the efficiency, stability and predictability of turnaround times. CONCLUSIONS: As the scope of a linkage system broadens, challenges in data delivery are inevitable. CARES overcomes multiple obstacles with no sacrifice to the integrity, confidentiality or security of data. CARES is a valuable component of linkage infrastructure that is operable at any scale and adaptable to many data environments. BioMed Central 2014-08-18 /pmc/articles/PMC4376516/ /pubmed/25825670 http://dx.doi.org/10.1186/2047-2501-2-6 Text en © Eitelhuber and Davis; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Eitelhuber, Tom
Davis, Geoff
The custodian administered research extract server: “improving the pipeline” in linked data delivery systems
title The custodian administered research extract server: “improving the pipeline” in linked data delivery systems
title_full The custodian administered research extract server: “improving the pipeline” in linked data delivery systems
title_fullStr The custodian administered research extract server: “improving the pipeline” in linked data delivery systems
title_full_unstemmed The custodian administered research extract server: “improving the pipeline” in linked data delivery systems
title_short The custodian administered research extract server: “improving the pipeline” in linked data delivery systems
title_sort custodian administered research extract server: “improving the pipeline” in linked data delivery systems
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4376516/
https://www.ncbi.nlm.nih.gov/pubmed/25825670
http://dx.doi.org/10.1186/2047-2501-2-6
work_keys_str_mv AT eitelhubertom thecustodianadministeredresearchextractserverimprovingthepipelineinlinkeddatadeliverysystems
AT davisgeoff thecustodianadministeredresearchextractserverimprovingthepipelineinlinkeddatadeliverysystems
AT eitelhubertom custodianadministeredresearchextractserverimprovingthepipelineinlinkeddatadeliverysystems
AT davisgeoff custodianadministeredresearchextractserverimprovingthepipelineinlinkeddatadeliverysystems