Cargando…

The research data management platform (RDMP): A novel, process driven, open-source tool for the management of longitudinal cohorts of clinical data

BACKGROUND: The Health Informatics Centre at the University of Dundee provides a service to securely host clinical datasets and extract relevant data for anonymized cohorts to researchers to enable them to answer key research questions. As is common in research using routine healthcare data, the ser...

Descripción completa

Detalles Bibliográficos
Autores principales: Nind, Thomas, Galloway, James, McAllister, Gordon, Scobbie, Donald, Bonney, Wilfred, Hall, Christopher, Tramma, Leandro, Reel, Parminder, Groves, Martin, Appleby, Philip, Doney, Alex, Guthrie, Bruce, Jefferson, Emily
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6041881/
https://www.ncbi.nlm.nih.gov/pubmed/29790950
http://dx.doi.org/10.1093/gigascience/giy060
_version_ 1783339062487154688
author Nind, Thomas
Galloway, James
McAllister, Gordon
Scobbie, Donald
Bonney, Wilfred
Hall, Christopher
Tramma, Leandro
Reel, Parminder
Groves, Martin
Appleby, Philip
Doney, Alex
Guthrie, Bruce
Jefferson, Emily
author_facet Nind, Thomas
Galloway, James
McAllister, Gordon
Scobbie, Donald
Bonney, Wilfred
Hall, Christopher
Tramma, Leandro
Reel, Parminder
Groves, Martin
Appleby, Philip
Doney, Alex
Guthrie, Bruce
Jefferson, Emily
author_sort Nind, Thomas
collection PubMed
description BACKGROUND: The Health Informatics Centre at the University of Dundee provides a service to securely host clinical datasets and extract relevant data for anonymized cohorts to researchers to enable them to answer key research questions. As is common in research using routine healthcare data, the service was historically delivered using ad-hoc processes resulting in the slow provision of data whose provenance was often hidden to the researchers using it. This paper describes the development and evaluation of the Research Data Management Platform (RDMP): an open source tool to load, manage, clean, and curate longitudinal healthcare data for research and provide reproducible and updateable datasets for defined cohorts to researchers. RESULTS: Between 2013 and 2017, RDMP tool implementation tripled the productivity of data analysts producing data releases for researchers from 7.1 to 25.3 per month and reduced the error rate from 12.7% to 3.1%. The effort on data management reduced from a mean of 24.6 to 3.0 hours per data release. The waiting time for researchers to receive data after agreeing a specification reduced from approximately 6 months to less than 1 week. The software is scalable and currently manages 163 datasets. A total 1,321 data extracts for research have been produced, with the largest extract linking data from 70 different datasets. CONCLUSIONS: The tools and processes that encompass the RDMP not only fulfil the research data management requirements of researchers but also support the seamless collaboration of data cleaning, data transformation, data summarization and data quality assessment activities by different research groups.
format Online
Article
Text
id pubmed-6041881
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-60418812018-07-17 The research data management platform (RDMP): A novel, process driven, open-source tool for the management of longitudinal cohorts of clinical data Nind, Thomas Galloway, James McAllister, Gordon Scobbie, Donald Bonney, Wilfred Hall, Christopher Tramma, Leandro Reel, Parminder Groves, Martin Appleby, Philip Doney, Alex Guthrie, Bruce Jefferson, Emily Gigascience Technical Note BACKGROUND: The Health Informatics Centre at the University of Dundee provides a service to securely host clinical datasets and extract relevant data for anonymized cohorts to researchers to enable them to answer key research questions. As is common in research using routine healthcare data, the service was historically delivered using ad-hoc processes resulting in the slow provision of data whose provenance was often hidden to the researchers using it. This paper describes the development and evaluation of the Research Data Management Platform (RDMP): an open source tool to load, manage, clean, and curate longitudinal healthcare data for research and provide reproducible and updateable datasets for defined cohorts to researchers. RESULTS: Between 2013 and 2017, RDMP tool implementation tripled the productivity of data analysts producing data releases for researchers from 7.1 to 25.3 per month and reduced the error rate from 12.7% to 3.1%. The effort on data management reduced from a mean of 24.6 to 3.0 hours per data release. The waiting time for researchers to receive data after agreeing a specification reduced from approximately 6 months to less than 1 week. The software is scalable and currently manages 163 datasets. A total 1,321 data extracts for research have been produced, with the largest extract linking data from 70 different datasets. CONCLUSIONS: The tools and processes that encompass the RDMP not only fulfil the research data management requirements of researchers but also support the seamless collaboration of data cleaning, data transformation, data summarization and data quality assessment activities by different research groups. Oxford University Press 2018-05-22 /pmc/articles/PMC6041881/ /pubmed/29790950 http://dx.doi.org/10.1093/gigascience/giy060 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Nind, Thomas
Galloway, James
McAllister, Gordon
Scobbie, Donald
Bonney, Wilfred
Hall, Christopher
Tramma, Leandro
Reel, Parminder
Groves, Martin
Appleby, Philip
Doney, Alex
Guthrie, Bruce
Jefferson, Emily
The research data management platform (RDMP): A novel, process driven, open-source tool for the management of longitudinal cohorts of clinical data
title The research data management platform (RDMP): A novel, process driven, open-source tool for the management of longitudinal cohorts of clinical data
title_full The research data management platform (RDMP): A novel, process driven, open-source tool for the management of longitudinal cohorts of clinical data
title_fullStr The research data management platform (RDMP): A novel, process driven, open-source tool for the management of longitudinal cohorts of clinical data
title_full_unstemmed The research data management platform (RDMP): A novel, process driven, open-source tool for the management of longitudinal cohorts of clinical data
title_short The research data management platform (RDMP): A novel, process driven, open-source tool for the management of longitudinal cohorts of clinical data
title_sort research data management platform (rdmp): a novel, process driven, open-source tool for the management of longitudinal cohorts of clinical data
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6041881/
https://www.ncbi.nlm.nih.gov/pubmed/29790950
http://dx.doi.org/10.1093/gigascience/giy060
work_keys_str_mv AT nindthomas theresearchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT gallowayjames theresearchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT mcallistergordon theresearchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT scobbiedonald theresearchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT bonneywilfred theresearchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT hallchristopher theresearchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT trammaleandro theresearchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT reelparminder theresearchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT grovesmartin theresearchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT applebyphilip theresearchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT doneyalex theresearchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT guthriebruce theresearchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT jeffersonemily theresearchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT nindthomas researchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT gallowayjames researchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT mcallistergordon researchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT scobbiedonald researchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT bonneywilfred researchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT hallchristopher researchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT trammaleandro researchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT reelparminder researchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT grovesmartin researchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT applebyphilip researchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT doneyalex researchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT guthriebruce researchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata
AT jeffersonemily researchdatamanagementplatformrdmpanovelprocessdrivenopensourcetoolforthemanagementoflongitudinalcohortsofclinicaldata