Cargando…

Embedding data provenance into the Learning Health System to facilitate reproducible research

INTRODUCTION: The learning health system (LHS) community has taken up the challenge of bringing the complex relationship between clinical research and practice into this brave new world. At the heart of the LHS vision is the notion of routine capture, transformation, and dissemination of data and kn...

Descripción completa

Detalles Bibliográficos
Autor principal: Curcin, Vasa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Ltd 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6516719/
https://www.ncbi.nlm.nih.gov/pubmed/31245557
http://dx.doi.org/10.1002/lrh2.10019
_version_ 1783418233305432064
author Curcin, Vasa
author_facet Curcin, Vasa
author_sort Curcin, Vasa
collection PubMed
description INTRODUCTION: The learning health system (LHS) community has taken up the challenge of bringing the complex relationship between clinical research and practice into this brave new world. At the heart of the LHS vision is the notion of routine capture, transformation, and dissemination of data and knowledge, with various use cases, such as clinical studies, quality improvement initiatives, and decision support, constructed on top of specific routes that the data is taking through the system. In order to stop this increased data volume and analytical complexity from obfuscating the research process, it is essential to establish trust in the system through implementing reproducibility and auditability throughout the workflow. METHODS: Data provenance technologies can automatically capture the trace of the research task and resulting data, thereby facilitating reproducible research. While some computational domains, such as bioinformatics, have embraced the technology through provenance‐enabled execution middlewares, disciplines based on distributed, heterogeneous software, such as medical research, are only starting on the road to adoption, motivated by the institutional pressures to improve transparency and reproducibility. RESULTS: Guided by the experiences of the TRANSFoRm project, we present the opportunities that data provenance offers to the LHS community. We illustrate how provenance can facilitate documenting 21 CFR Part 11 compliance for Food and Drug Administration submissions and provide auditability for decisions made by the decision support tools and discuss the transformational effect of routine provenance capture on data privacy, study reporting, and publishing medical research. CONCLUSIONS: If the scaling up of the LHS is to succeed, we have to embed mechanisms to verify trust in the system inside our research instruments. In the research world increasingly reliant on electronic tools, provenance gives us a lingua franca to achieve traceability, which we have shown to be essential to building these mechanisms. To realize the vision of making computable provenance a feasible approach to implementing reproducibility in the LHS, we have to provide viable mechanisms for adoption. These include defining meaningful provenance models for problem domains and also introducing provenance support to existing tools in a minimally invasive manner.
format Online
Article
Text
id pubmed-6516719
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher John Wiley & Sons, Ltd
record_format MEDLINE/PubMed
spelling pubmed-65167192019-06-26 Embedding data provenance into the Learning Health System to facilitate reproducible research Curcin, Vasa Learn Health Syst Technical Report INTRODUCTION: The learning health system (LHS) community has taken up the challenge of bringing the complex relationship between clinical research and practice into this brave new world. At the heart of the LHS vision is the notion of routine capture, transformation, and dissemination of data and knowledge, with various use cases, such as clinical studies, quality improvement initiatives, and decision support, constructed on top of specific routes that the data is taking through the system. In order to stop this increased data volume and analytical complexity from obfuscating the research process, it is essential to establish trust in the system through implementing reproducibility and auditability throughout the workflow. METHODS: Data provenance technologies can automatically capture the trace of the research task and resulting data, thereby facilitating reproducible research. While some computational domains, such as bioinformatics, have embraced the technology through provenance‐enabled execution middlewares, disciplines based on distributed, heterogeneous software, such as medical research, are only starting on the road to adoption, motivated by the institutional pressures to improve transparency and reproducibility. RESULTS: Guided by the experiences of the TRANSFoRm project, we present the opportunities that data provenance offers to the LHS community. We illustrate how provenance can facilitate documenting 21 CFR Part 11 compliance for Food and Drug Administration submissions and provide auditability for decisions made by the decision support tools and discuss the transformational effect of routine provenance capture on data privacy, study reporting, and publishing medical research. CONCLUSIONS: If the scaling up of the LHS is to succeed, we have to embed mechanisms to verify trust in the system inside our research instruments. In the research world increasingly reliant on electronic tools, provenance gives us a lingua franca to achieve traceability, which we have shown to be essential to building these mechanisms. To realize the vision of making computable provenance a feasible approach to implementing reproducibility in the LHS, we have to provide viable mechanisms for adoption. These include defining meaningful provenance models for problem domains and also introducing provenance support to existing tools in a minimally invasive manner. John Wiley & Sons, Ltd 2016-12-27 /pmc/articles/PMC6516719/ /pubmed/31245557 http://dx.doi.org/10.1002/lrh2.10019 Text en © 2016 The Authors. Learning Health Systems published by Wiley Periodicals, Inc. on behalf of the University of Michigan This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Report
Curcin, Vasa
Embedding data provenance into the Learning Health System to facilitate reproducible research
title Embedding data provenance into the Learning Health System to facilitate reproducible research
title_full Embedding data provenance into the Learning Health System to facilitate reproducible research
title_fullStr Embedding data provenance into the Learning Health System to facilitate reproducible research
title_full_unstemmed Embedding data provenance into the Learning Health System to facilitate reproducible research
title_short Embedding data provenance into the Learning Health System to facilitate reproducible research
title_sort embedding data provenance into the learning health system to facilitate reproducible research
topic Technical Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6516719/
https://www.ncbi.nlm.nih.gov/pubmed/31245557
http://dx.doi.org/10.1002/lrh2.10019
work_keys_str_mv AT curcinvasa embeddingdataprovenanceintothelearninghealthsystemtofacilitatereproducibleresearch