Cargando…

Embedding Analytics within the Curation of Scientific Workflows

This paper reports on the ongoing activities and curation practices of the National Center for Biomolecular NMR Data Processing and Analysis(). Over the past several years, the Center has been developing and extending computational workflow management software for use by a community of biomolecular...

Descripción completa

Detalles Bibliográficos
Autores principales: Weatherby, Gerard, Gryk, Michael R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7990377/
https://www.ncbi.nlm.nih.gov/pubmed/33767737
http://dx.doi.org/10.2218/ijdc.v15i1.709
_version_ 1783669062185779200
author Weatherby, Gerard
Gryk, Michael R.
author_facet Weatherby, Gerard
Gryk, Michael R.
author_sort Weatherby, Gerard
collection PubMed
description This paper reports on the ongoing activities and curation practices of the National Center for Biomolecular NMR Data Processing and Analysis(). Over the past several years, the Center has been developing and extending computational workflow management software for use by a community of biomolecular NMR spectroscopists. Previous work had been to refactor the workflow system to utilize the PREMIS framework for reporting retrospective provenance as well as for sharing workflows between scientists and to support data reuse. In this paper, we report on our recent efforts to embed analytics within the workflow execution and within provenance tracking. Important metrics for each of the intermediate datasets are included within the corresponding PREMIS intellectual object, which allows for both inspection of the operation of individual actors as well as visualization of the changes throughout a full processing workflow. These metrics can be viewed within the workflow management system or through standalone metadata widgets. Our approach is to support a hybrid approach of both automated, workflow execution as well as manual intervention and metadata management. In this combination, the workflow system and metadata widgets encourage the domain experts to be avid curators of the data which they create, fostering both computational reproducibility and scientific data reuse.
format Online
Article
Text
id pubmed-7990377
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-79903772021-03-24 Embedding Analytics within the Curation of Scientific Workflows Weatherby, Gerard Gryk, Michael R. Int J Digit Curation Article This paper reports on the ongoing activities and curation practices of the National Center for Biomolecular NMR Data Processing and Analysis(). Over the past several years, the Center has been developing and extending computational workflow management software for use by a community of biomolecular NMR spectroscopists. Previous work had been to refactor the workflow system to utilize the PREMIS framework for reporting retrospective provenance as well as for sharing workflows between scientists and to support data reuse. In this paper, we report on our recent efforts to embed analytics within the workflow execution and within provenance tracking. Important metrics for each of the intermediate datasets are included within the corresponding PREMIS intellectual object, which allows for both inspection of the operation of individual actors as well as visualization of the changes throughout a full processing workflow. These metrics can be viewed within the workflow management system or through standalone metadata widgets. Our approach is to support a hybrid approach of both automated, workflow execution as well as manual intervention and metadata management. In this combination, the workflow system and metadata widgets encourage the domain experts to be avid curators of the data which they create, fostering both computational reproducibility and scientific data reuse. 2020 /pmc/articles/PMC7990377/ /pubmed/33767737 http://dx.doi.org/10.2218/ijdc.v15i1.709 Text en This work is released under a Creative Commons Attribution 4.0 International Licence. For details please see http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Weatherby, Gerard
Gryk, Michael R.
Embedding Analytics within the Curation of Scientific Workflows
title Embedding Analytics within the Curation of Scientific Workflows
title_full Embedding Analytics within the Curation of Scientific Workflows
title_fullStr Embedding Analytics within the Curation of Scientific Workflows
title_full_unstemmed Embedding Analytics within the Curation of Scientific Workflows
title_short Embedding Analytics within the Curation of Scientific Workflows
title_sort embedding analytics within the curation of scientific workflows
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7990377/
https://www.ncbi.nlm.nih.gov/pubmed/33767737
http://dx.doi.org/10.2218/ijdc.v15i1.709
work_keys_str_mv AT weatherbygerard embeddinganalyticswithinthecurationofscientificworkflows
AT grykmichaelr embeddinganalyticswithinthecurationofscientificworkflows