Cargando…
Structuring research methods and data with the research object model: genomics workflows as a case study
BACKGROUND: One of the main challenges for biomedical research lies in the computer-assisted integrative study of large and increasingly complex combinations of data in order to understand molecular mechanisms. The preservation of the materials and methods of such computational experiments with clea...
Autores principales: | , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4177597/ https://www.ncbi.nlm.nih.gov/pubmed/25276335 http://dx.doi.org/10.1186/2041-1480-5-41 |
_version_ | 1782336792359337984 |
---|---|
author | Hettne, Kristina M Dharuri, Harish Zhao, Jun Wolstencroft, Katherine Belhajjame, Khalid Soiland-Reyes, Stian Mina, Eleni Thompson, Mark Cruickshank, Don Verdes-Montenegro, Lourdes Garrido, Julian de Roure, David Corcho, Oscar Klyne, Graham van Schouwen, Reinout ‘t Hoen, Peter A C Bechhofer, Sean Goble, Carole Roos, Marco |
author_facet | Hettne, Kristina M Dharuri, Harish Zhao, Jun Wolstencroft, Katherine Belhajjame, Khalid Soiland-Reyes, Stian Mina, Eleni Thompson, Mark Cruickshank, Don Verdes-Montenegro, Lourdes Garrido, Julian de Roure, David Corcho, Oscar Klyne, Graham van Schouwen, Reinout ‘t Hoen, Peter A C Bechhofer, Sean Goble, Carole Roos, Marco |
author_sort | Hettne, Kristina M |
collection | PubMed |
description | BACKGROUND: One of the main challenges for biomedical research lies in the computer-assisted integrative study of large and increasingly complex combinations of data in order to understand molecular mechanisms. The preservation of the materials and methods of such computational experiments with clear annotations is essential for understanding an experiment, and this is increasingly recognized in the bioinformatics community. Our assumption is that offering means of digital, structured aggregation and annotation of the objects of an experiment will provide necessary meta-data for a scientist to understand and recreate the results of an experiment. To support this we explored a model for the semantic description of a workflow-centric Research Object (RO), where an RO is defined as a resource that aggregates other resources, e.g., datasets, software, spreadsheets, text, etc. We applied this model to a case study where we analysed human metabolite variation by workflows. RESULTS: We present the application of the workflow-centric RO model for our bioinformatics case study. Three workflows were produced following recently defined Best Practices for workflow design. By modelling the experiment as an RO, we were able to automatically query the experiment and answer questions such as “which particular data was input to a particular workflow to test a particular hypothesis?”, and “which particular conclusions were drawn from a particular workflow?”. CONCLUSIONS: Applying a workflow-centric RO model to aggregate and annotate the resources used in a bioinformatics experiment, allowed us to retrieve the conclusions of the experiment in the context of the driving hypothesis, the executed workflows and their input data. The RO model is an extendable reference model that can be used by other systems as well. AVAILABILITY: The Research Object is available at http://www.myexperiment.org/packs/428 The Wf4Ever Research Object Model is available at http://wf4ever.github.io/ro ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/2041-1480-5-41) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4177597 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41775972014-09-29 Structuring research methods and data with the research object model: genomics workflows as a case study Hettne, Kristina M Dharuri, Harish Zhao, Jun Wolstencroft, Katherine Belhajjame, Khalid Soiland-Reyes, Stian Mina, Eleni Thompson, Mark Cruickshank, Don Verdes-Montenegro, Lourdes Garrido, Julian de Roure, David Corcho, Oscar Klyne, Graham van Schouwen, Reinout ‘t Hoen, Peter A C Bechhofer, Sean Goble, Carole Roos, Marco J Biomed Semantics Research BACKGROUND: One of the main challenges for biomedical research lies in the computer-assisted integrative study of large and increasingly complex combinations of data in order to understand molecular mechanisms. The preservation of the materials and methods of such computational experiments with clear annotations is essential for understanding an experiment, and this is increasingly recognized in the bioinformatics community. Our assumption is that offering means of digital, structured aggregation and annotation of the objects of an experiment will provide necessary meta-data for a scientist to understand and recreate the results of an experiment. To support this we explored a model for the semantic description of a workflow-centric Research Object (RO), where an RO is defined as a resource that aggregates other resources, e.g., datasets, software, spreadsheets, text, etc. We applied this model to a case study where we analysed human metabolite variation by workflows. RESULTS: We present the application of the workflow-centric RO model for our bioinformatics case study. Three workflows were produced following recently defined Best Practices for workflow design. By modelling the experiment as an RO, we were able to automatically query the experiment and answer questions such as “which particular data was input to a particular workflow to test a particular hypothesis?”, and “which particular conclusions were drawn from a particular workflow?”. CONCLUSIONS: Applying a workflow-centric RO model to aggregate and annotate the resources used in a bioinformatics experiment, allowed us to retrieve the conclusions of the experiment in the context of the driving hypothesis, the executed workflows and their input data. The RO model is an extendable reference model that can be used by other systems as well. AVAILABILITY: The Research Object is available at http://www.myexperiment.org/packs/428 The Wf4Ever Research Object Model is available at http://wf4ever.github.io/ro ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/2041-1480-5-41) contains supplementary material, which is available to authorized users. BioMed Central 2014-09-18 /pmc/articles/PMC4177597/ /pubmed/25276335 http://dx.doi.org/10.1186/2041-1480-5-41 Text en © Hettne et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. |
spellingShingle | Research Hettne, Kristina M Dharuri, Harish Zhao, Jun Wolstencroft, Katherine Belhajjame, Khalid Soiland-Reyes, Stian Mina, Eleni Thompson, Mark Cruickshank, Don Verdes-Montenegro, Lourdes Garrido, Julian de Roure, David Corcho, Oscar Klyne, Graham van Schouwen, Reinout ‘t Hoen, Peter A C Bechhofer, Sean Goble, Carole Roos, Marco Structuring research methods and data with the research object model: genomics workflows as a case study |
title | Structuring research methods and data with the research object model: genomics workflows as a case study |
title_full | Structuring research methods and data with the research object model: genomics workflows as a case study |
title_fullStr | Structuring research methods and data with the research object model: genomics workflows as a case study |
title_full_unstemmed | Structuring research methods and data with the research object model: genomics workflows as a case study |
title_short | Structuring research methods and data with the research object model: genomics workflows as a case study |
title_sort | structuring research methods and data with the research object model: genomics workflows as a case study |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4177597/ https://www.ncbi.nlm.nih.gov/pubmed/25276335 http://dx.doi.org/10.1186/2041-1480-5-41 |
work_keys_str_mv | AT hettnekristinam structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT dharuriharish structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT zhaojun structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT wolstencroftkatherine structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT belhajjamekhalid structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT soilandreyesstian structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT minaeleni structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT thompsonmark structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT cruickshankdon structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT verdesmontenegrolourdes structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT garridojulian structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT derouredavid structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT corchooscar structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT klynegraham structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT vanschouwenreinout structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT thoenpeterac structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT bechhofersean structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT goblecarole structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy AT roosmarco structuringresearchmethodsanddatawiththeresearchobjectmodelgenomicsworkflowsasacasestudy |