Cargando…

Bio-Swarm-Pipeline: A Light-Weight, Extensible Batch Processing System for Efficient Biomedical Data Processing

A streamlined scientific workflow system that can track the details of the data processing history is critical for the efficient handling of fundamental routines used in scientific research. In the scientific workflow research community, the information that describes the details of data processing...

Descripción completa

Detalles Bibliográficos
Autores principales: Cheng, Xi, Pizarro, Ricardo, Tong, Yunxia, Zoltick, Brad, Luo, Qian, Weinberger, Daniel R., Mattay, Venkata S.
Formato: Texto
Lenguaje:English
Publicado: Frontiers Research Foundation 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2763889/
https://www.ncbi.nlm.nih.gov/pubmed/19847314
http://dx.doi.org/10.3389/neuro.11.035.2009
_version_ 1782173050492420096
author Cheng, Xi
Pizarro, Ricardo
Tong, Yunxia
Zoltick, Brad
Luo, Qian
Weinberger, Daniel R.
Mattay, Venkata S.
author_facet Cheng, Xi
Pizarro, Ricardo
Tong, Yunxia
Zoltick, Brad
Luo, Qian
Weinberger, Daniel R.
Mattay, Venkata S.
author_sort Cheng, Xi
collection PubMed
description A streamlined scientific workflow system that can track the details of the data processing history is critical for the efficient handling of fundamental routines used in scientific research. In the scientific workflow research community, the information that describes the details of data processing history is referred to as “provenance” which plays an important role in most of the existing workflow management systems. Despite its importance, however, provenance modeling and management is still a relatively new area in the scientific workflow research community. The proper scope, representation, granularity and implementation of a provenance model can vary from domain to domain and pose a number of challenges for an efficient pipeline design. This paper provides a case study on structured provenance modeling and management problems in the neuroimaging domain by introducing the Bio-Swarm-Pipeline. This new model, which is evaluated in the paper through real world scenarios, systematically addresses the provenance scope, representation, granularity, and implementation issues related to the neuroimaging domain. Although this model stems from applications in neuroimaging, the system can potentially be adapted to a wide range of bio-medical application scenarios.
format Text
id pubmed-2763889
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Frontiers Research Foundation
record_format MEDLINE/PubMed
spelling pubmed-27638892009-10-21 Bio-Swarm-Pipeline: A Light-Weight, Extensible Batch Processing System for Efficient Biomedical Data Processing Cheng, Xi Pizarro, Ricardo Tong, Yunxia Zoltick, Brad Luo, Qian Weinberger, Daniel R. Mattay, Venkata S. Front Neuroinformatics Neuroscience A streamlined scientific workflow system that can track the details of the data processing history is critical for the efficient handling of fundamental routines used in scientific research. In the scientific workflow research community, the information that describes the details of data processing history is referred to as “provenance” which plays an important role in most of the existing workflow management systems. Despite its importance, however, provenance modeling and management is still a relatively new area in the scientific workflow research community. The proper scope, representation, granularity and implementation of a provenance model can vary from domain to domain and pose a number of challenges for an efficient pipeline design. This paper provides a case study on structured provenance modeling and management problems in the neuroimaging domain by introducing the Bio-Swarm-Pipeline. This new model, which is evaluated in the paper through real world scenarios, systematically addresses the provenance scope, representation, granularity, and implementation issues related to the neuroimaging domain. Although this model stems from applications in neuroimaging, the system can potentially be adapted to a wide range of bio-medical application scenarios. Frontiers Research Foundation 2009-10-09 /pmc/articles/PMC2763889/ /pubmed/19847314 http://dx.doi.org/10.3389/neuro.11.035.2009 Text en Copyright © 2009 Cheng, Pizarro, Tong, Zoltick, Luo, Weinberger and Mattay. http://www.frontiersin.org/licenseagreement This is an open-access article subject to an exclusive license agreement between the authors and the Frontiers Research Foundation, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are credited.
spellingShingle Neuroscience
Cheng, Xi
Pizarro, Ricardo
Tong, Yunxia
Zoltick, Brad
Luo, Qian
Weinberger, Daniel R.
Mattay, Venkata S.
Bio-Swarm-Pipeline: A Light-Weight, Extensible Batch Processing System for Efficient Biomedical Data Processing
title Bio-Swarm-Pipeline: A Light-Weight, Extensible Batch Processing System for Efficient Biomedical Data Processing
title_full Bio-Swarm-Pipeline: A Light-Weight, Extensible Batch Processing System for Efficient Biomedical Data Processing
title_fullStr Bio-Swarm-Pipeline: A Light-Weight, Extensible Batch Processing System for Efficient Biomedical Data Processing
title_full_unstemmed Bio-Swarm-Pipeline: A Light-Weight, Extensible Batch Processing System for Efficient Biomedical Data Processing
title_short Bio-Swarm-Pipeline: A Light-Weight, Extensible Batch Processing System for Efficient Biomedical Data Processing
title_sort bio-swarm-pipeline: a light-weight, extensible batch processing system for efficient biomedical data processing
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2763889/
https://www.ncbi.nlm.nih.gov/pubmed/19847314
http://dx.doi.org/10.3389/neuro.11.035.2009
work_keys_str_mv AT chengxi bioswarmpipelinealightweightextensiblebatchprocessingsystemforefficientbiomedicaldataprocessing
AT pizarroricardo bioswarmpipelinealightweightextensiblebatchprocessingsystemforefficientbiomedicaldataprocessing
AT tongyunxia bioswarmpipelinealightweightextensiblebatchprocessingsystemforefficientbiomedicaldataprocessing
AT zoltickbrad bioswarmpipelinealightweightextensiblebatchprocessingsystemforefficientbiomedicaldataprocessing
AT luoqian bioswarmpipelinealightweightextensiblebatchprocessingsystemforefficientbiomedicaldataprocessing
AT weinbergerdanielr bioswarmpipelinealightweightextensiblebatchprocessingsystemforefficientbiomedicaldataprocessing
AT mattayvenkatas bioswarmpipelinealightweightextensiblebatchprocessingsystemforefficientbiomedicaldataprocessing