Cargando…

BioFlow: a web based workflow management software for design and execution of genomics pipelines

BACKGROUND: Bioinformatics data analysis is usually done sequentially by chaining together multiple tools. These are created by writing scripts and tracking the inputs and outputs of all stages. Writing such scripts require programming skills. Executing multiple pipelines in parallel and keeping tra...

Descripción completa

Detalles Bibliográficos
Autores principales: Garner, Harold, Puthige, Ashwin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4179862/
http://dx.doi.org/10.1186/1751-0473-9-20
_version_ 1782337155277783040
author Garner, Harold
Puthige, Ashwin
author_facet Garner, Harold
Puthige, Ashwin
author_sort Garner, Harold
collection PubMed
description BACKGROUND: Bioinformatics data analysis is usually done sequentially by chaining together multiple tools. These are created by writing scripts and tracking the inputs and outputs of all stages. Writing such scripts require programming skills. Executing multiple pipelines in parallel and keeping track of all the generated files is difficult and error prone. Checking results and task completion requires users to remotely login to their servers and run commands to identify process status. Users would benefit from a web-based tool that allows creation and execution of pipelines remotely. The tool should also keep track of all the files generated and maintain a history of user activities. RESULTS: A software tool for building and executing workflows is described here. The individual tools in the workflows can be any command line executable or script. The software has an intuitive mechanism for adding new tools to be used in workflows. It contains a workflow designer where workflows can be creating by visually connecting various components. Workflows are executed by job runners. The outputs and the job history are saved. The tool is web based software tool and all actions can be performed remotely. CONCLUSIONS: Users without scripting knowledge can utilize the tool to build pipelines for executing tasks. Pipelines can be modeled as workflows that are reusable. BioFlow enables users to easily add new tools to the database. The workflows can be created and executed remotely. A number of parallel jobs can be easily controlled. Distributed execution is possible by running multiple instances of the application. Any number of tasks can be executed and the output will be stored making it is easy to correlate the outputs to the jobs executed.
format Online
Article
Text
id pubmed-4179862
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41798622014-10-01 BioFlow: a web based workflow management software for design and execution of genomics pipelines Garner, Harold Puthige, Ashwin Source Code Biol Med Software Review BACKGROUND: Bioinformatics data analysis is usually done sequentially by chaining together multiple tools. These are created by writing scripts and tracking the inputs and outputs of all stages. Writing such scripts require programming skills. Executing multiple pipelines in parallel and keeping track of all the generated files is difficult and error prone. Checking results and task completion requires users to remotely login to their servers and run commands to identify process status. Users would benefit from a web-based tool that allows creation and execution of pipelines remotely. The tool should also keep track of all the files generated and maintain a history of user activities. RESULTS: A software tool for building and executing workflows is described here. The individual tools in the workflows can be any command line executable or script. The software has an intuitive mechanism for adding new tools to be used in workflows. It contains a workflow designer where workflows can be creating by visually connecting various components. Workflows are executed by job runners. The outputs and the job history are saved. The tool is web based software tool and all actions can be performed remotely. CONCLUSIONS: Users without scripting knowledge can utilize the tool to build pipelines for executing tasks. Pipelines can be modeled as workflows that are reusable. BioFlow enables users to easily add new tools to the database. The workflows can be created and executed remotely. A number of parallel jobs can be easily controlled. Distributed execution is possible by running multiple instances of the application. Any number of tasks can be executed and the output will be stored making it is easy to correlate the outputs to the jobs executed. BioMed Central 2014-09-18 /pmc/articles/PMC4179862/ http://dx.doi.org/10.1186/1751-0473-9-20 Text en Copyright © 2014 Garner and Puthige; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software Review
Garner, Harold
Puthige, Ashwin
BioFlow: a web based workflow management software for design and execution of genomics pipelines
title BioFlow: a web based workflow management software for design and execution of genomics pipelines
title_full BioFlow: a web based workflow management software for design and execution of genomics pipelines
title_fullStr BioFlow: a web based workflow management software for design and execution of genomics pipelines
title_full_unstemmed BioFlow: a web based workflow management software for design and execution of genomics pipelines
title_short BioFlow: a web based workflow management software for design and execution of genomics pipelines
title_sort bioflow: a web based workflow management software for design and execution of genomics pipelines
topic Software Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4179862/
http://dx.doi.org/10.1186/1751-0473-9-20
work_keys_str_mv AT garnerharold bioflowawebbasedworkflowmanagementsoftwarefordesignandexecutionofgenomicspipelines
AT puthigeashwin bioflowawebbasedworkflowmanagementsoftwarefordesignandexecutionofgenomicspipelines