CWL-Airflow: a lightweight pipeline manager supporting Common Workflow Language

BACKGROUND: Massive growth in the amount of research data and computational analysis has led to increased use of pipeline managers in biomedical computational research. However, each of the >100 such managers uses its own way to describe pipelines, leading to difficulty porting workflows to diffe...

Bibliographic Details
Main authors: Kotliar, Michael; Kartashov, Andrey V; Barski, Artem
Format: Online Article Text
Language: English
Published: Oxford University Press 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6639121/
https://www.ncbi.nlm.nih.gov/pubmed/31321430
http://dx.doi.org/10.1093/gigascience/giz084
author Kotliar, Michael
Kartashov, Andrey V
Barski, Artem
collection PubMed
description BACKGROUND: Massive growth in the amount of research data and computational analysis has led to increased use of pipeline managers in biomedical computational research. However, each of the >100 such managers uses its own way to describe pipelines, leading to difficulty porting workflows to different environments and therefore poor reproducibility of computational studies. For this reason, the Common Workflow Language (CWL) was recently introduced as a specification for platform-independent workflow description, and work began to transition existing pipelines and workflow managers to CWL. FINDINGS: Herein, we present CWL-Airflow, a package that adds support for CWL to the Apache Airflow pipeline manager. CWL-Airflow uses CWL version 1.0 specification and can run workflows on stand-alone MacOS/Linux servers, on clusters, or on a variety of cloud platforms. A sample CWL pipeline for processing of chromatin immunoprecipitation sequencing data is provided. CONCLUSIONS: CWL-Airflow will provide users with the features of a fully fledged pipeline manager and the ability to execute CWL workflows anywhere Airflow can run—from a laptop to a cluster or cloud environment. CWL-Airflow is available under Apache License, version 2.0 (Apache-2.0), and can be downloaded from https://barski-lab.github.io/cwl-airflow, https://scicrunch.org/resolver/RRID:SCR_017196.
format Online
Article
Text
id pubmed-6639121
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-6639121 2019-07-23 CWL-Airflow: a lightweight pipeline manager supporting Common Workflow Language. Kotliar, Michael; Kartashov, Andrey V; Barski, Artem. Gigascience. Technical Note. Oxford University Press 2019-07-18 /pmc/articles/PMC6639121/ /pubmed/31321430 http://dx.doi.org/10.1093/gigascience/giz084 Text en © The Author(s) 2019. Published by Oxford University Press.
http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title CWL-Airflow: a lightweight pipeline manager supporting Common Workflow Language
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6639121/
https://www.ncbi.nlm.nih.gov/pubmed/31321430
http://dx.doi.org/10.1093/gigascience/giz084