Cargando…

Kronos: a workflow assembler for genome analytics and informatics

BACKGROUND: The field of next-generation sequencing informatics has matured to a point where algorithmic advances in sequence alignment and individual feature detection methods have stabilized. Practical and robust implementation of complex analytical workflows (where such tools are structured into...

Descripción completa

Detalles Bibliográficos
Autores principales: Taghiyar, M. Jafar, Rosner, Jamie, Grewal, Diljot, Grande, Bruno M., Aniba, Radhouane, Grewal, Jasleen, Boutros, Paul C., Morin, Ryan D., Bashashati, Ali, Shah, Sohrab P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5569921/
https://www.ncbi.nlm.nih.gov/pubmed/28655203
http://dx.doi.org/10.1093/gigascience/gix042
_version_ 1783259079285669888
author Taghiyar, M. Jafar
Rosner, Jamie
Grewal, Diljot
Grande, Bruno M.
Aniba, Radhouane
Grewal, Jasleen
Boutros, Paul C.
Morin, Ryan D.
Bashashati, Ali
Shah, Sohrab P.
author_facet Taghiyar, M. Jafar
Rosner, Jamie
Grewal, Diljot
Grande, Bruno M.
Aniba, Radhouane
Grewal, Jasleen
Boutros, Paul C.
Morin, Ryan D.
Bashashati, Ali
Shah, Sohrab P.
author_sort Taghiyar, M. Jafar
collection PubMed
description BACKGROUND: The field of next-generation sequencing informatics has matured to a point where algorithmic advances in sequence alignment and individual feature detection methods have stabilized. Practical and robust implementation of complex analytical workflows (where such tools are structured into “best practices” for automated analysis of next-generation sequencing datasets) still requires significant programming investment and expertise. RESULTS: We present Kronos, a software platform for facilitating the development and execution of modular, auditable, and distributable bioinformatics workflows. Kronos obviates the need for explicit coding of workflows by compiling a text configuration file into executable Python applications. Making analysis modules would still require programming. The framework of each workflow includes a run manager to execute the encoded workflows locally (or on a cluster or cloud), parallelize tasks, and log all runtime events. The resulting workflows are highly modular and configurable by construction, facilitating flexible and extensible meta-applications that can be modified easily through configuration file editing. The workflows are fully encoded for ease of distribution and can be instantiated on external systems, a step toward reproducible research and comparative analyses. We introduce a framework for building Kronos components that function as shareable, modular nodes in Kronos workflows. CONCLUSIONS: The Kronos platform provides a standard framework for developers to implement custom tools, reuse existing tools, and contribute to the community at large. Kronos is shipped with both Docker and Amazon Web Services Machine Images. It is free, open source, and available through the Python Package Index and at https://github.com/jtaghiyar/kronos.
format Online
Article
Text
id pubmed-5569921
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-55699212017-08-29 Kronos: a workflow assembler for genome analytics and informatics Taghiyar, M. Jafar Rosner, Jamie Grewal, Diljot Grande, Bruno M. Aniba, Radhouane Grewal, Jasleen Boutros, Paul C. Morin, Ryan D. Bashashati, Ali Shah, Sohrab P. Gigascience Research BACKGROUND: The field of next-generation sequencing informatics has matured to a point where algorithmic advances in sequence alignment and individual feature detection methods have stabilized. Practical and robust implementation of complex analytical workflows (where such tools are structured into “best practices” for automated analysis of next-generation sequencing datasets) still requires significant programming investment and expertise. RESULTS: We present Kronos, a software platform for facilitating the development and execution of modular, auditable, and distributable bioinformatics workflows. Kronos obviates the need for explicit coding of workflows by compiling a text configuration file into executable Python applications. Making analysis modules would still require programming. The framework of each workflow includes a run manager to execute the encoded workflows locally (or on a cluster or cloud), parallelize tasks, and log all runtime events. The resulting workflows are highly modular and configurable by construction, facilitating flexible and extensible meta-applications that can be modified easily through configuration file editing. The workflows are fully encoded for ease of distribution and can be instantiated on external systems, a step toward reproducible research and comparative analyses. We introduce a framework for building Kronos components that function as shareable, modular nodes in Kronos workflows. CONCLUSIONS: The Kronos platform provides a standard framework for developers to implement custom tools, reuse existing tools, and contribute to the community at large. Kronos is shipped with both Docker and Amazon Web Services Machine Images. It is free, open source, and available through the Python Package Index and at https://github.com/jtaghiyar/kronos. Oxford University Press 2017-06-26 /pmc/articles/PMC5569921/ /pubmed/28655203 http://dx.doi.org/10.1093/gigascience/gix042 Text en © The Authors 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Taghiyar, M. Jafar
Rosner, Jamie
Grewal, Diljot
Grande, Bruno M.
Aniba, Radhouane
Grewal, Jasleen
Boutros, Paul C.
Morin, Ryan D.
Bashashati, Ali
Shah, Sohrab P.
Kronos: a workflow assembler for genome analytics and informatics
title Kronos: a workflow assembler for genome analytics and informatics
title_full Kronos: a workflow assembler for genome analytics and informatics
title_fullStr Kronos: a workflow assembler for genome analytics and informatics
title_full_unstemmed Kronos: a workflow assembler for genome analytics and informatics
title_short Kronos: a workflow assembler for genome analytics and informatics
title_sort kronos: a workflow assembler for genome analytics and informatics
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5569921/
https://www.ncbi.nlm.nih.gov/pubmed/28655203
http://dx.doi.org/10.1093/gigascience/gix042
work_keys_str_mv AT taghiyarmjafar kronosaworkflowassemblerforgenomeanalyticsandinformatics
AT rosnerjamie kronosaworkflowassemblerforgenomeanalyticsandinformatics
AT grewaldiljot kronosaworkflowassemblerforgenomeanalyticsandinformatics
AT grandebrunom kronosaworkflowassemblerforgenomeanalyticsandinformatics
AT anibaradhouane kronosaworkflowassemblerforgenomeanalyticsandinformatics
AT grewaljasleen kronosaworkflowassemblerforgenomeanalyticsandinformatics
AT boutrospaulc kronosaworkflowassemblerforgenomeanalyticsandinformatics
AT morinryand kronosaworkflowassemblerforgenomeanalyticsandinformatics
AT bashashatiali kronosaworkflowassemblerforgenomeanalyticsandinformatics
AT shahsohrabp kronosaworkflowassemblerforgenomeanalyticsandinformatics