Kronos: a workflow assembler for genome analytics and informatics
BACKGROUND: The field of next-generation sequencing informatics has matured to a point where algorithmic advances in sequence alignment and individual feature detection methods have stabilized. Practical and robust implementation of complex analytical workflows (where such tools are structured into...
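Main authors: | Taghiyar, M. Jafar; Rosner, Jamie; Grewal, Diljot; Grande, Bruno M.; Aniba, Radhouane; Grewal, Jasleen; Boutros, Paul C.; Morin, Ryan D.; Bashashati, Ali; Shah, Sohrab P. |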
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press 2017 |
Subjects: | Research |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5569921/ https://www.ncbi.nlm.nih.gov/pubmed/28655203 http://dx.doi.org/10.1093/gigascience/gix042 |
_version_ | 1783259079285669888 |
---|---|
author | Taghiyar, M. Jafar; Rosner, Jamie; Grewal, Diljot; Grande, Bruno M.; Aniba, Radhouane; Grewal, Jasleen; Boutros, Paul C.; Morin, Ryan D.; Bashashati, Ali; Shah, Sohrab P. |
author_facet | Taghiyar, M. Jafar; Rosner, Jamie; Grewal, Diljot; Grande, Bruno M.; Aniba, Radhouane; Grewal, Jasleen; Boutros, Paul C.; Morin, Ryan D.; Bashashati, Ali; Shah, Sohrab P. |
author_sort | Taghiyar, M. Jafar |
collection | PubMed |
description | BACKGROUND: The field of next-generation sequencing informatics has matured to a point where algorithmic advances in sequence alignment and individual feature detection methods have stabilized. Practical and robust implementation of complex analytical workflows (where such tools are structured into “best practices” for automated analysis of next-generation sequencing datasets) still requires significant programming investment and expertise. RESULTS: We present Kronos, a software platform for facilitating the development and execution of modular, auditable, and distributable bioinformatics workflows. Kronos obviates the need for explicit coding of workflows by compiling a text configuration file into executable Python applications. Making analysis modules would still require programming. The framework of each workflow includes a run manager to execute the encoded workflows locally (or on a cluster or cloud), parallelize tasks, and log all runtime events. The resulting workflows are highly modular and configurable by construction, facilitating flexible and extensible meta-applications that can be modified easily through configuration file editing. The workflows are fully encoded for ease of distribution and can be instantiated on external systems, a step toward reproducible research and comparative analyses. We introduce a framework for building Kronos components that function as shareable, modular nodes in Kronos workflows. CONCLUSIONS: The Kronos platform provides a standard framework for developers to implement custom tools, reuse existing tools, and contribute to the community at large. Kronos is shipped with both Docker and Amazon Web Services Machine Images. It is free, open source, and available through the Python Package Index and at https://github.com/jtaghiyar/kronos. |
format | Online Article Text |
id | pubmed-5569921 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-55699212017-08-29 Kronos: a workflow assembler for genome analytics and informatics Taghiyar, M. Jafar Rosner, Jamie Grewal, Diljot Grande, Bruno M. Aniba, Radhouane Grewal, Jasleen Boutros, Paul C. Morin, Ryan D. Bashashati, Ali Shah, Sohrab P. Gigascience Research BACKGROUND: The field of next-generation sequencing informatics has matured to a point where algorithmic advances in sequence alignment and individual feature detection methods have stabilized. Practical and robust implementation of complex analytical workflows (where such tools are structured into “best practices” for automated analysis of next-generation sequencing datasets) still requires significant programming investment and expertise. RESULTS: We present Kronos, a software platform for facilitating the development and execution of modular, auditable, and distributable bioinformatics workflows. Kronos obviates the need for explicit coding of workflows by compiling a text configuration file into executable Python applications. Making analysis modules would still require programming. The framework of each workflow includes a run manager to execute the encoded workflows locally (or on a cluster or cloud), parallelize tasks, and log all runtime events. The resulting workflows are highly modular and configurable by construction, facilitating flexible and extensible meta-applications that can be modified easily through configuration file editing. The workflows are fully encoded for ease of distribution and can be instantiated on external systems, a step toward reproducible research and comparative analyses. We introduce a framework for building Kronos components that function as shareable, modular nodes in Kronos workflows. CONCLUSIONS: The Kronos platform provides a standard framework for developers to implement custom tools, reuse existing tools, and contribute to the community at large. Kronos is shipped with both Docker and Amazon Web Services Machine Images. It is free, open source, and available through the Python Package Index and at https://github.com/jtaghiyar/kronos. Oxford University Press 2017-06-26 /pmc/articles/PMC5569921/ /pubmed/28655203 http://dx.doi.org/10.1093/gigascience/gix042 Text en © The Authors 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Taghiyar, M. Jafar Rosner, Jamie Grewal, Diljot Grande, Bruno M. Aniba, Radhouane Grewal, Jasleen Boutros, Paul C. Morin, Ryan D. Bashashati, Ali Shah, Sohrab P. Kronos: a workflow assembler for genome analytics and informatics |
title | Kronos: a workflow assembler for genome analytics and informatics |
title_full | Kronos: a workflow assembler for genome analytics and informatics |
title_fullStr | Kronos: a workflow assembler for genome analytics and informatics |
title_full_unstemmed | Kronos: a workflow assembler for genome analytics and informatics |
title_short | Kronos: a workflow assembler for genome analytics and informatics |
title_sort | kronos: a workflow assembler for genome analytics and informatics |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5569921/ https://www.ncbi.nlm.nih.gov/pubmed/28655203 http://dx.doi.org/10.1093/gigascience/gix042 |
work_keys_str_mv | AT taghiyarmjafar kronosaworkflowassemblerforgenomeanalyticsandinformatics AT rosnerjamie kronosaworkflowassemblerforgenomeanalyticsandinformatics AT grewaldiljot kronosaworkflowassemblerforgenomeanalyticsandinformatics AT grandebrunom kronosaworkflowassemblerforgenomeanalyticsandinformatics AT anibaradhouane kronosaworkflowassemblerforgenomeanalyticsandinformatics AT grewaljasleen kronosaworkflowassemblerforgenomeanalyticsandinformatics AT boutrospaulc kronosaworkflowassemblerforgenomeanalyticsandinformatics AT morinryand kronosaworkflowassemblerforgenomeanalyticsandinformatics AT bashashatiali kronosaworkflowassemblerforgenomeanalyticsandinformatics AT shahsohrabp kronosaworkflowassemblerforgenomeanalyticsandinformatics |