
Arteria: An automation system for a sequencing core facility

BACKGROUND: In recent years, nucleotide sequencing has become increasingly instrumental in both research and clinical settings. This has led to an explosive growth in sequencing data produced worldwide. As the amount of data increases, so does the need for automated solutions for data processing and...


Bibliographic Details
Main Authors: Dahlberg, Johan, Hermansson, Johan, Sturlaugsson, Steinar, Lysenkova, Mariya, Smeds, Patrik, Ladenvall, Claes, Guimera, Roman Valls, Reisinger, Florian, Hofmann, Oliver, Larsson, Pontus
Format: Online Article Text
Language: English
Published: Oxford University Press 2019
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6905352/
https://www.ncbi.nlm.nih.gov/pubmed/31825479
http://dx.doi.org/10.1093/gigascience/giz135
_version_ 1783478154387521536
author Dahlberg, Johan
Hermansson, Johan
Sturlaugsson, Steinar
Lysenkova, Mariya
Smeds, Patrik
Ladenvall, Claes
Guimera, Roman Valls
Reisinger, Florian
Hofmann, Oliver
Larsson, Pontus
author_sort Dahlberg, Johan
collection PubMed
description BACKGROUND: In recent years, nucleotide sequencing has become increasingly instrumental in both research and clinical settings. This has led to an explosive growth in sequencing data produced worldwide. As the amount of data increases, so does the need for automated solutions for data processing and analysis. The concept of workflows has gained favour in the bioinformatics community, but there is little in the scientific literature describing end-to-end automation systems. Arteria is an automation system that aims at providing a solution to the data-related operational challenges that face sequencing core facilities. FINDINGS: Arteria is built on existing open source technologies, with a modular design allowing for a community-driven effort to create plug-and-play micro-services. In this article we describe the system, elaborate on the underlying conceptual framework, and present an example implementation. Arteria can be reduced to 3 conceptual levels: orchestration (using an event-based model of automation), process (the steps involved in processing sequencing data, modelled as workflows), and execution (using a series of RESTful micro-services). This creates a system that is both flexible and scalable. Arteria-based systems have been successfully deployed at 3 sequencing core facilities. The Arteria Project code, written largely in Python, is available as open source software, and more information can be found at https://arteria-project.github.io/ . CONCLUSIONS: We describe the Arteria system and the underlying conceptual framework, demonstrating how this model can be used to automate data handling and analysis in the context of a sequencing core facility.
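The three conceptual levels named in the abstract can be illustrated with a minimal Python sketch. This is not Arteria's code or API: every class, function, and event name below is hypothetical, and the "services" are plain in-process functions standing in for RESTful micro-services so that the example runs without a network.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Execution level (assumption): each function stands in for one RESTful
# micro-service. A real deployment would make HTTP calls instead.
def demultiplex_service(runfolder: str) -> str:
    return f"demultiplexed {runfolder}"


def quality_control_service(runfolder: str) -> str:
    return f"checked quality of {runfolder}"


Service = Callable[[str], str]


@dataclass
class Workflow:
    """Process level: an ordered list of steps, each backed by one service."""

    name: str
    steps: List[Service]

    def run(self, runfolder: str) -> List[str]:
        # Run the steps in order and collect what each service reports.
        return [step(runfolder) for step in self.steps]


@dataclass
class Orchestrator:
    """Orchestration level: maps event types to the workflow they trigger."""

    rules: Dict[str, Workflow] = field(default_factory=dict)

    def on(self, event_type: str, workflow: Workflow) -> None:
        self.rules[event_type] = workflow

    def emit(self, event_type: str, runfolder: str) -> List[str]:
        workflow = self.rules.get(event_type)
        if workflow is None:
            return []  # no rule registered for this event type
        return workflow.run(runfolder)


# Hypothetical event name and runfolder identifier, for illustration only.
orchestrator = Orchestrator()
orchestrator.on(
    "runfolder_ready",
    Workflow("process_run", [demultiplex_service, quality_control_service]),
)
results = orchestrator.emit("runfolder_ready", "run_001")
```

In this sketch the orchestrator knows only about events and workflows, and a workflow knows only about the services it calls, so a service can be swapped without touching the other two levels. That separation is one plausible reading of the modular, plug-and-play design the abstract describes.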
format Online
Article
Text
id pubmed-6905352
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-6905352 2019-12-16 Arteria: An automation system for a sequencing core facility Gigascience Technical Note
Oxford University Press 2019-12-11 /pmc/articles/PMC6905352/ /pubmed/31825479 http://dx.doi.org/10.1093/gigascience/giz135 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
title Arteria: An automation system for a sequencing core facility
topic Technical Note