Cargando…

Perspectives on automated composition of workflows in the life sciences

Scientific data analyses often combine several computational tools in automated pipelines, or workflows. Thousands of such workflows have been used in the life sciences, though their composition has remained a cumbersome manual process due to a lack of standards for annotation, assembly, and impleme...

Descripción completa

Detalles Bibliográficos
Autores principales: Lamprecht, Anna-Lena, Palmblad, Magnus, Ison, Jon, Schwämmle, Veit, Al Manir, Mohammad Sadnan, Altintas, Ilkay, Baker, Christopher J. O., Ben Hadj Amor, Ammar, Capella-Gutierrez, Salvador, Charonyktakis, Paulos, Crusoe, Michael R., Gil, Yolanda, Goble, Carole, Griffin, Timothy J., Groth, Paul, Ienasescu, Hans, Jagtap, Pratik, Kalaš, Matúš, Kasalica, Vedran, Khanteymoori, Alireza, Kuhn, Tobias, Mei, Hailiang, Ménager, Hervé, Möller, Steffen, Richardson, Robin A., Robert, Vincent, Soiland-Reyes, Stian, Stevens, Robert, Szaniszlo, Szoke, Verberne, Suzan, Verhoeven, Aswin, Wolstencroft, Katherine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573700/
https://www.ncbi.nlm.nih.gov/pubmed/34804501
http://dx.doi.org/10.12688/f1000research.54159.1
_version_ 1784595481256525824
author Lamprecht, Anna-Lena
Palmblad, Magnus
Ison, Jon
Schwämmle, Veit
Al Manir, Mohammad Sadnan
Altintas, Ilkay
Baker, Christopher J. O.
Ben Hadj Amor, Ammar
Capella-Gutierrez, Salvador
Charonyktakis, Paulos
Crusoe, Michael R.
Gil, Yolanda
Goble, Carole
Griffin, Timothy J.
Groth, Paul
Ienasescu, Hans
Jagtap, Pratik
Kalaš, Matúš
Kasalica, Vedran
Khanteymoori, Alireza
Kuhn, Tobias
Mei, Hailiang
Ménager, Hervé
Möller, Steffen
Richardson, Robin A.
Robert, Vincent
Soiland-Reyes, Stian
Stevens, Robert
Szaniszlo, Szoke
Verberne, Suzan
Verhoeven, Aswin
Wolstencroft, Katherine
author_facet Lamprecht, Anna-Lena
Palmblad, Magnus
Ison, Jon
Schwämmle, Veit
Al Manir, Mohammad Sadnan
Altintas, Ilkay
Baker, Christopher J. O.
Ben Hadj Amor, Ammar
Capella-Gutierrez, Salvador
Charonyktakis, Paulos
Crusoe, Michael R.
Gil, Yolanda
Goble, Carole
Griffin, Timothy J.
Groth, Paul
Ienasescu, Hans
Jagtap, Pratik
Kalaš, Matúš
Kasalica, Vedran
Khanteymoori, Alireza
Kuhn, Tobias
Mei, Hailiang
Ménager, Hervé
Möller, Steffen
Richardson, Robin A.
Robert, Vincent
Soiland-Reyes, Stian
Stevens, Robert
Szaniszlo, Szoke
Verberne, Suzan
Verhoeven, Aswin
Wolstencroft, Katherine
author_sort Lamprecht, Anna-Lena
collection PubMed
description Scientific data analyses often combine several computational tools in automated pipelines, or workflows. Thousands of such workflows have been used in the life sciences, though their composition has remained a cumbersome manual process due to a lack of standards for annotation, assembly, and implementation. Recent technological advances have returned the long-standing vision of automated workflow composition into focus. This article summarizes a recent Lorentz Center workshop dedicated to automated composition of workflows in the life sciences. We survey previous initiatives to automate the composition process, and discuss the current state of the art and future perspectives. We start by drawing the “big picture” of the scientific workflow development life cycle, before surveying and discussing current methods, technologies and practices for semantic domain modelling, automation in workflow development, and workflow assessment. Finally, we derive a roadmap of individual and community-based actions to work toward the vision of automated workflow development in the forthcoming years. A central outcome of the workshop is a general description of the workflow life cycle in six stages: 1) scientific question or hypothesis, 2) conceptual workflow, 3) abstract workflow, 4) concrete workflow, 5) production workflow, and 6) scientific results. The transitions between stages are facilitated by diverse tools and methods, usually incorporating domain knowledge in some form. Formal semantic domain modelling is hard and often a bottleneck for the application of semantic technologies. However, life science communities have made considerable progress here in recent years and are continuously improving, renewing interest in the application of semantic technologies for workflow exploration, composition and instantiation. Combined with systematic benchmarking with reference data and large-scale deployment of production-stage workflows, such technologies enable a more systematic process of workflow development than we know today. We believe that this can lead to more robust, reusable, and sustainable workflows in the future.
format Online
Article
Text
id pubmed-8573700
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-85737002021-11-18 Perspectives on automated composition of workflows in the life sciences Lamprecht, Anna-Lena Palmblad, Magnus Ison, Jon Schwämmle, Veit Al Manir, Mohammad Sadnan Altintas, Ilkay Baker, Christopher J. O. Ben Hadj Amor, Ammar Capella-Gutierrez, Salvador Charonyktakis, Paulos Crusoe, Michael R. Gil, Yolanda Goble, Carole Griffin, Timothy J. Groth, Paul Ienasescu, Hans Jagtap, Pratik Kalaš, Matúš Kasalica, Vedran Khanteymoori, Alireza Kuhn, Tobias Mei, Hailiang Ménager, Hervé Möller, Steffen Richardson, Robin A. Robert, Vincent Soiland-Reyes, Stian Stevens, Robert Szaniszlo, Szoke Verberne, Suzan Verhoeven, Aswin Wolstencroft, Katherine F1000Res Opinion Article Scientific data analyses often combine several computational tools in automated pipelines, or workflows. Thousands of such workflows have been used in the life sciences, though their composition has remained a cumbersome manual process due to a lack of standards for annotation, assembly, and implementation. Recent technological advances have returned the long-standing vision of automated workflow composition into focus. This article summarizes a recent Lorentz Center workshop dedicated to automated composition of workflows in the life sciences. We survey previous initiatives to automate the composition process, and discuss the current state of the art and future perspectives. We start by drawing the “big picture” of the scientific workflow development life cycle, before surveying and discussing current methods, technologies and practices for semantic domain modelling, automation in workflow development, and workflow assessment. Finally, we derive a roadmap of individual and community-based actions to work toward the vision of automated workflow development in the forthcoming years. A central outcome of the workshop is a general description of the workflow life cycle in six stages: 1) scientific question or hypothesis, 2) conceptual workflow, 3) abstract workflow, 4) concrete workflow, 5) production workflow, and 6) scientific results. The transitions between stages are facilitated by diverse tools and methods, usually incorporating domain knowledge in some form. Formal semantic domain modelling is hard and often a bottleneck for the application of semantic technologies. However, life science communities have made considerable progress here in recent years and are continuously improving, renewing interest in the application of semantic technologies for workflow exploration, composition and instantiation. Combined with systematic benchmarking with reference data and large-scale deployment of production-stage workflows, such technologies enable a more systematic process of workflow development than we know today. We believe that this can lead to more robust, reusable, and sustainable workflows in the future. F1000 Research Limited 2021-09-07 /pmc/articles/PMC8573700/ /pubmed/34804501 http://dx.doi.org/10.12688/f1000research.54159.1 Text en Copyright: © 2021 Lamprecht AL et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Opinion Article
Lamprecht, Anna-Lena
Palmblad, Magnus
Ison, Jon
Schwämmle, Veit
Al Manir, Mohammad Sadnan
Altintas, Ilkay
Baker, Christopher J. O.
Ben Hadj Amor, Ammar
Capella-Gutierrez, Salvador
Charonyktakis, Paulos
Crusoe, Michael R.
Gil, Yolanda
Goble, Carole
Griffin, Timothy J.
Groth, Paul
Ienasescu, Hans
Jagtap, Pratik
Kalaš, Matúš
Kasalica, Vedran
Khanteymoori, Alireza
Kuhn, Tobias
Mei, Hailiang
Ménager, Hervé
Möller, Steffen
Richardson, Robin A.
Robert, Vincent
Soiland-Reyes, Stian
Stevens, Robert
Szaniszlo, Szoke
Verberne, Suzan
Verhoeven, Aswin
Wolstencroft, Katherine
Perspectives on automated composition of workflows in the life sciences
title Perspectives on automated composition of workflows in the life sciences
title_full Perspectives on automated composition of workflows in the life sciences
title_fullStr Perspectives on automated composition of workflows in the life sciences
title_full_unstemmed Perspectives on automated composition of workflows in the life sciences
title_short Perspectives on automated composition of workflows in the life sciences
title_sort perspectives on automated composition of workflows in the life sciences
topic Opinion Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573700/
https://www.ncbi.nlm.nih.gov/pubmed/34804501
http://dx.doi.org/10.12688/f1000research.54159.1
work_keys_str_mv AT lamprechtannalena perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT palmbladmagnus perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT isonjon perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT schwammleveit perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT almanirmohammadsadnan perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT altintasilkay perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT bakerchristopherjo perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT benhadjamorammar perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT capellagutierrezsalvador perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT charonyktakispaulos perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT crusoemichaelr perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT gilyolanda perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT goblecarole perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT griffintimothyj perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT grothpaul perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT ienasescuhans perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT jagtappratik perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT kalasmatus perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT kasalicavedran perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT khanteymoorialireza perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT kuhntobias perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT meihailiang perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT menagerherve perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT mollersteffen perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT richardsonrobina perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT robertvincent perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT soilandreyesstian perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT stevensrobert perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT szaniszloszoke perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT verbernesuzan perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT verhoevenaswin perspectivesonautomatedcompositionofworkflowsinthelifesciences
AT wolstencroftkatherine perspectivesonautomatedcompositionofworkflowsinthelifesciences