Cargando…
Perspectives on automated composition of workflows in the life sciences
Scientific data analyses often combine several computational tools in automated pipelines, or workflows. Thousands of such workflows have been used in the life sciences, though their composition has remained a cumbersome manual process due to a lack of standards for annotation, assembly, and impleme...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
F1000 Research Limited
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573700/ https://www.ncbi.nlm.nih.gov/pubmed/34804501 http://dx.doi.org/10.12688/f1000research.54159.1 |
_version_ | 1784595481256525824 |
---|---|
author | Lamprecht, Anna-Lena Palmblad, Magnus Ison, Jon Schwämmle, Veit Al Manir, Mohammad Sadnan Altintas, Ilkay Baker, Christopher J. O. Ben Hadj Amor, Ammar Capella-Gutierrez, Salvador Charonyktakis, Paulos Crusoe, Michael R. Gil, Yolanda Goble, Carole Griffin, Timothy J. Groth, Paul Ienasescu, Hans Jagtap, Pratik Kalaš, Matúš Kasalica, Vedran Khanteymoori, Alireza Kuhn, Tobias Mei, Hailiang Ménager, Hervé Möller, Steffen Richardson, Robin A. Robert, Vincent Soiland-Reyes, Stian Stevens, Robert Szaniszlo, Szoke Verberne, Suzan Verhoeven, Aswin Wolstencroft, Katherine |
author_facet | Lamprecht, Anna-Lena Palmblad, Magnus Ison, Jon Schwämmle, Veit Al Manir, Mohammad Sadnan Altintas, Ilkay Baker, Christopher J. O. Ben Hadj Amor, Ammar Capella-Gutierrez, Salvador Charonyktakis, Paulos Crusoe, Michael R. Gil, Yolanda Goble, Carole Griffin, Timothy J. Groth, Paul Ienasescu, Hans Jagtap, Pratik Kalaš, Matúš Kasalica, Vedran Khanteymoori, Alireza Kuhn, Tobias Mei, Hailiang Ménager, Hervé Möller, Steffen Richardson, Robin A. Robert, Vincent Soiland-Reyes, Stian Stevens, Robert Szaniszlo, Szoke Verberne, Suzan Verhoeven, Aswin Wolstencroft, Katherine |
author_sort | Lamprecht, Anna-Lena |
collection | PubMed |
description | Scientific data analyses often combine several computational tools in automated pipelines, or workflows. Thousands of such workflows have been used in the life sciences, though their composition has remained a cumbersome manual process due to a lack of standards for annotation, assembly, and implementation. Recent technological advances have returned the long-standing vision of automated workflow composition into focus. This article summarizes a recent Lorentz Center workshop dedicated to automated composition of workflows in the life sciences. We survey previous initiatives to automate the composition process, and discuss the current state of the art and future perspectives. We start by drawing the “big picture” of the scientific workflow development life cycle, before surveying and discussing current methods, technologies and practices for semantic domain modelling, automation in workflow development, and workflow assessment. Finally, we derive a roadmap of individual and community-based actions to work toward the vision of automated workflow development in the forthcoming years. A central outcome of the workshop is a general description of the workflow life cycle in six stages: 1) scientific question or hypothesis, 2) conceptual workflow, 3) abstract workflow, 4) concrete workflow, 5) production workflow, and 6) scientific results. The transitions between stages are facilitated by diverse tools and methods, usually incorporating domain knowledge in some form. Formal semantic domain modelling is hard and often a bottleneck for the application of semantic technologies. However, life science communities have made considerable progress here in recent years and are continuously improving, renewing interest in the application of semantic technologies for workflow exploration, composition and instantiation. Combined with systematic benchmarking with reference data and large-scale deployment of production-stage workflows, such technologies enable a more systematic process of workflow development than we know today. We believe that this can lead to more robust, reusable, and sustainable workflows in the future. |
format | Online Article Text |
id | pubmed-8573700 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | F1000 Research Limited |
record_format | MEDLINE/PubMed |
spelling | pubmed-85737002021-11-18 Perspectives on automated composition of workflows in the life sciences Lamprecht, Anna-Lena Palmblad, Magnus Ison, Jon Schwämmle, Veit Al Manir, Mohammad Sadnan Altintas, Ilkay Baker, Christopher J. O. Ben Hadj Amor, Ammar Capella-Gutierrez, Salvador Charonyktakis, Paulos Crusoe, Michael R. Gil, Yolanda Goble, Carole Griffin, Timothy J. Groth, Paul Ienasescu, Hans Jagtap, Pratik Kalaš, Matúš Kasalica, Vedran Khanteymoori, Alireza Kuhn, Tobias Mei, Hailiang Ménager, Hervé Möller, Steffen Richardson, Robin A. Robert, Vincent Soiland-Reyes, Stian Stevens, Robert Szaniszlo, Szoke Verberne, Suzan Verhoeven, Aswin Wolstencroft, Katherine F1000Res Opinion Article Scientific data analyses often combine several computational tools in automated pipelines, or workflows. Thousands of such workflows have been used in the life sciences, though their composition has remained a cumbersome manual process due to a lack of standards for annotation, assembly, and implementation. Recent technological advances have returned the long-standing vision of automated workflow composition into focus. This article summarizes a recent Lorentz Center workshop dedicated to automated composition of workflows in the life sciences. We survey previous initiatives to automate the composition process, and discuss the current state of the art and future perspectives. We start by drawing the “big picture” of the scientific workflow development life cycle, before surveying and discussing current methods, technologies and practices for semantic domain modelling, automation in workflow development, and workflow assessment. Finally, we derive a roadmap of individual and community-based actions to work toward the vision of automated workflow development in the forthcoming years. A central outcome of the workshop is a general description of the workflow life cycle in six stages: 1) scientific question or hypothesis, 2) conceptual workflow, 3) abstract workflow, 4) concrete workflow, 5) production workflow, and 6) scientific results. The transitions between stages are facilitated by diverse tools and methods, usually incorporating domain knowledge in some form. Formal semantic domain modelling is hard and often a bottleneck for the application of semantic technologies. However, life science communities have made considerable progress here in recent years and are continuously improving, renewing interest in the application of semantic technologies for workflow exploration, composition and instantiation. Combined with systematic benchmarking with reference data and large-scale deployment of production-stage workflows, such technologies enable a more systematic process of workflow development than we know today. We believe that this can lead to more robust, reusable, and sustainable workflows in the future. F1000 Research Limited 2021-09-07 /pmc/articles/PMC8573700/ /pubmed/34804501 http://dx.doi.org/10.12688/f1000research.54159.1 Text en Copyright: © 2021 Lamprecht AL et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Opinion Article Lamprecht, Anna-Lena Palmblad, Magnus Ison, Jon Schwämmle, Veit Al Manir, Mohammad Sadnan Altintas, Ilkay Baker, Christopher J. O. Ben Hadj Amor, Ammar Capella-Gutierrez, Salvador Charonyktakis, Paulos Crusoe, Michael R. Gil, Yolanda Goble, Carole Griffin, Timothy J. Groth, Paul Ienasescu, Hans Jagtap, Pratik Kalaš, Matúš Kasalica, Vedran Khanteymoori, Alireza Kuhn, Tobias Mei, Hailiang Ménager, Hervé Möller, Steffen Richardson, Robin A. Robert, Vincent Soiland-Reyes, Stian Stevens, Robert Szaniszlo, Szoke Verberne, Suzan Verhoeven, Aswin Wolstencroft, Katherine Perspectives on automated composition of workflows in the life sciences |
title | Perspectives on automated composition of workflows in the life sciences |
title_full | Perspectives on automated composition of workflows in the life sciences |
title_fullStr | Perspectives on automated composition of workflows in the life sciences |
title_full_unstemmed | Perspectives on automated composition of workflows in the life sciences |
title_short | Perspectives on automated composition of workflows in the life sciences |
title_sort | perspectives on automated composition of workflows in the life sciences |
topic | Opinion Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573700/ https://www.ncbi.nlm.nih.gov/pubmed/34804501 http://dx.doi.org/10.12688/f1000research.54159.1 |
work_keys_str_mv | AT lamprechtannalena perspectivesonautomatedcompositionofworkflowsinthelifesciences AT palmbladmagnus perspectivesonautomatedcompositionofworkflowsinthelifesciences AT isonjon perspectivesonautomatedcompositionofworkflowsinthelifesciences AT schwammleveit perspectivesonautomatedcompositionofworkflowsinthelifesciences AT almanirmohammadsadnan perspectivesonautomatedcompositionofworkflowsinthelifesciences AT altintasilkay perspectivesonautomatedcompositionofworkflowsinthelifesciences AT bakerchristopherjo perspectivesonautomatedcompositionofworkflowsinthelifesciences AT benhadjamorammar perspectivesonautomatedcompositionofworkflowsinthelifesciences AT capellagutierrezsalvador perspectivesonautomatedcompositionofworkflowsinthelifesciences AT charonyktakispaulos perspectivesonautomatedcompositionofworkflowsinthelifesciences AT crusoemichaelr perspectivesonautomatedcompositionofworkflowsinthelifesciences AT gilyolanda perspectivesonautomatedcompositionofworkflowsinthelifesciences AT goblecarole perspectivesonautomatedcompositionofworkflowsinthelifesciences AT griffintimothyj perspectivesonautomatedcompositionofworkflowsinthelifesciences AT grothpaul perspectivesonautomatedcompositionofworkflowsinthelifesciences AT ienasescuhans perspectivesonautomatedcompositionofworkflowsinthelifesciences AT jagtappratik perspectivesonautomatedcompositionofworkflowsinthelifesciences AT kalasmatus perspectivesonautomatedcompositionofworkflowsinthelifesciences AT kasalicavedran perspectivesonautomatedcompositionofworkflowsinthelifesciences AT khanteymoorialireza perspectivesonautomatedcompositionofworkflowsinthelifesciences AT kuhntobias perspectivesonautomatedcompositionofworkflowsinthelifesciences AT meihailiang perspectivesonautomatedcompositionofworkflowsinthelifesciences AT menagerherve perspectivesonautomatedcompositionofworkflowsinthelifesciences AT mollersteffen perspectivesonautomatedcompositionofworkflowsinthelifesciences AT richardsonrobina perspectivesonautomatedcompositionofworkflowsinthelifesciences AT robertvincent perspectivesonautomatedcompositionofworkflowsinthelifesciences AT soilandreyesstian perspectivesonautomatedcompositionofworkflowsinthelifesciences AT stevensrobert perspectivesonautomatedcompositionofworkflowsinthelifesciences AT szaniszloszoke perspectivesonautomatedcompositionofworkflowsinthelifesciences AT verbernesuzan perspectivesonautomatedcompositionofworkflowsinthelifesciences AT verhoevenaswin perspectivesonautomatedcompositionofworkflowsinthelifesciences AT wolstencroftkatherine perspectivesonautomatedcompositionofworkflowsinthelifesciences |