Cargando…

APE in the Wild: Automated Exploration of Proteomics Workflows in the bio.tools Registry

[Image: see text] The bio.tools registry is a main catalogue of computational tools in the life sciences. More than 17 000 tools have been registered by the international bioinformatics community. The bio.tools metadata schema includes semantic annotations of tool functions, that is, formal descript...

Descripción completa

Detalles Bibliográficos
Autores principales: Kasalica, Vedran, Schwämmle, Veit, Palmblad, Magnus, Ison, Jon, Lamprecht, Anna-Lena
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2021
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8041394/
https://www.ncbi.nlm.nih.gov/pubmed/33720735
http://dx.doi.org/10.1021/acs.jproteome.0c00983
_version_ 1783677930046488576
author Kasalica, Vedran
Schwämmle, Veit
Palmblad, Magnus
Ison, Jon
Lamprecht, Anna-Lena
author_facet Kasalica, Vedran
Schwämmle, Veit
Palmblad, Magnus
Ison, Jon
Lamprecht, Anna-Lena
author_sort Kasalica, Vedran
collection PubMed
description [Image: see text] The bio.tools registry is a main catalogue of computational tools in the life sciences. More than 17 000 tools have been registered by the international bioinformatics community. The bio.tools metadata schema includes semantic annotations of tool functions, that is, formal descriptions of tools’ data types, formats, and operations with terms from the EDAM bioinformatics ontology. Such annotations enable the automated composition of tools into multistep pipelines or workflows. In this Technical Note, we revisit a previous case study on the automated composition of proteomics workflows. We use the same four workflow scenarios but instead of using a small set of tools with carefully handcrafted annotations, we explore workflows directly on bio.tools. We use the Automated Pipeline Explorer (APE), a reimplementation and extension of the workflow composition method previously used. Moving “into the wild” opens up an unprecedented wealth of tools and a huge number of alternative workflows. Automated composition tools can be used to explore this space of possibilities systematically. Inevitably, the mixed quality of semantic annotations in bio.tools leads to unintended or erroneous tool combinations. However, our results also show that additional control mechanisms (tool filters, configuration options, and workflow constraints) can effectively guide the exploration toward smaller sets of more meaningful workflows.
format Online
Article
Text
id pubmed-8041394
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-80413942021-04-13 APE in the Wild: Automated Exploration of Proteomics Workflows in the bio.tools Registry Kasalica, Vedran Schwämmle, Veit Palmblad, Magnus Ison, Jon Lamprecht, Anna-Lena J Proteome Res [Image: see text] The bio.tools registry is a main catalogue of computational tools in the life sciences. More than 17 000 tools have been registered by the international bioinformatics community. The bio.tools metadata schema includes semantic annotations of tool functions, that is, formal descriptions of tools’ data types, formats, and operations with terms from the EDAM bioinformatics ontology. Such annotations enable the automated composition of tools into multistep pipelines or workflows. In this Technical Note, we revisit a previous case study on the automated composition of proteomics workflows. We use the same four workflow scenarios but instead of using a small set of tools with carefully handcrafted annotations, we explore workflows directly on bio.tools. We use the Automated Pipeline Explorer (APE), a reimplementation and extension of the workflow composition method previously used. Moving “into the wild” opens up an unprecedented wealth of tools and a huge number of alternative workflows. Automated composition tools can be used to explore this space of possibilities systematically. Inevitably, the mixed quality of semantic annotations in bio.tools leads to unintended or erroneous tool combinations. However, our results also show that additional control mechanisms (tool filters, configuration options, and workflow constraints) can effectively guide the exploration toward smaller sets of more meaningful workflows. American Chemical Society 2021-03-15 2021-04-02 /pmc/articles/PMC8041394/ /pubmed/33720735 http://dx.doi.org/10.1021/acs.jproteome.0c00983 Text en © 2021 The Authors. Published by American Chemical Society Permits non-commercial access and re-use, provided that author attribution and integrity are maintained; but does not permit creation of adaptations or other derivative works (https://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Kasalica, Vedran
Schwämmle, Veit
Palmblad, Magnus
Ison, Jon
Lamprecht, Anna-Lena
APE in the Wild: Automated Exploration of Proteomics Workflows in the bio.tools Registry
title APE in the Wild: Automated Exploration of Proteomics Workflows in the bio.tools Registry
title_full APE in the Wild: Automated Exploration of Proteomics Workflows in the bio.tools Registry
title_fullStr APE in the Wild: Automated Exploration of Proteomics Workflows in the bio.tools Registry
title_full_unstemmed APE in the Wild: Automated Exploration of Proteomics Workflows in the bio.tools Registry
title_short APE in the Wild: Automated Exploration of Proteomics Workflows in the bio.tools Registry
title_sort ape in the wild: automated exploration of proteomics workflows in the bio.tools registry
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8041394/
https://www.ncbi.nlm.nih.gov/pubmed/33720735
http://dx.doi.org/10.1021/acs.jproteome.0c00983
work_keys_str_mv AT kasalicavedran apeinthewildautomatedexplorationofproteomicsworkflowsinthebiotoolsregistry
AT schwammleveit apeinthewildautomatedexplorationofproteomicsworkflowsinthebiotoolsregistry
AT palmbladmagnus apeinthewildautomatedexplorationofproteomicsworkflowsinthebiotoolsregistry
AT isonjon apeinthewildautomatedexplorationofproteomicsworkflowsinthebiotoolsregistry
AT lamprechtannalena apeinthewildautomatedexplorationofproteomicsworkflowsinthebiotoolsregistry