Cargando…

Semantics-based composition of EMBOSS services

BACKGROUND: More than in other domains the heterogeneous services world in bioinformatics demands for a methodology to classify and relate resources in a both human and machine accessible manner. The Semantic Web, which is meant to address exactly this challenge, is currently one of the most ambitio...

Descripción completa

Detalles Bibliográficos
Autores principales: Lamprecht, Anna-Lena, Naujokat, Stefan, Margaria, Tiziana, Steffen, Bernhard
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3105497/
https://www.ncbi.nlm.nih.gov/pubmed/21388574
http://dx.doi.org/10.1186/2041-1480-2-S1-S5
_version_ 1782204720750788608
author Lamprecht, Anna-Lena
Naujokat, Stefan
Margaria, Tiziana
Steffen, Bernhard
author_facet Lamprecht, Anna-Lena
Naujokat, Stefan
Margaria, Tiziana
Steffen, Bernhard
author_sort Lamprecht, Anna-Lena
collection PubMed
description BACKGROUND: More than in other domains the heterogeneous services world in bioinformatics demands for a methodology to classify and relate resources in a both human and machine accessible manner. The Semantic Web, which is meant to address exactly this challenge, is currently one of the most ambitious projects in computer science. Collective efforts within the community have already led to a basis of standards for semantic service descriptions and meta-information. In combination with process synthesis and planning methods, such knowledge about types and services can facilitate the automatic composition of workflows for particular research questions. RESULTS: In this study we apply the synthesis methodology that is available in the Bio-jETI workflow management framework for the semantics-based composition of EMBOSS services. EMBOSS (European Molecular Biology Open Software Suite) is a collection of 350 tools (March 2010) for various sequence analysis tasks, and thus a rich source of services and types that imply comprehensive domain models for planning and synthesis approaches. We use and compare two different setups of our EMBOSS synthesis domain: 1) a manually defined domain setup where an intuitive, high-level, semantically meaningful nomenclature is applied to describe the input/output behavior of the single EMBOSS tools and their classifications, and 2) a domain setup where this information has been automatically derived from the EMBOSS Ajax Command Definition (ACD) files and the EMBRACE Data and Methods ontology (EDAM). Our experiments demonstrate that these domain models in combination with our synthesis methodology greatly simplify working with the large, heterogeneous, and hence manually intractable EMBOSS collection. However, they also show that with the information that can be derived from the (current) ACD files and EDAM ontology alone, some essential connections between services can not be recognized. CONCLUSIONS: Our results show that adequate domain modeling requires to incorporate as much domain knowledge as possible, far beyond the mere technical aspects of the different types and services. Finding or defining semantically appropriate service and type descriptions is a difficult task, but the bioinformatics community appears to be on the right track towards a Life Science Semantic Web, which will eventually allow automatic service composition methods to unfold their full potential.
format Text
id pubmed-3105497
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31054972011-06-02 Semantics-based composition of EMBOSS services Lamprecht, Anna-Lena Naujokat, Stefan Margaria, Tiziana Steffen, Bernhard J Biomed Semantics Research BACKGROUND: More than in other domains the heterogeneous services world in bioinformatics demands for a methodology to classify and relate resources in a both human and machine accessible manner. The Semantic Web, which is meant to address exactly this challenge, is currently one of the most ambitious projects in computer science. Collective efforts within the community have already led to a basis of standards for semantic service descriptions and meta-information. In combination with process synthesis and planning methods, such knowledge about types and services can facilitate the automatic composition of workflows for particular research questions. RESULTS: In this study we apply the synthesis methodology that is available in the Bio-jETI workflow management framework for the semantics-based composition of EMBOSS services. EMBOSS (European Molecular Biology Open Software Suite) is a collection of 350 tools (March 2010) for various sequence analysis tasks, and thus a rich source of services and types that imply comprehensive domain models for planning and synthesis approaches. We use and compare two different setups of our EMBOSS synthesis domain: 1) a manually defined domain setup where an intuitive, high-level, semantically meaningful nomenclature is applied to describe the input/output behavior of the single EMBOSS tools and their classifications, and 2) a domain setup where this information has been automatically derived from the EMBOSS Ajax Command Definition (ACD) files and the EMBRACE Data and Methods ontology (EDAM). Our experiments demonstrate that these domain models in combination with our synthesis methodology greatly simplify working with the large, heterogeneous, and hence manually intractable EMBOSS collection. However, they also show that with the information that can be derived from the (current) ACD files and EDAM ontology alone, some essential connections between services can not be recognized. CONCLUSIONS: Our results show that adequate domain modeling requires to incorporate as much domain knowledge as possible, far beyond the mere technical aspects of the different types and services. Finding or defining semantically appropriate service and type descriptions is a difficult task, but the bioinformatics community appears to be on the right track towards a Life Science Semantic Web, which will eventually allow automatic service composition methods to unfold their full potential. BioMed Central 2011-03-07 /pmc/articles/PMC3105497/ /pubmed/21388574 http://dx.doi.org/10.1186/2041-1480-2-S1-S5 Text en Copyright ©2011 Lamprecht et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Lamprecht, Anna-Lena
Naujokat, Stefan
Margaria, Tiziana
Steffen, Bernhard
Semantics-based composition of EMBOSS services
title Semantics-based composition of EMBOSS services
title_full Semantics-based composition of EMBOSS services
title_fullStr Semantics-based composition of EMBOSS services
title_full_unstemmed Semantics-based composition of EMBOSS services
title_short Semantics-based composition of EMBOSS services
title_sort semantics-based composition of emboss services
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3105497/
https://www.ncbi.nlm.nih.gov/pubmed/21388574
http://dx.doi.org/10.1186/2041-1480-2-S1-S5
work_keys_str_mv AT lamprechtannalena semanticsbasedcompositionofembossservices
AT naujokatstefan semanticsbasedcompositionofembossservices
AT margariatiziana semanticsbasedcompositionofembossservices
AT steffenbernhard semanticsbasedcompositionofembossservices