Cargando…
A web services choreography scenario for interoperating bioinformatics applications
BACKGROUND: Very often genome-wide data analysis requires the interoperation of multiple databases and analytic tools. A large number of genome databases and bioinformatics applications are available through the web, but it is difficult to automate interoperation because: 1) the platforms on which t...
Autores principales: | , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2004
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC394315/ https://www.ncbi.nlm.nih.gov/pubmed/15113410 http://dx.doi.org/10.1186/1471-2105-5-25 |
_version_ | 1782121310978048000 |
---|---|
author | de Knikker, Remko Guo, Youjun Li, Jin-long Kwan, Albert KH Yip, Kevin Y Cheung, David W Cheung, Kei-Hoi |
author_facet | de Knikker, Remko Guo, Youjun Li, Jin-long Kwan, Albert KH Yip, Kevin Y Cheung, David W Cheung, Kei-Hoi |
author_sort | de Knikker, Remko |
collection | PubMed |
description | BACKGROUND: Very often genome-wide data analysis requires the interoperation of multiple databases and analytic tools. A large number of genome databases and bioinformatics applications are available through the web, but it is difficult to automate interoperation because: 1) the platforms on which the applications run are heterogeneous, 2) their web interface is not machine-friendly, 3) they use a non-standard format for data input and output, 4) they do not exploit standards to define application interface and message exchange, and 5) existing protocols for remote messaging are often not firewall-friendly. To overcome these issues, web services have emerged as a standard XML-based model for message exchange between heterogeneous applications. Web services engines have been developed to manage the configuration and execution of a web services workflow. RESULTS: To demonstrate the benefit of using web services over traditional web interfaces, we compare the two implementations of HAPI, a gene expression analysis utility developed by the University of California San Diego (UCSD) that allows visual characterization of groups or clusters of genes based on the biomedical literature. This utility takes a set of microarray spot IDs as input and outputs a hierarchy of MeSH Keywords that correlates to the input and is grouped by Medical Subject Heading (MeSH) category. While the HTML output is easy for humans to visualize, it is difficult for computer applications to interpret semantically. To facilitate the capability of machine processing, we have created a workflow of three web services that replicates the HAPI functionality. These web services use document-style messages, which means that messages are encoded in an XML-based format. We compared three approaches to the implementation of an XML-based workflow: a hard coded Java application, Collaxa BPEL Server and Taverna Workbench. The Java program functions as a web services engine and interoperates with these web services using a web services choreography language (BPEL4WS). CONCLUSION: While it is relatively straightforward to implement and publish web services, the use of web services choreography engines is still in its infancy. However, industry-wide support and push for web services standards is quickly increasing the chance of success in using web services to unify heterogeneous bioinformatics applications. Due to the immaturity of currently available web services engines, it is still most practical to implement a simple, ad-hoc XML-based workflow by hard coding the workflow as a Java application. For advanced web service users the Collaxa BPEL engine facilitates a configuration and management environment that can fully handle XML-based workflow. |
format | Text |
id | pubmed-394315 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2004 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-3943152004-04-22 A web services choreography scenario for interoperating bioinformatics applications de Knikker, Remko Guo, Youjun Li, Jin-long Kwan, Albert KH Yip, Kevin Y Cheung, David W Cheung, Kei-Hoi BMC Bioinformatics Research Article BACKGROUND: Very often genome-wide data analysis requires the interoperation of multiple databases and analytic tools. A large number of genome databases and bioinformatics applications are available through the web, but it is difficult to automate interoperation because: 1) the platforms on which the applications run are heterogeneous, 2) their web interface is not machine-friendly, 3) they use a non-standard format for data input and output, 4) they do not exploit standards to define application interface and message exchange, and 5) existing protocols for remote messaging are often not firewall-friendly. To overcome these issues, web services have emerged as a standard XML-based model for message exchange between heterogeneous applications. Web services engines have been developed to manage the configuration and execution of a web services workflow. RESULTS: To demonstrate the benefit of using web services over traditional web interfaces, we compare the two implementations of HAPI, a gene expression analysis utility developed by the University of California San Diego (UCSD) that allows visual characterization of groups or clusters of genes based on the biomedical literature. This utility takes a set of microarray spot IDs as input and outputs a hierarchy of MeSH Keywords that correlates to the input and is grouped by Medical Subject Heading (MeSH) category. While the HTML output is easy for humans to visualize, it is difficult for computer applications to interpret semantically. To facilitate the capability of machine processing, we have created a workflow of three web services that replicates the HAPI functionality. These web services use document-style messages, which means that messages are encoded in an XML-based format. We compared three approaches to the implementation of an XML-based workflow: a hard coded Java application, Collaxa BPEL Server and Taverna Workbench. The Java program functions as a web services engine and interoperates with these web services using a web services choreography language (BPEL4WS). CONCLUSION: While it is relatively straightforward to implement and publish web services, the use of web services choreography engines is still in its infancy. However, industry-wide support and push for web services standards is quickly increasing the chance of success in using web services to unify heterogeneous bioinformatics applications. Due to the immaturity of currently available web services engines, it is still most practical to implement a simple, ad-hoc XML-based workflow by hard coding the workflow as a Java application. For advanced web service users the Collaxa BPEL engine facilitates a configuration and management environment that can fully handle XML-based workflow. BioMed Central 2004-03-10 /pmc/articles/PMC394315/ /pubmed/15113410 http://dx.doi.org/10.1186/1471-2105-5-25 Text en Copyright © 2004 de Knikker et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL. |
spellingShingle | Research Article de Knikker, Remko Guo, Youjun Li, Jin-long Kwan, Albert KH Yip, Kevin Y Cheung, David W Cheung, Kei-Hoi A web services choreography scenario for interoperating bioinformatics applications |
title | A web services choreography scenario for interoperating bioinformatics applications |
title_full | A web services choreography scenario for interoperating bioinformatics applications |
title_fullStr | A web services choreography scenario for interoperating bioinformatics applications |
title_full_unstemmed | A web services choreography scenario for interoperating bioinformatics applications |
title_short | A web services choreography scenario for interoperating bioinformatics applications |
title_sort | web services choreography scenario for interoperating bioinformatics applications |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC394315/ https://www.ncbi.nlm.nih.gov/pubmed/15113410 http://dx.doi.org/10.1186/1471-2105-5-25 |
work_keys_str_mv | AT deknikkerremko awebserviceschoreographyscenarioforinteroperatingbioinformaticsapplications AT guoyoujun awebserviceschoreographyscenarioforinteroperatingbioinformaticsapplications AT lijinlong awebserviceschoreographyscenarioforinteroperatingbioinformaticsapplications AT kwanalbertkh awebserviceschoreographyscenarioforinteroperatingbioinformaticsapplications AT yipkeviny awebserviceschoreographyscenarioforinteroperatingbioinformaticsapplications AT cheungdavidw awebserviceschoreographyscenarioforinteroperatingbioinformaticsapplications AT cheungkeihoi awebserviceschoreographyscenarioforinteroperatingbioinformaticsapplications AT deknikkerremko webserviceschoreographyscenarioforinteroperatingbioinformaticsapplications AT guoyoujun webserviceschoreographyscenarioforinteroperatingbioinformaticsapplications AT lijinlong webserviceschoreographyscenarioforinteroperatingbioinformaticsapplications AT kwanalbertkh webserviceschoreographyscenarioforinteroperatingbioinformaticsapplications AT yipkeviny webserviceschoreographyscenarioforinteroperatingbioinformaticsapplications AT cheungdavidw webserviceschoreographyscenarioforinteroperatingbioinformaticsapplications AT cheungkeihoi webserviceschoreographyscenarioforinteroperatingbioinformaticsapplications |