Cargando…

XML schemas for common bioinformatic data types and their application in workflow systems

BACKGROUND: Today, there is a growing need in bioinformatics to combine available software tools into chains, thus building complex applications from existing single-task tools. To create such workflows, the tools involved have to be able to work with each other's data – therefore, a common set...

Descripción completa

Detalles Bibliográficos
Autores principales: Seibel, Philipp N, Krüger, Jan, Hartmeier, Sven, Schwarzer, Knut, Löwenthal, Kai, Mersch, Henning, Dandekar, Thomas, Giegerich, Robert
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2001303/
https://www.ncbi.nlm.nih.gov/pubmed/17087823
http://dx.doi.org/10.1186/1471-2105-7-490
_version_ 1782135580277080064
author Seibel, Philipp N
Krüger, Jan
Hartmeier, Sven
Schwarzer, Knut
Löwenthal, Kai
Mersch, Henning
Dandekar, Thomas
Giegerich, Robert
author_facet Seibel, Philipp N
Krüger, Jan
Hartmeier, Sven
Schwarzer, Knut
Löwenthal, Kai
Mersch, Henning
Dandekar, Thomas
Giegerich, Robert
author_sort Seibel, Philipp N
collection PubMed
description BACKGROUND: Today, there is a growing need in bioinformatics to combine available software tools into chains, thus building complex applications from existing single-task tools. To create such workflows, the tools involved have to be able to work with each other's data – therefore, a common set of well-defined data formats is needed. Unfortunately, current bioinformatic tools use a great variety of heterogeneous formats. RESULTS: Acknowledging the need for common formats, the Helmholtz Open BioInformatics Technology network (HOBIT) identified several basic data types used in bioinformatics and developed appropriate format descriptions, formally defined by XML schemas, and incorporated them in a Java library (BioDOM). These schemas currently cover sequence, sequence alignment, RNA secondary structure and RNA secondary structure alignment formats in a form that is independent of any specific program, thus enabling seamless interoperation of different tools. All XML formats are available at , the BioDOM library can be obtained at . CONCLUSION: The HOBIT XML schemas and the BioDOM library simplify adding XML support to newly created and existing bioinformatic tools, enabling these tools to interoperate seamlessly in workflow scenarios.
format Text
id pubmed-2001303
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-20013032007-10-10 XML schemas for common bioinformatic data types and their application in workflow systems Seibel, Philipp N Krüger, Jan Hartmeier, Sven Schwarzer, Knut Löwenthal, Kai Mersch, Henning Dandekar, Thomas Giegerich, Robert BMC Bioinformatics Software BACKGROUND: Today, there is a growing need in bioinformatics to combine available software tools into chains, thus building complex applications from existing single-task tools. To create such workflows, the tools involved have to be able to work with each other's data – therefore, a common set of well-defined data formats is needed. Unfortunately, current bioinformatic tools use a great variety of heterogeneous formats. RESULTS: Acknowledging the need for common formats, the Helmholtz Open BioInformatics Technology network (HOBIT) identified several basic data types used in bioinformatics and developed appropriate format descriptions, formally defined by XML schemas, and incorporated them in a Java library (BioDOM). These schemas currently cover sequence, sequence alignment, RNA secondary structure and RNA secondary structure alignment formats in a form that is independent of any specific program, thus enabling seamless interoperation of different tools. All XML formats are available at , the BioDOM library can be obtained at . CONCLUSION: The HOBIT XML schemas and the BioDOM library simplify adding XML support to newly created and existing bioinformatic tools, enabling these tools to interoperate seamlessly in workflow scenarios. BioMed Central 2006-11-06 /pmc/articles/PMC2001303/ /pubmed/17087823 http://dx.doi.org/10.1186/1471-2105-7-490 Text en Copyright © 2006 Seibel et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Seibel, Philipp N
Krüger, Jan
Hartmeier, Sven
Schwarzer, Knut
Löwenthal, Kai
Mersch, Henning
Dandekar, Thomas
Giegerich, Robert
XML schemas for common bioinformatic data types and their application in workflow systems
title XML schemas for common bioinformatic data types and their application in workflow systems
title_full XML schemas for common bioinformatic data types and their application in workflow systems
title_fullStr XML schemas for common bioinformatic data types and their application in workflow systems
title_full_unstemmed XML schemas for common bioinformatic data types and their application in workflow systems
title_short XML schemas for common bioinformatic data types and their application in workflow systems
title_sort xml schemas for common bioinformatic data types and their application in workflow systems
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2001303/
https://www.ncbi.nlm.nih.gov/pubmed/17087823
http://dx.doi.org/10.1186/1471-2105-7-490
work_keys_str_mv AT seibelphilippn xmlschemasforcommonbioinformaticdatatypesandtheirapplicationinworkflowsystems
AT krugerjan xmlschemasforcommonbioinformaticdatatypesandtheirapplicationinworkflowsystems
AT hartmeiersven xmlschemasforcommonbioinformaticdatatypesandtheirapplicationinworkflowsystems
AT schwarzerknut xmlschemasforcommonbioinformaticdatatypesandtheirapplicationinworkflowsystems
AT lowenthalkai xmlschemasforcommonbioinformaticdatatypesandtheirapplicationinworkflowsystems
AT merschhenning xmlschemasforcommonbioinformaticdatatypesandtheirapplicationinworkflowsystems
AT dandekarthomas xmlschemasforcommonbioinformaticdatatypesandtheirapplicationinworkflowsystems
AT giegerichrobert xmlschemasforcommonbioinformaticdatatypesandtheirapplicationinworkflowsystems