Cargando…

Applications and methods utilizing the Simple Semantic Web Architecture and Protocol (SSWAP) for bioinformatics resource discovery and disparate data and service integration

BACKGROUND: Scientific data integration and computational service discovery are challenges for the bioinformatic community. This process is made more difficult by the separate and independent construction of biological databases, which makes the exchange of data between information resources difficu...

Descripción completa

Detalles Bibliográficos
Autores principales: Nelson, Rex T, Avraham, Shulamit, Shoemaker, Randy C, May, Gregory D, Ware, Doreen, Gessler, Damian DG
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2894815/
https://www.ncbi.nlm.nih.gov/pubmed/20525377
http://dx.doi.org/10.1186/1756-0381-3-3
_version_ 1782183218466783232
author Nelson, Rex T
Avraham, Shulamit
Shoemaker, Randy C
May, Gregory D
Ware, Doreen
Gessler, Damian DG
author_facet Nelson, Rex T
Avraham, Shulamit
Shoemaker, Randy C
May, Gregory D
Ware, Doreen
Gessler, Damian DG
author_sort Nelson, Rex T
collection PubMed
description BACKGROUND: Scientific data integration and computational service discovery are challenges for the bioinformatic community. This process is made more difficult by the separate and independent construction of biological databases, which makes the exchange of data between information resources difficult and labor intensive. A recently described semantic web protocol, the Simple Semantic Web Architecture and Protocol (SSWAP; pronounced "swap") offers the ability to describe data and services in a semantically meaningful way. We report how three major information resources (Gramene, SoyBase and the Legume Information System [LIS]) used SSWAP to semantically describe selected data and web services. METHODS: We selected high-priority Quantitative Trait Locus (QTL), genomic mapping, trait, phenotypic, and sequence data and associated services such as BLAST for publication, data retrieval, and service invocation via semantic web services. Data and services were mapped to concepts and categories as implemented in legacy and de novo community ontologies. We used SSWAP to express these offerings in OWL Web Ontology Language (OWL), Resource Description Framework (RDF) and eXtensible Markup Language (XML) documents, which are appropriate for their semantic discovery and retrieval. We implemented SSWAP services to respond to web queries and return data. These services are registered with the SSWAP Discovery Server and are available for semantic discovery at http://sswap.info. RESULTS: A total of ten services delivering QTL information from Gramene were created. From SoyBase, we created six services delivering information about soybean QTLs, and seven services delivering genetic locus information. For LIS we constructed three services, two of which allow the retrieval of DNA and RNA FASTA sequences with the third service providing nucleic acid sequence comparison capability (BLAST). CONCLUSIONS: The need for semantic integration technologies has preceded available solutions. We report the feasibility of mapping high priority data from local, independent, idiosyncratic data schemas to common shared concepts as implemented in web-accessible ontologies. These mappings are then amenable for use in semantic web services. Our implementation of approximately two dozen services means that biological data at three large information resources (Gramene, SoyBase, and LIS) is available for programmatic access, semantic searching, and enhanced interaction between the separate missions of these resources.
format Text
id pubmed-2894815
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28948152010-07-01 Applications and methods utilizing the Simple Semantic Web Architecture and Protocol (SSWAP) for bioinformatics resource discovery and disparate data and service integration Nelson, Rex T Avraham, Shulamit Shoemaker, Randy C May, Gregory D Ware, Doreen Gessler, Damian DG BioData Min Research BACKGROUND: Scientific data integration and computational service discovery are challenges for the bioinformatic community. This process is made more difficult by the separate and independent construction of biological databases, which makes the exchange of data between information resources difficult and labor intensive. A recently described semantic web protocol, the Simple Semantic Web Architecture and Protocol (SSWAP; pronounced "swap") offers the ability to describe data and services in a semantically meaningful way. We report how three major information resources (Gramene, SoyBase and the Legume Information System [LIS]) used SSWAP to semantically describe selected data and web services. METHODS: We selected high-priority Quantitative Trait Locus (QTL), genomic mapping, trait, phenotypic, and sequence data and associated services such as BLAST for publication, data retrieval, and service invocation via semantic web services. Data and services were mapped to concepts and categories as implemented in legacy and de novo community ontologies. We used SSWAP to express these offerings in OWL Web Ontology Language (OWL), Resource Description Framework (RDF) and eXtensible Markup Language (XML) documents, which are appropriate for their semantic discovery and retrieval. We implemented SSWAP services to respond to web queries and return data. These services are registered with the SSWAP Discovery Server and are available for semantic discovery at http://sswap.info. RESULTS: A total of ten services delivering QTL information from Gramene were created. From SoyBase, we created six services delivering information about soybean QTLs, and seven services delivering genetic locus information. For LIS we constructed three services, two of which allow the retrieval of DNA and RNA FASTA sequences with the third service providing nucleic acid sequence comparison capability (BLAST). CONCLUSIONS: The need for semantic integration technologies has preceded available solutions. We report the feasibility of mapping high priority data from local, independent, idiosyncratic data schemas to common shared concepts as implemented in web-accessible ontologies. These mappings are then amenable for use in semantic web services. Our implementation of approximately two dozen services means that biological data at three large information resources (Gramene, SoyBase, and LIS) is available for programmatic access, semantic searching, and enhanced interaction between the separate missions of these resources. BioMed Central 2010-06-04 /pmc/articles/PMC2894815/ /pubmed/20525377 http://dx.doi.org/10.1186/1756-0381-3-3 Text en Copyright ©2010 Nelson et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Nelson, Rex T
Avraham, Shulamit
Shoemaker, Randy C
May, Gregory D
Ware, Doreen
Gessler, Damian DG
Applications and methods utilizing the Simple Semantic Web Architecture and Protocol (SSWAP) for bioinformatics resource discovery and disparate data and service integration
title Applications and methods utilizing the Simple Semantic Web Architecture and Protocol (SSWAP) for bioinformatics resource discovery and disparate data and service integration
title_full Applications and methods utilizing the Simple Semantic Web Architecture and Protocol (SSWAP) for bioinformatics resource discovery and disparate data and service integration
title_fullStr Applications and methods utilizing the Simple Semantic Web Architecture and Protocol (SSWAP) for bioinformatics resource discovery and disparate data and service integration
title_full_unstemmed Applications and methods utilizing the Simple Semantic Web Architecture and Protocol (SSWAP) for bioinformatics resource discovery and disparate data and service integration
title_short Applications and methods utilizing the Simple Semantic Web Architecture and Protocol (SSWAP) for bioinformatics resource discovery and disparate data and service integration
title_sort applications and methods utilizing the simple semantic web architecture and protocol (sswap) for bioinformatics resource discovery and disparate data and service integration
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2894815/
https://www.ncbi.nlm.nih.gov/pubmed/20525377
http://dx.doi.org/10.1186/1756-0381-3-3
work_keys_str_mv AT nelsonrext applicationsandmethodsutilizingthesimplesemanticwebarchitectureandprotocolsswapforbioinformaticsresourcediscoveryanddisparatedataandserviceintegration
AT avrahamshulamit applicationsandmethodsutilizingthesimplesemanticwebarchitectureandprotocolsswapforbioinformaticsresourcediscoveryanddisparatedataandserviceintegration
AT shoemakerrandyc applicationsandmethodsutilizingthesimplesemanticwebarchitectureandprotocolsswapforbioinformaticsresourcediscoveryanddisparatedataandserviceintegration
AT maygregoryd applicationsandmethodsutilizingthesimplesemanticwebarchitectureandprotocolsswapforbioinformaticsresourcediscoveryanddisparatedataandserviceintegration
AT waredoreen applicationsandmethodsutilizingthesimplesemanticwebarchitectureandprotocolsswapforbioinformaticsresourcediscoveryanddisparatedataandserviceintegration
AT gesslerdamiandg applicationsandmethodsutilizingthesimplesemanticwebarchitectureandprotocolsswapforbioinformaticsresourcediscoveryanddisparatedataandserviceintegration