Cargando…

Semantic web data warehousing for caGrid

The National Cancer Institute (NCI) is developing caGrid as a means for sharing cancer-related data and services. As more data sets become available on caGrid, we need effective ways of accessing and integrating this information. Although the data models exposed on caGrid are semantically well annot...

Descripción completa

Detalles Bibliográficos
Autores principales: McCusker, Jamie P, Phillips, Joshua A, Beltrán, Alejandra González, Finkelstein, Anthony, Krauthammer, Michael
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2755823/
https://www.ncbi.nlm.nih.gov/pubmed/19796399
http://dx.doi.org/10.1186/1471-2105-10-S10-S2
_version_ 1782172473055248384
author McCusker, Jamie P
Phillips, Joshua A
Beltrán, Alejandra González
Finkelstein, Anthony
Krauthammer, Michael
author_facet McCusker, Jamie P
Phillips, Joshua A
Beltrán, Alejandra González
Finkelstein, Anthony
Krauthammer, Michael
author_sort McCusker, Jamie P
collection PubMed
description The National Cancer Institute (NCI) is developing caGrid as a means for sharing cancer-related data and services. As more data sets become available on caGrid, we need effective ways of accessing and integrating this information. Although the data models exposed on caGrid are semantically well annotated, it is currently up to the caGrid client to infer relationships between the different models and their classes. In this paper, we present a Semantic Web-based data warehouse (Corvus) for creating relationships among caGrid models. This is accomplished through the transformation of semantically-annotated caBIG(®) Unified Modeling Language (UML) information models into Web Ontology Language (OWL) ontologies that preserve those semantics. We demonstrate the validity of the approach by Semantic Extraction, Transformation and Loading (SETL) of data from two caGrid data sources, caTissue and caArray, as well as alignment and query of those sources in Corvus. We argue that semantic integration is necessary for integration of data from distributed web services and that Corvus is a useful way of accomplishing this. Our approach is generalizable and of broad utility to researchers facing similar integration challenges. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-10-S10-S2) contains supplementary material, which is available to authorized users.
format Text
id pubmed-2755823
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27558232009-10-03 Semantic web data warehousing for caGrid McCusker, Jamie P Phillips, Joshua A Beltrán, Alejandra González Finkelstein, Anthony Krauthammer, Michael BMC Bioinformatics Research The National Cancer Institute (NCI) is developing caGrid as a means for sharing cancer-related data and services. As more data sets become available on caGrid, we need effective ways of accessing and integrating this information. Although the data models exposed on caGrid are semantically well annotated, it is currently up to the caGrid client to infer relationships between the different models and their classes. In this paper, we present a Semantic Web-based data warehouse (Corvus) for creating relationships among caGrid models. This is accomplished through the transformation of semantically-annotated caBIG(®) Unified Modeling Language (UML) information models into Web Ontology Language (OWL) ontologies that preserve those semantics. We demonstrate the validity of the approach by Semantic Extraction, Transformation and Loading (SETL) of data from two caGrid data sources, caTissue and caArray, as well as alignment and query of those sources in Corvus. We argue that semantic integration is necessary for integration of data from distributed web services and that Corvus is a useful way of accomplishing this. Our approach is generalizable and of broad utility to researchers facing similar integration challenges. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-10-S10-S2) contains supplementary material, which is available to authorized users. BioMed Central 2009-10-01 /pmc/articles/PMC2755823/ /pubmed/19796399 http://dx.doi.org/10.1186/1471-2105-10-S10-S2 Text en © McCusker et al; licensee BioMed Central Ltd. 2009 https://creativecommons.org/licenses/by/2.0/This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0 (https://creativecommons.org/licenses/by/2.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. https://creativecommons.org/licenses/by/2.0/ Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 (https://creativecommons.org/licenses/by/2.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
McCusker, Jamie P
Phillips, Joshua A
Beltrán, Alejandra González
Finkelstein, Anthony
Krauthammer, Michael
Semantic web data warehousing for caGrid
title Semantic web data warehousing for caGrid
title_full Semantic web data warehousing for caGrid
title_fullStr Semantic web data warehousing for caGrid
title_full_unstemmed Semantic web data warehousing for caGrid
title_short Semantic web data warehousing for caGrid
title_sort semantic web data warehousing for cagrid
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2755823/
https://www.ncbi.nlm.nih.gov/pubmed/19796399
http://dx.doi.org/10.1186/1471-2105-10-S10-S2
work_keys_str_mv AT mccuskerjamiep semanticwebdatawarehousingforcagrid
AT phillipsjoshuaa semanticwebdatawarehousingforcagrid
AT beltranalejandragonzalez semanticwebdatawarehousingforcagrid
AT finkelsteinanthony semanticwebdatawarehousingforcagrid
AT krauthammermichael semanticwebdatawarehousingforcagrid