Cargando…

Semantic Web repositories for genomics data using the eXframe platform

BACKGROUND: With the advent of inexpensive assay technologies, there has been an unprecedented growth in genomics data as well as the number of databases in which it is stored. In these databases, sample annotation using ontologies and controlled vocabularies is becoming more common. However, the an...

Descripción completa

Detalles Bibliográficos
Autores principales: Merrill, Emily, Corlosquet, Stéphane, Ciccarese, Paolo, Clark, Tim, Das, Sudeshna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4108874/
https://www.ncbi.nlm.nih.gov/pubmed/25093072
http://dx.doi.org/10.1186/2041-1480-5-S1-S3
_version_ 1782327801815236608
author Merrill, Emily
Corlosquet, Stéphane
Ciccarese, Paolo
Clark, Tim
Das, Sudeshna
author_facet Merrill, Emily
Corlosquet, Stéphane
Ciccarese, Paolo
Clark, Tim
Das, Sudeshna
author_sort Merrill, Emily
collection PubMed
description BACKGROUND: With the advent of inexpensive assay technologies, there has been an unprecedented growth in genomics data as well as the number of databases in which it is stored. In these databases, sample annotation using ontologies and controlled vocabularies is becoming more common. However, the annotation is rarely available as Linked Data, in a machine-readable format, or for standardized queries using SPARQL. This makes large-scale reuse, or integration with other knowledge bases very difficult. METHODS: To address this challenge, we have developed the second generation of our eXframe platform, a reusable framework for creating online repositories of genomics experiments. This second generation model now publishes Semantic Web data. To accomplish this, we created an experiment model that covers provenance, citations, external links, assays, biomaterials used in the experiment, and the data collected during the process. The elements of our model are mapped to classes and properties from various established biomedical ontologies. Resource Description Framework (RDF) data is automatically produced using these mappings and indexed in an RDF store with a built-in Sparql Protocol and RDF Query Language (SPARQL) endpoint. CONCLUSIONS: Using the open-source eXframe software, institutions and laboratories can create Semantic Web repositories of their experiments, integrate it with heterogeneous resources and make it interoperable with the vast Semantic Web of biomedical knowledge.
format Online
Article
Text
id pubmed-4108874
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-41088742014-08-04 Semantic Web repositories for genomics data using the eXframe platform Merrill, Emily Corlosquet, Stéphane Ciccarese, Paolo Clark, Tim Das, Sudeshna J Biomed Semantics Proceedings BACKGROUND: With the advent of inexpensive assay technologies, there has been an unprecedented growth in genomics data as well as the number of databases in which it is stored. In these databases, sample annotation using ontologies and controlled vocabularies is becoming more common. However, the annotation is rarely available as Linked Data, in a machine-readable format, or for standardized queries using SPARQL. This makes large-scale reuse, or integration with other knowledge bases very difficult. METHODS: To address this challenge, we have developed the second generation of our eXframe platform, a reusable framework for creating online repositories of genomics experiments. This second generation model now publishes Semantic Web data. To accomplish this, we created an experiment model that covers provenance, citations, external links, assays, biomaterials used in the experiment, and the data collected during the process. The elements of our model are mapped to classes and properties from various established biomedical ontologies. Resource Description Framework (RDF) data is automatically produced using these mappings and indexed in an RDF store with a built-in Sparql Protocol and RDF Query Language (SPARQL) endpoint. CONCLUSIONS: Using the open-source eXframe software, institutions and laboratories can create Semantic Web repositories of their experiments, integrate it with heterogeneous resources and make it interoperable with the vast Semantic Web of biomedical knowledge. BioMed Central 2014-06-03 /pmc/articles/PMC4108874/ /pubmed/25093072 http://dx.doi.org/10.1186/2041-1480-5-S1-S3 Text en Copyright © 2014 Merrill et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Merrill, Emily
Corlosquet, Stéphane
Ciccarese, Paolo
Clark, Tim
Das, Sudeshna
Semantic Web repositories for genomics data using the eXframe platform
title Semantic Web repositories for genomics data using the eXframe platform
title_full Semantic Web repositories for genomics data using the eXframe platform
title_fullStr Semantic Web repositories for genomics data using the eXframe platform
title_full_unstemmed Semantic Web repositories for genomics data using the eXframe platform
title_short Semantic Web repositories for genomics data using the eXframe platform
title_sort semantic web repositories for genomics data using the exframe platform
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4108874/
https://www.ncbi.nlm.nih.gov/pubmed/25093072
http://dx.doi.org/10.1186/2041-1480-5-S1-S3
work_keys_str_mv AT merrillemily semanticwebrepositoriesforgenomicsdatausingtheexframeplatform
AT corlosquetstephane semanticwebrepositoriesforgenomicsdatausingtheexframeplatform
AT ciccaresepaolo semanticwebrepositoriesforgenomicsdatausingtheexframeplatform
AT clarktim semanticwebrepositoriesforgenomicsdatausingtheexframeplatform
AT dassudeshna semanticwebrepositoriesforgenomicsdatausingtheexframeplatform