Cargando…
linkedISA: semantic representation of ISA-Tab experimental metadata
BACKGROUND: Reporting and sharing experimental metadata- such as the experimental design, characteristics of the samples, and procedures applied, along with the analysis results, in a standardised manner ensures that datasets are comprehensible and, in principle, reproducible, comparable and reusabl...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4255742/ https://www.ncbi.nlm.nih.gov/pubmed/25472428 http://dx.doi.org/10.1186/1471-2105-15-S14-S4 |
_version_ | 1782347481091145728 |
---|---|
author | González-Beltrán, Alejandra Maguire, Eamonn Sansone, Susanna-Assunta Rocca-Serra, Philippe |
author_facet | González-Beltrán, Alejandra Maguire, Eamonn Sansone, Susanna-Assunta Rocca-Serra, Philippe |
author_sort | González-Beltrán, Alejandra |
collection | PubMed |
description | BACKGROUND: Reporting and sharing experimental metadata- such as the experimental design, characteristics of the samples, and procedures applied, along with the analysis results, in a standardised manner ensures that datasets are comprehensible and, in principle, reproducible, comparable and reusable. Furthermore, sharing datasets in formats designed for consumption by humans and machines will also maximize their use. The Investigation/Study/Assay (ISA) open source metadata tracking framework facilitates standards-compliant collection, curation, visualization, storage and sharing of datasets, leveraging on other platforms to enable analysis and publication. The ISA software suite includes several components used in increasingly diverse set of life science and biomedical domains; it is underpinned by a general-purpose format, ISA-Tab, and conversions exist into formats required by public repositories. While ISA-Tab works well mainly as a human readable format, we have also implemented a linked data approach to semantically define the ISA-Tab syntax. RESULTS: We present a semantic web representation of the ISA-Tab syntax that complements ISA-Tab's syntactic interoperability with semantic interoperability. We introduce the linkedISA conversion tool from ISA-Tab to the Resource Description Framework (RDF), supporting mappings from the ISA syntax to multiple community-defined, open ontologies and capitalising on user-provided ontology annotations in the experimental metadata. We describe insights of the implementation and how annotations can be expanded driven by the metadata. We applied the conversion tool as part of Bio-GraphIIn, a web-based application supporting integration of the semantically-rich experimental descriptions. Designed in a user-friendly manner, the Bio-GraphIIn interface hides most of the complexities to the users, exposing a familiar tabular view of the experimental description to allow seamless interaction with the RDF representation, and visualising descriptors to drive the query over the semantic representation of the experimental design. In addition, we defined queries over the linkedISA RDF representation and demonstrated its use over the linkedISA conversion of datasets from Nature' Scientific Data online publication. CONCLUSIONS: Our linked data approach has allowed us to: 1) make the ISA-Tab semantics explicit and machine-processable, 2) exploit the existing ontology-based annotations in the ISA-Tab experimental descriptions, 3) augment the ISA-Tab syntax with new descriptive elements, 4) visualise and query elements related to the experimental design. Reasoning over ISA-Tab metadata and associated data will facilitate data integration and knowledge discovery. |
format | Online Article Text |
id | pubmed-4255742 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42557422014-12-05 linkedISA: semantic representation of ISA-Tab experimental metadata González-Beltrán, Alejandra Maguire, Eamonn Sansone, Susanna-Assunta Rocca-Serra, Philippe BMC Bioinformatics Research BACKGROUND: Reporting and sharing experimental metadata- such as the experimental design, characteristics of the samples, and procedures applied, along with the analysis results, in a standardised manner ensures that datasets are comprehensible and, in principle, reproducible, comparable and reusable. Furthermore, sharing datasets in formats designed for consumption by humans and machines will also maximize their use. The Investigation/Study/Assay (ISA) open source metadata tracking framework facilitates standards-compliant collection, curation, visualization, storage and sharing of datasets, leveraging on other platforms to enable analysis and publication. The ISA software suite includes several components used in increasingly diverse set of life science and biomedical domains; it is underpinned by a general-purpose format, ISA-Tab, and conversions exist into formats required by public repositories. While ISA-Tab works well mainly as a human readable format, we have also implemented a linked data approach to semantically define the ISA-Tab syntax. RESULTS: We present a semantic web representation of the ISA-Tab syntax that complements ISA-Tab's syntactic interoperability with semantic interoperability. We introduce the linkedISA conversion tool from ISA-Tab to the Resource Description Framework (RDF), supporting mappings from the ISA syntax to multiple community-defined, open ontologies and capitalising on user-provided ontology annotations in the experimental metadata. We describe insights of the implementation and how annotations can be expanded driven by the metadata. We applied the conversion tool as part of Bio-GraphIIn, a web-based application supporting integration of the semantically-rich experimental descriptions. Designed in a user-friendly manner, the Bio-GraphIIn interface hides most of the complexities to the users, exposing a familiar tabular view of the experimental description to allow seamless interaction with the RDF representation, and visualising descriptors to drive the query over the semantic representation of the experimental design. In addition, we defined queries over the linkedISA RDF representation and demonstrated its use over the linkedISA conversion of datasets from Nature' Scientific Data online publication. CONCLUSIONS: Our linked data approach has allowed us to: 1) make the ISA-Tab semantics explicit and machine-processable, 2) exploit the existing ontology-based annotations in the ISA-Tab experimental descriptions, 3) augment the ISA-Tab syntax with new descriptive elements, 4) visualise and query elements related to the experimental design. Reasoning over ISA-Tab metadata and associated data will facilitate data integration and knowledge discovery. BioMed Central 2014-11-27 /pmc/articles/PMC4255742/ /pubmed/25472428 http://dx.doi.org/10.1186/1471-2105-15-S14-S4 Text en Copyright © 2014 González-Beltrán et al.; licensee BioMed Central. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research González-Beltrán, Alejandra Maguire, Eamonn Sansone, Susanna-Assunta Rocca-Serra, Philippe linkedISA: semantic representation of ISA-Tab experimental metadata |
title | linkedISA: semantic representation of ISA-Tab experimental metadata |
title_full | linkedISA: semantic representation of ISA-Tab experimental metadata |
title_fullStr | linkedISA: semantic representation of ISA-Tab experimental metadata |
title_full_unstemmed | linkedISA: semantic representation of ISA-Tab experimental metadata |
title_short | linkedISA: semantic representation of ISA-Tab experimental metadata |
title_sort | linkedisa: semantic representation of isa-tab experimental metadata |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4255742/ https://www.ncbi.nlm.nih.gov/pubmed/25472428 http://dx.doi.org/10.1186/1471-2105-15-S14-S4 |
work_keys_str_mv | AT gonzalezbeltranalejandra linkedisasemanticrepresentationofisatabexperimentalmetadata AT maguireeamonn linkedisasemanticrepresentationofisatabexperimentalmetadata AT sansonesusannaassunta linkedisasemanticrepresentationofisatabexperimentalmetadata AT roccaserraphilippe linkedisasemanticrepresentationofisatabexperimentalmetadata |