
linkedISA: semantic representation of ISA-Tab experimental metadata

Bibliographic Details
Main Authors: González-Beltrán, Alejandra; Maguire, Eamonn; Sansone, Susanna-Assunta; Rocca-Serra, Philippe
Format: Online Article Text
Language: English
Published: BioMed Central 2014
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4255742/
https://www.ncbi.nlm.nih.gov/pubmed/25472428
http://dx.doi.org/10.1186/1471-2105-15-S14-S4
author González-Beltrán, Alejandra
Maguire, Eamonn
Sansone, Susanna-Assunta
Rocca-Serra, Philippe
collection PubMed
description BACKGROUND: Reporting and sharing experimental metadata, such as the experimental design, characteristics of the samples, and procedures applied, along with the analysis results, in a standardised manner ensures that datasets are comprehensible and, in principle, reproducible, comparable and reusable. Furthermore, sharing datasets in formats designed for consumption by humans and machines will also maximize their use. The Investigation/Study/Assay (ISA) open source metadata tracking framework facilitates standards-compliant collection, curation, visualization, storage and sharing of datasets, leveraging other platforms to enable analysis and publication. The ISA software suite includes several components used in an increasingly diverse set of life science and biomedical domains; it is underpinned by a general-purpose format, ISA-Tab, and conversions exist into formats required by public repositories. While ISA-Tab works well mainly as a human-readable format, we have also implemented a linked data approach to semantically define the ISA-Tab syntax.
RESULTS: We present a semantic web representation of the ISA-Tab syntax that complements ISA-Tab's syntactic interoperability with semantic interoperability. We introduce the linkedISA conversion tool from ISA-Tab to the Resource Description Framework (RDF), supporting mappings from the ISA syntax to multiple community-defined, open ontologies and capitalising on user-provided ontology annotations in the experimental metadata. We describe insights into the implementation and how annotations can be expanded, driven by the metadata. We applied the conversion tool as part of Bio-GraphIIn, a web-based application supporting integration of the semantically rich experimental descriptions. Designed in a user-friendly manner, the Bio-GraphIIn interface hides most of the complexities from the users, exposing a familiar tabular view of the experimental description to allow seamless interaction with the RDF representation, and visualising descriptors to drive the query over the semantic representation of the experimental design. In addition, we defined queries over the linkedISA RDF representation and demonstrated its use over the linkedISA conversion of datasets from Nature's Scientific Data online publication.
CONCLUSIONS: Our linked data approach has allowed us to: 1) make the ISA-Tab semantics explicit and machine-processable, 2) exploit the existing ontology-based annotations in the ISA-Tab experimental descriptions, 3) augment the ISA-Tab syntax with new descriptive elements, 4) visualise and query elements related to the experimental design. Reasoning over ISA-Tab metadata and associated data will facilitate data integration and knowledge discovery.
format Online
Article
Text
id pubmed-4255742
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4255742 2014-12-05 linkedISA: semantic representation of ISA-Tab experimental metadata González-Beltrán, Alejandra; Maguire, Eamonn; Sansone, Susanna-Assunta; Rocca-Serra, Philippe BMC Bioinformatics Research BioMed Central 2014-11-27 /pmc/articles/PMC4255742/ /pubmed/25472428 http://dx.doi.org/10.1186/1471-2105-15-S14-S4 Text en Copyright © 2014 González-Beltrán et al.; licensee BioMed Central. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title linkedISA: semantic representation of ISA-Tab experimental metadata
topic Research