Cargando…

A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL

The increasing use of Semantic Web technologies in the life sciences, in particular the use of the Resource Description Framework (RDF) and the RDF query language SPARQL, opens the path for novel integrative analyses, combining information from multiple data sources. However, analyzing evolutionary...

Descripción completa

Detalles Bibliográficos
Autores principales: Sima, Ana Claudia, Dessimoz, Christophe, Stockinger, Kurt, Zahn-Zabal, Monique, Mendes de Farias, Tarcisio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324951/
https://www.ncbi.nlm.nih.gov/pubmed/32612807
http://dx.doi.org/10.12688/f1000research.21027.2
_version_ 1783552058762199040
author Sima, Ana Claudia
Dessimoz, Christophe
Stockinger, Kurt
Zahn-Zabal, Monique
Mendes de Farias, Tarcisio
author_facet Sima, Ana Claudia
Dessimoz, Christophe
Stockinger, Kurt
Zahn-Zabal, Monique
Mendes de Farias, Tarcisio
author_sort Sima, Ana Claudia
collection PubMed
description The increasing use of Semantic Web technologies in the life sciences, in particular the use of the Resource Description Framework (RDF) and the RDF query language SPARQL, opens the path for novel integrative analyses, combining information from multiple data sources. However, analyzing evolutionary data in RDF is not trivial, due to the steep learning curve required to understand both the data models adopted by different RDF data sources, as well as the equivalent SPARQL constructs required to benefit from this data – in particular, recursive property paths. In this article, we provide a hands-on introduction to querying evolutionary data across several data sources that publish orthology information in RDF, namely: The Orthologous MAtrix (OMA), the European Bioinformatics Institute (EBI) RDF platform, the Database of Orthologous Groups (OrthoDB) and the Microbial Genome Database (MBGD). We present four protocols in increasing order of complexity. In these protocols, we demonstrate through SPARQL queries how to retrieve pairwise orthologs, homologous groups, and hierarchical orthologous groups. Finally, we show how orthology information in different data sources can be compared, through the use of federated SPARQL queries.
format Online
Article
Text
id pubmed-7324951
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-73249512020-06-30 A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL Sima, Ana Claudia Dessimoz, Christophe Stockinger, Kurt Zahn-Zabal, Monique Mendes de Farias, Tarcisio F1000Res Method Article The increasing use of Semantic Web technologies in the life sciences, in particular the use of the Resource Description Framework (RDF) and the RDF query language SPARQL, opens the path for novel integrative analyses, combining information from multiple data sources. However, analyzing evolutionary data in RDF is not trivial, due to the steep learning curve required to understand both the data models adopted by different RDF data sources, as well as the equivalent SPARQL constructs required to benefit from this data – in particular, recursive property paths. In this article, we provide a hands-on introduction to querying evolutionary data across several data sources that publish orthology information in RDF, namely: The Orthologous MAtrix (OMA), the European Bioinformatics Institute (EBI) RDF platform, the Database of Orthologous Groups (OrthoDB) and the Microbial Genome Database (MBGD). We present four protocols in increasing order of complexity. In these protocols, we demonstrate through SPARQL queries how to retrieve pairwise orthologs, homologous groups, and hierarchical orthologous groups. Finally, we show how orthology information in different data sources can be compared, through the use of federated SPARQL queries. F1000 Research Limited 2020-07-22 /pmc/articles/PMC7324951/ /pubmed/32612807 http://dx.doi.org/10.12688/f1000research.21027.2 Text en Copyright: © 2020 Sima AC et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Method Article
Sima, Ana Claudia
Dessimoz, Christophe
Stockinger, Kurt
Zahn-Zabal, Monique
Mendes de Farias, Tarcisio
A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL
title A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL
title_full A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL
title_fullStr A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL
title_full_unstemmed A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL
title_short A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL
title_sort hands-on introduction to querying evolutionary relationships across multiple data sources using sparql
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324951/
https://www.ncbi.nlm.nih.gov/pubmed/32612807
http://dx.doi.org/10.12688/f1000research.21027.2
work_keys_str_mv AT simaanaclaudia ahandsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql
AT dessimozchristophe ahandsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql
AT stockingerkurt ahandsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql
AT zahnzabalmonique ahandsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql
AT mendesdefariastarcisio ahandsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql
AT simaanaclaudia handsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql
AT dessimozchristophe handsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql
AT stockingerkurt handsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql
AT zahnzabalmonique handsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql
AT mendesdefariastarcisio handsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql