Cargando…
A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL
The increasing use of Semantic Web technologies in the life sciences, in particular the use of the Resource Description Framework (RDF) and the RDF query language SPARQL, opens the path for novel integrative analyses, combining information from multiple data sources. However, analyzing evolutionary...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
F1000 Research Limited
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324951/ https://www.ncbi.nlm.nih.gov/pubmed/32612807 http://dx.doi.org/10.12688/f1000research.21027.2 |
_version_ | 1783552058762199040 |
---|---|
author | Sima, Ana Claudia Dessimoz, Christophe Stockinger, Kurt Zahn-Zabal, Monique Mendes de Farias, Tarcisio |
author_facet | Sima, Ana Claudia Dessimoz, Christophe Stockinger, Kurt Zahn-Zabal, Monique Mendes de Farias, Tarcisio |
author_sort | Sima, Ana Claudia |
collection | PubMed |
description | The increasing use of Semantic Web technologies in the life sciences, in particular the use of the Resource Description Framework (RDF) and the RDF query language SPARQL, opens the path for novel integrative analyses, combining information from multiple data sources. However, analyzing evolutionary data in RDF is not trivial, due to the steep learning curve required to understand both the data models adopted by different RDF data sources, as well as the equivalent SPARQL constructs required to benefit from this data – in particular, recursive property paths. In this article, we provide a hands-on introduction to querying evolutionary data across several data sources that publish orthology information in RDF, namely: The Orthologous MAtrix (OMA), the European Bioinformatics Institute (EBI) RDF platform, the Database of Orthologous Groups (OrthoDB) and the Microbial Genome Database (MBGD). We present four protocols in increasing order of complexity. In these protocols, we demonstrate through SPARQL queries how to retrieve pairwise orthologs, homologous groups, and hierarchical orthologous groups. Finally, we show how orthology information in different data sources can be compared, through the use of federated SPARQL queries. |
format | Online Article Text |
id | pubmed-7324951 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | F1000 Research Limited |
record_format | MEDLINE/PubMed |
spelling | pubmed-73249512020-06-30 A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL Sima, Ana Claudia Dessimoz, Christophe Stockinger, Kurt Zahn-Zabal, Monique Mendes de Farias, Tarcisio F1000Res Method Article The increasing use of Semantic Web technologies in the life sciences, in particular the use of the Resource Description Framework (RDF) and the RDF query language SPARQL, opens the path for novel integrative analyses, combining information from multiple data sources. However, analyzing evolutionary data in RDF is not trivial, due to the steep learning curve required to understand both the data models adopted by different RDF data sources, as well as the equivalent SPARQL constructs required to benefit from this data – in particular, recursive property paths. In this article, we provide a hands-on introduction to querying evolutionary data across several data sources that publish orthology information in RDF, namely: The Orthologous MAtrix (OMA), the European Bioinformatics Institute (EBI) RDF platform, the Database of Orthologous Groups (OrthoDB) and the Microbial Genome Database (MBGD). We present four protocols in increasing order of complexity. In these protocols, we demonstrate through SPARQL queries how to retrieve pairwise orthologs, homologous groups, and hierarchical orthologous groups. Finally, we show how orthology information in different data sources can be compared, through the use of federated SPARQL queries. F1000 Research Limited 2020-07-22 /pmc/articles/PMC7324951/ /pubmed/32612807 http://dx.doi.org/10.12688/f1000research.21027.2 Text en Copyright: © 2020 Sima AC et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Method Article Sima, Ana Claudia Dessimoz, Christophe Stockinger, Kurt Zahn-Zabal, Monique Mendes de Farias, Tarcisio A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL |
title | A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL |
title_full | A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL |
title_fullStr | A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL |
title_full_unstemmed | A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL |
title_short | A hands-on introduction to querying evolutionary relationships across multiple data sources using SPARQL |
title_sort | hands-on introduction to querying evolutionary relationships across multiple data sources using sparql |
topic | Method Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7324951/ https://www.ncbi.nlm.nih.gov/pubmed/32612807 http://dx.doi.org/10.12688/f1000research.21027.2 |
work_keys_str_mv | AT simaanaclaudia ahandsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql AT dessimozchristophe ahandsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql AT stockingerkurt ahandsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql AT zahnzabalmonique ahandsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql AT mendesdefariastarcisio ahandsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql AT simaanaclaudia handsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql AT dessimozchristophe handsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql AT stockingerkurt handsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql AT zahnzabalmonique handsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql AT mendesdefariastarcisio handsonintroductiontoqueryingevolutionaryrelationshipsacrossmultipledatasourcesusingsparql |