Cargando…

A comparison of approaches to accessing existing biological and chemical relational databases via SPARQL

Current biological and chemical research is increasingly dependent on the reusability of previously acquired data, which typically come from various sources. Consequently, there is a growing need for database systems and databases stored in them to be interoperable with each other. One of the possib...

Descripción completa

Detalles Bibliográficos
Autores principales: Galgonek, Jakub, Vondrášek, Jiří
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280967/
https://www.ncbi.nlm.nih.gov/pubmed/37340506
http://dx.doi.org/10.1186/s13321-023-00729-5
_version_ 1785060915600687104
author Galgonek, Jakub
Vondrášek, Jiří
author_facet Galgonek, Jakub
Vondrášek, Jiří
author_sort Galgonek, Jakub
collection PubMed
description Current biological and chemical research is increasingly dependent on the reusability of previously acquired data, which typically come from various sources. Consequently, there is a growing need for database systems and databases stored in them to be interoperable with each other. One of the possible solutions to address this issue is to use systems based on Semantic Web technologies, namely on the Resource Description Framework (RDF) to express data and on the SPARQL query language to retrieve the data. Many existing biological and chemical databases are stored in the form of a relational database (RDB). Converting a relational database into the RDF form and storing it in a native RDF database system may not be desirable in many cases. It may be necessary to preserve the original database form, and having two versions of the same data may not be convenient. A solution may be to use a system mapping the relational database to the RDF form. Such a system keeps data in their original relational form and translates incoming SPARQL queries to equivalent SQL queries, which are evaluated by a relational-database system. This review compares different RDB-to-RDF mapping systems with a primary focus on those that can be used free of charge. In addition, it compares different approaches to expressing RDB-to-RDF mappings. The review shows that these systems represent a viable method providing sufficient performance. Their real-life performance is demonstrated on data and queries coming from the neXtProt project. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-023-00729-5.
format Online
Article
Text
id pubmed-10280967
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-102809672023-06-21 A comparison of approaches to accessing existing biological and chemical relational databases via SPARQL Galgonek, Jakub Vondrášek, Jiří J Cheminform Review Current biological and chemical research is increasingly dependent on the reusability of previously acquired data, which typically come from various sources. Consequently, there is a growing need for database systems and databases stored in them to be interoperable with each other. One of the possible solutions to address this issue is to use systems based on Semantic Web technologies, namely on the Resource Description Framework (RDF) to express data and on the SPARQL query language to retrieve the data. Many existing biological and chemical databases are stored in the form of a relational database (RDB). Converting a relational database into the RDF form and storing it in a native RDF database system may not be desirable in many cases. It may be necessary to preserve the original database form, and having two versions of the same data may not be convenient. A solution may be to use a system mapping the relational database to the RDF form. Such a system keeps data in their original relational form and translates incoming SPARQL queries to equivalent SQL queries, which are evaluated by a relational-database system. This review compares different RDB-to-RDF mapping systems with a primary focus on those that can be used free of charge. In addition, it compares different approaches to expressing RDB-to-RDF mappings. The review shows that these systems represent a viable method providing sufficient performance. Their real-life performance is demonstrated on data and queries coming from the neXtProt project. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13321-023-00729-5. Springer International Publishing 2023-06-20 /pmc/articles/PMC10280967/ /pubmed/37340506 http://dx.doi.org/10.1186/s13321-023-00729-5 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Review
Galgonek, Jakub
Vondrášek, Jiří
A comparison of approaches to accessing existing biological and chemical relational databases via SPARQL
title A comparison of approaches to accessing existing biological and chemical relational databases via SPARQL
title_full A comparison of approaches to accessing existing biological and chemical relational databases via SPARQL
title_fullStr A comparison of approaches to accessing existing biological and chemical relational databases via SPARQL
title_full_unstemmed A comparison of approaches to accessing existing biological and chemical relational databases via SPARQL
title_short A comparison of approaches to accessing existing biological and chemical relational databases via SPARQL
title_sort comparison of approaches to accessing existing biological and chemical relational databases via sparql
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280967/
https://www.ncbi.nlm.nih.gov/pubmed/37340506
http://dx.doi.org/10.1186/s13321-023-00729-5
work_keys_str_mv AT galgonekjakub acomparisonofapproachestoaccessingexistingbiologicalandchemicalrelationaldatabasesviasparql
AT vondrasekjiri acomparisonofapproachestoaccessingexistingbiologicalandchemicalrelationaldatabasesviasparql
AT galgonekjakub comparisonofapproachestoaccessingexistingbiologicalandchemicalrelationaldatabasesviasparql
AT vondrasekjiri comparisonofapproachestoaccessingexistingbiologicalandchemicalrelationaldatabasesviasparql