Cargando…

Infrastructure and Population of the OpenBiodiv Biodiversity Knowledge Graph

BACKGROUND: OpenBiodiv is a biodiversity knowledge graph containing a synthetic linked open dataset, OpenBiodiv-LOD, which combines knowledge extracted from academic literature with the taxonomic backbone used by the Global Biodiversity Information Facility. The linked open data is modelled accordin...

Descripción completa

Detalles Bibliográficos
Autores principales: Dimitrova, Mariya, Senderov, Viktor E, Georgiev, Teodor, Zhelezov, Georgi, Penev, Lyubomir
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Pensoft Publishers 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8486731/
https://www.ncbi.nlm.nih.gov/pubmed/34690512
http://dx.doi.org/10.3897/BDJ.9.e67671
_version_ 1784577807819472896
author Dimitrova, Mariya
Senderov, Viktor E
Georgiev, Teodor
Zhelezov, Georgi
Penev, Lyubomir
author_facet Dimitrova, Mariya
Senderov, Viktor E
Georgiev, Teodor
Zhelezov, Georgi
Penev, Lyubomir
author_sort Dimitrova, Mariya
collection PubMed
description BACKGROUND: OpenBiodiv is a biodiversity knowledge graph containing a synthetic linked open dataset, OpenBiodiv-LOD, which combines knowledge extracted from academic literature with the taxonomic backbone used by the Global Biodiversity Information Facility. The linked open data is modelled according to the OpenBiodiv-O ontology integrating semantic resource types from recognised biodiversity and publishing ontologies with OpenBiodiv-O resource types, introduced to capture the semantics of resources not modelled before. NEW INFORMATION: We introduce the new release of the OpenBiodiv-LOD attained through information extraction and modelling of additional biodiversity entities. It was achieved by further developments to OpenBiodiv-O, the data storage infrastructure and the workflow and accompanying R software packages used for transformation of academic literature into Resource Description Framework (RDF). We discuss how to utilise the LOD in biodiversity informatics and give examples by providing solutions to several competency questions. We investigate performance issues that arise due to the large amount of inferred statements in the graph and conclude that OWL-full inference is impractical for the project and that unnecessary inference should be avoided.
format Online
Article
Text
id pubmed-8486731
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Pensoft Publishers
record_format MEDLINE/PubMed
spelling pubmed-84867312021-10-22 Infrastructure and Population of the OpenBiodiv Biodiversity Knowledge Graph Dimitrova, Mariya Senderov, Viktor E Georgiev, Teodor Zhelezov, Georgi Penev, Lyubomir Biodivers Data J Software Description BACKGROUND: OpenBiodiv is a biodiversity knowledge graph containing a synthetic linked open dataset, OpenBiodiv-LOD, which combines knowledge extracted from academic literature with the taxonomic backbone used by the Global Biodiversity Information Facility. The linked open data is modelled according to the OpenBiodiv-O ontology integrating semantic resource types from recognised biodiversity and publishing ontologies with OpenBiodiv-O resource types, introduced to capture the semantics of resources not modelled before. NEW INFORMATION: We introduce the new release of the OpenBiodiv-LOD attained through information extraction and modelling of additional biodiversity entities. It was achieved by further developments to OpenBiodiv-O, the data storage infrastructure and the workflow and accompanying R software packages used for transformation of academic literature into Resource Description Framework (RDF). We discuss how to utilise the LOD in biodiversity informatics and give examples by providing solutions to several competency questions. We investigate performance issues that arise due to the large amount of inferred statements in the graph and conclude that OWL-full inference is impractical for the project and that unnecessary inference should be avoided. Pensoft Publishers 2021-09-24 /pmc/articles/PMC8486731/ /pubmed/34690512 http://dx.doi.org/10.3897/BDJ.9.e67671 Text en Mariya Dimitrova, Viktor E Senderov, Teodor Georgiev, Georgi Zhelezov, Lyubomir Penev https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Software Description
Dimitrova, Mariya
Senderov, Viktor E
Georgiev, Teodor
Zhelezov, Georgi
Penev, Lyubomir
Infrastructure and Population of the OpenBiodiv Biodiversity Knowledge Graph
title Infrastructure and Population of the OpenBiodiv Biodiversity Knowledge Graph
title_full Infrastructure and Population of the OpenBiodiv Biodiversity Knowledge Graph
title_fullStr Infrastructure and Population of the OpenBiodiv Biodiversity Knowledge Graph
title_full_unstemmed Infrastructure and Population of the OpenBiodiv Biodiversity Knowledge Graph
title_short Infrastructure and Population of the OpenBiodiv Biodiversity Knowledge Graph
title_sort infrastructure and population of the openbiodiv biodiversity knowledge graph
topic Software Description
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8486731/
https://www.ncbi.nlm.nih.gov/pubmed/34690512
http://dx.doi.org/10.3897/BDJ.9.e67671
work_keys_str_mv AT dimitrovamariya infrastructureandpopulationoftheopenbiodivbiodiversityknowledgegraph
AT senderovviktore infrastructureandpopulationoftheopenbiodivbiodiversityknowledgegraph
AT georgievteodor infrastructureandpopulationoftheopenbiodivbiodiversityknowledgegraph
AT zhelezovgeorgi infrastructureandpopulationoftheopenbiodivbiodiversityknowledgegraph
AT penevlyubomir infrastructureandpopulationoftheopenbiodivbiodiversityknowledgegraph