Cargando…

Biospytial: spatial graph-based computing for ecological Big Data

BACKGROUND: The exponential accumulation of environmental and ecological data together with the adoption of open data initiatives bring opportunities and challenges for integrating and synthesising relevant knowledge that need to be addressed, given the ongoing environmental crises. FINDINGS: Here w...

Descripción completa

Detalles Bibliográficos
Autores principales: Escamilla Molgora, Juan M, Sedda, Luigi, Atkinson, Peter M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7213554/
https://www.ncbi.nlm.nih.gov/pubmed/32391910
http://dx.doi.org/10.1093/gigascience/giaa039
_version_ 1783531821346062336
author Escamilla Molgora, Juan M
Sedda, Luigi
Atkinson, Peter M
author_facet Escamilla Molgora, Juan M
Sedda, Luigi
Atkinson, Peter M
author_sort Escamilla Molgora, Juan M
collection PubMed
description BACKGROUND: The exponential accumulation of environmental and ecological data together with the adoption of open data initiatives bring opportunities and challenges for integrating and synthesising relevant knowledge that need to be addressed, given the ongoing environmental crises. FINDINGS: Here we present Biospytial, a modular open source knowledge engine designed to import, organise, analyse and visualise big spatial ecological datasets using the power of graph theory. The engine uses a hybrid graph-relational approach to store and access information. A graph data structure uses linkage relationships to build semantic structures represented as complex data structures stored in a graph database, while tabular and geospatial data are stored in an efficient spatial relational database system. We provide an application using information on species occurrences, their taxonomic classification and climatic datasets. We built a knowledge graph of the Tree of Life embedded in an environmental and geographical grid to perform an analysis on threatened species co-occurring with jaguars (Panthera onca). CONCLUSIONS: The Biospytial approach reduces the complexity of joining datasets using multiple tabular relations, while its scalable design eases the problem of merging datasets from different sources. Its modular design makes it possible to distribute several instances simultaneously, allowing fast and efficient handling of big ecological datasets. The provided example demonstrates the engine’s capabilities in performing basic graph manipulation, analysis and visualizations of taxonomic groups co-occurring in space. The example shows potential avenues for performing novel ecological analyses, biodiversity syntheses and species distribution models aided by a network of taxonomic and spatial relationships.
format Online
Article
Text
id pubmed-7213554
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-72135542020-05-15 Biospytial: spatial graph-based computing for ecological Big Data Escamilla Molgora, Juan M Sedda, Luigi Atkinson, Peter M Gigascience Technical Note BACKGROUND: The exponential accumulation of environmental and ecological data together with the adoption of open data initiatives bring opportunities and challenges for integrating and synthesising relevant knowledge that need to be addressed, given the ongoing environmental crises. FINDINGS: Here we present Biospytial, a modular open source knowledge engine designed to import, organise, analyse and visualise big spatial ecological datasets using the power of graph theory. The engine uses a hybrid graph-relational approach to store and access information. A graph data structure uses linkage relationships to build semantic structures represented as complex data structures stored in a graph database, while tabular and geospatial data are stored in an efficient spatial relational database system. We provide an application using information on species occurrences, their taxonomic classification and climatic datasets. We built a knowledge graph of the Tree of Life embedded in an environmental and geographical grid to perform an analysis on threatened species co-occurring with jaguars (Panthera onca). CONCLUSIONS: The Biospytial approach reduces the complexity of joining datasets using multiple tabular relations, while its scalable design eases the problem of merging datasets from different sources. Its modular design makes it possible to distribute several instances simultaneously, allowing fast and efficient handling of big ecological datasets. The provided example demonstrates the engine’s capabilities in performing basic graph manipulation, analysis and visualizations of taxonomic groups co-occurring in space. The example shows potential avenues for performing novel ecological analyses, biodiversity syntheses and species distribution models aided by a network of taxonomic and spatial relationships. Oxford University Press 2020-05-11 /pmc/articles/PMC7213554/ /pubmed/32391910 http://dx.doi.org/10.1093/gigascience/giaa039 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Escamilla Molgora, Juan M
Sedda, Luigi
Atkinson, Peter M
Biospytial: spatial graph-based computing for ecological Big Data
title Biospytial: spatial graph-based computing for ecological Big Data
title_full Biospytial: spatial graph-based computing for ecological Big Data
title_fullStr Biospytial: spatial graph-based computing for ecological Big Data
title_full_unstemmed Biospytial: spatial graph-based computing for ecological Big Data
title_short Biospytial: spatial graph-based computing for ecological Big Data
title_sort biospytial: spatial graph-based computing for ecological big data
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7213554/
https://www.ncbi.nlm.nih.gov/pubmed/32391910
http://dx.doi.org/10.1093/gigascience/giaa039
work_keys_str_mv AT escamillamolgorajuanm biospytialspatialgraphbasedcomputingforecologicalbigdata
AT seddaluigi biospytialspatialgraphbasedcomputingforecologicalbigdata
AT atkinsonpeterm biospytialspatialgraphbasedcomputingforecologicalbigdata