Cargando…
Biospytial: spatial graph-based computing for ecological Big Data
BACKGROUND: The exponential accumulation of environmental and ecological data together with the adoption of open data initiatives bring opportunities and challenges for integrating and synthesising relevant knowledge that need to be addressed, given the ongoing environmental crises. FINDINGS: Here w...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7213554/ https://www.ncbi.nlm.nih.gov/pubmed/32391910 http://dx.doi.org/10.1093/gigascience/giaa039 |
_version_ | 1783531821346062336 |
---|---|
author | Escamilla Molgora, Juan M Sedda, Luigi Atkinson, Peter M |
author_facet | Escamilla Molgora, Juan M Sedda, Luigi Atkinson, Peter M |
author_sort | Escamilla Molgora, Juan M |
collection | PubMed |
description | BACKGROUND: The exponential accumulation of environmental and ecological data together with the adoption of open data initiatives bring opportunities and challenges for integrating and synthesising relevant knowledge that need to be addressed, given the ongoing environmental crises. FINDINGS: Here we present Biospytial, a modular open source knowledge engine designed to import, organise, analyse and visualise big spatial ecological datasets using the power of graph theory. The engine uses a hybrid graph-relational approach to store and access information. A graph data structure uses linkage relationships to build semantic structures represented as complex data structures stored in a graph database, while tabular and geospatial data are stored in an efficient spatial relational database system. We provide an application using information on species occurrences, their taxonomic classification and climatic datasets. We built a knowledge graph of the Tree of Life embedded in an environmental and geographical grid to perform an analysis on threatened species co-occurring with jaguars (Panthera onca). CONCLUSIONS: The Biospytial approach reduces the complexity of joining datasets using multiple tabular relations, while its scalable design eases the problem of merging datasets from different sources. Its modular design makes it possible to distribute several instances simultaneously, allowing fast and efficient handling of big ecological datasets. The provided example demonstrates the engine’s capabilities in performing basic graph manipulation, analysis and visualizations of taxonomic groups co-occurring in space. The example shows potential avenues for performing novel ecological analyses, biodiversity syntheses and species distribution models aided by a network of taxonomic and spatial relationships. |
format | Online Article Text |
id | pubmed-7213554 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-72135542020-05-15 Biospytial: spatial graph-based computing for ecological Big Data Escamilla Molgora, Juan M Sedda, Luigi Atkinson, Peter M Gigascience Technical Note BACKGROUND: The exponential accumulation of environmental and ecological data together with the adoption of open data initiatives bring opportunities and challenges for integrating and synthesising relevant knowledge that need to be addressed, given the ongoing environmental crises. FINDINGS: Here we present Biospytial, a modular open source knowledge engine designed to import, organise, analyse and visualise big spatial ecological datasets using the power of graph theory. The engine uses a hybrid graph-relational approach to store and access information. A graph data structure uses linkage relationships to build semantic structures represented as complex data structures stored in a graph database, while tabular and geospatial data are stored in an efficient spatial relational database system. We provide an application using information on species occurrences, their taxonomic classification and climatic datasets. We built a knowledge graph of the Tree of Life embedded in an environmental and geographical grid to perform an analysis on threatened species co-occurring with jaguars (Panthera onca). CONCLUSIONS: The Biospytial approach reduces the complexity of joining datasets using multiple tabular relations, while its scalable design eases the problem of merging datasets from different sources. Its modular design makes it possible to distribute several instances simultaneously, allowing fast and efficient handling of big ecological datasets. The provided example demonstrates the engine’s capabilities in performing basic graph manipulation, analysis and visualizations of taxonomic groups co-occurring in space. The example shows potential avenues for performing novel ecological analyses, biodiversity syntheses and species distribution models aided by a network of taxonomic and spatial relationships. Oxford University Press 2020-05-11 /pmc/articles/PMC7213554/ /pubmed/32391910 http://dx.doi.org/10.1093/gigascience/giaa039 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Technical Note Escamilla Molgora, Juan M Sedda, Luigi Atkinson, Peter M Biospytial: spatial graph-based computing for ecological Big Data |
title | Biospytial: spatial graph-based computing for ecological Big Data |
title_full | Biospytial: spatial graph-based computing for ecological Big Data |
title_fullStr | Biospytial: spatial graph-based computing for ecological Big Data |
title_full_unstemmed | Biospytial: spatial graph-based computing for ecological Big Data |
title_short | Biospytial: spatial graph-based computing for ecological Big Data |
title_sort | biospytial: spatial graph-based computing for ecological big data |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7213554/ https://www.ncbi.nlm.nih.gov/pubmed/32391910 http://dx.doi.org/10.1093/gigascience/giaa039 |
work_keys_str_mv | AT escamillamolgorajuanm biospytialspatialgraphbasedcomputingforecologicalbigdata AT seddaluigi biospytialspatialgraphbasedcomputingforecologicalbigdata AT atkinsonpeterm biospytialspatialgraphbasedcomputingforecologicalbigdata |