Cargando…

Decentralized but Globally Coordinated Biodiversity Data

Centralized biodiversity data aggregation is too often failing societal needs due to pervasive and systemic data quality deficiencies. We argue for a novel approach that embodies the spirit of the Web (“small pieces loosely joined”) through the decentralized coordination of data across scientific la...

Descripción completa

Detalles Bibliográficos
Autores principales: Sterner, Beckett W., Gilbert, Edward E., Franz, Nico M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931950/
https://www.ncbi.nlm.nih.gov/pubmed/33693407
http://dx.doi.org/10.3389/fdata.2020.519133
_version_ 1783660389956845568
author Sterner, Beckett W.
Gilbert, Edward E.
Franz, Nico M.
author_facet Sterner, Beckett W.
Gilbert, Edward E.
Franz, Nico M.
author_sort Sterner, Beckett W.
collection PubMed
description Centralized biodiversity data aggregation is too often failing societal needs due to pervasive and systemic data quality deficiencies. We argue for a novel approach that embodies the spirit of the Web (“small pieces loosely joined”) through the decentralized coordination of data across scientific languages and communities. The upfront cost of decentralization can be offset by the long-term benefit of achieving sustained expert engagement, higher-quality data products, and ultimately more societal impact for biodiversity data. Our decentralized approach encourages the emergence and evolution of multiple self-identifying communities of practice that are regionally, taxonomically, or institutionally localized. Each community is empowered to control the social and informational design and versioning of their local data infrastructures and signals. With no single aggregator to exert centralized control over biodiversity data, decentralization generates loosely connected networks of mid-level aggregators. Global coordination is nevertheless feasible through automatable data sharing agreements that enable efficient propagation and translation of biodiversity data across communities. The decentralized model also poses novel integration challenges, among which the explicit and continuous articulation of conflicting systematic classifications and phylogenies remain the most challenging. We discuss the development of available solutions, challenges, and outline next steps: the global effort of coordination should focus on developing shared languages for data signal translation, as opposed to homogenizing the data signal itself.
format Online
Article
Text
id pubmed-7931950
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-79319502021-03-09 Decentralized but Globally Coordinated Biodiversity Data Sterner, Beckett W. Gilbert, Edward E. Franz, Nico M. Front Big Data Big Data Centralized biodiversity data aggregation is too often failing societal needs due to pervasive and systemic data quality deficiencies. We argue for a novel approach that embodies the spirit of the Web (“small pieces loosely joined”) through the decentralized coordination of data across scientific languages and communities. The upfront cost of decentralization can be offset by the long-term benefit of achieving sustained expert engagement, higher-quality data products, and ultimately more societal impact for biodiversity data. Our decentralized approach encourages the emergence and evolution of multiple self-identifying communities of practice that are regionally, taxonomically, or institutionally localized. Each community is empowered to control the social and informational design and versioning of their local data infrastructures and signals. With no single aggregator to exert centralized control over biodiversity data, decentralization generates loosely connected networks of mid-level aggregators. Global coordination is nevertheless feasible through automatable data sharing agreements that enable efficient propagation and translation of biodiversity data across communities. The decentralized model also poses novel integration challenges, among which the explicit and continuous articulation of conflicting systematic classifications and phylogenies remain the most challenging. We discuss the development of available solutions, challenges, and outline next steps: the global effort of coordination should focus on developing shared languages for data signal translation, as opposed to homogenizing the data signal itself. Frontiers Media S.A. 2020-10-23 /pmc/articles/PMC7931950/ /pubmed/33693407 http://dx.doi.org/10.3389/fdata.2020.519133 Text en Copyright © 2020 Sterner, Gilbert and Franz. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Big Data
Sterner, Beckett W.
Gilbert, Edward E.
Franz, Nico M.
Decentralized but Globally Coordinated Biodiversity Data
title Decentralized but Globally Coordinated Biodiversity Data
title_full Decentralized but Globally Coordinated Biodiversity Data
title_fullStr Decentralized but Globally Coordinated Biodiversity Data
title_full_unstemmed Decentralized but Globally Coordinated Biodiversity Data
title_short Decentralized but Globally Coordinated Biodiversity Data
title_sort decentralized but globally coordinated biodiversity data
topic Big Data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931950/
https://www.ncbi.nlm.nih.gov/pubmed/33693407
http://dx.doi.org/10.3389/fdata.2020.519133
work_keys_str_mv AT sternerbeckettw decentralizedbutgloballycoordinatedbiodiversitydata
AT gilbertedwarde decentralizedbutgloballycoordinatedbiodiversitydata
AT franznicom decentralizedbutgloballycoordinatedbiodiversitydata