Cargando…

Data hosting infrastructure for primary biodiversity data

BACKGROUND: Today, an unprecedented volume of primary biodiversity data are being generated worldwide, yet significant amounts of these data have been and will continue to be lost after the conclusion of the projects tasked with collecting them. To get the most value out of these data it is imperati...

Descripción completa

Detalles Bibliográficos
Autores principales: Goddard, Anthony, Wilson, Nathan, Cryer, Phil, Yamashita, Grant
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287448/
https://www.ncbi.nlm.nih.gov/pubmed/22373257
http://dx.doi.org/10.1186/1471-2105-12-S15-S5
_version_ 1782224665411846144
author Goddard, Anthony
Wilson, Nathan
Cryer, Phil
Yamashita, Grant
author_facet Goddard, Anthony
Wilson, Nathan
Cryer, Phil
Yamashita, Grant
author_sort Goddard, Anthony
collection PubMed
description BACKGROUND: Today, an unprecedented volume of primary biodiversity data are being generated worldwide, yet significant amounts of these data have been and will continue to be lost after the conclusion of the projects tasked with collecting them. To get the most value out of these data it is imperative to seek a solution whereby these data are rescued, archived and made available to the biodiversity community. To this end, the biodiversity informatics community requires investment in processes and infrastructure to mitigate data loss and provide solutions for long-term hosting and sharing of biodiversity data. DISCUSSION: We review the current state of biodiversity data hosting and investigate the technological and sociological barriers to proper data management. We further explore the rescuing and re-hosting of legacy data, the state of existing toolsets and propose a future direction for the development of new discovery tools. We also explore the role of data standards and licensing in the context of data hosting and preservation. We provide five recommendations for the biodiversity community that will foster better data preservation and access: (1) encourage the community's use of data standards, (2) promote the public domain licensing of data, (3) establish a community of those involved in data hosting and archival, (4) establish hosting centers for biodiversity data, and (5) develop tools for data discovery. CONCLUSION: The community's adoption of standards and development of tools to enable data discovery is essential to sustainable data preservation. Furthermore, the increased adoption of open content licensing, the establishment of data hosting infrastructure and the creation of a data hosting and archiving community are all necessary steps towards the community ensuring that data archival policies become standardized.
format Online
Article
Text
id pubmed-3287448
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32874482012-02-28 Data hosting infrastructure for primary biodiversity data Goddard, Anthony Wilson, Nathan Cryer, Phil Yamashita, Grant BMC Bioinformatics Research BACKGROUND: Today, an unprecedented volume of primary biodiversity data are being generated worldwide, yet significant amounts of these data have been and will continue to be lost after the conclusion of the projects tasked with collecting them. To get the most value out of these data it is imperative to seek a solution whereby these data are rescued, archived and made available to the biodiversity community. To this end, the biodiversity informatics community requires investment in processes and infrastructure to mitigate data loss and provide solutions for long-term hosting and sharing of biodiversity data. DISCUSSION: We review the current state of biodiversity data hosting and investigate the technological and sociological barriers to proper data management. We further explore the rescuing and re-hosting of legacy data, the state of existing toolsets and propose a future direction for the development of new discovery tools. We also explore the role of data standards and licensing in the context of data hosting and preservation. We provide five recommendations for the biodiversity community that will foster better data preservation and access: (1) encourage the community's use of data standards, (2) promote the public domain licensing of data, (3) establish a community of those involved in data hosting and archival, (4) establish hosting centers for biodiversity data, and (5) develop tools for data discovery. CONCLUSION: The community's adoption of standards and development of tools to enable data discovery is essential to sustainable data preservation. Furthermore, the increased adoption of open content licensing, the establishment of data hosting infrastructure and the creation of a data hosting and archiving community are all necessary steps towards the community ensuring that data archival policies become standardized. BioMed Central 2011-12-15 /pmc/articles/PMC3287448/ /pubmed/22373257 http://dx.doi.org/10.1186/1471-2105-12-S15-S5 Text en Copyright ©2011 Goddard et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Goddard, Anthony
Wilson, Nathan
Cryer, Phil
Yamashita, Grant
Data hosting infrastructure for primary biodiversity data
title Data hosting infrastructure for primary biodiversity data
title_full Data hosting infrastructure for primary biodiversity data
title_fullStr Data hosting infrastructure for primary biodiversity data
title_full_unstemmed Data hosting infrastructure for primary biodiversity data
title_short Data hosting infrastructure for primary biodiversity data
title_sort data hosting infrastructure for primary biodiversity data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3287448/
https://www.ncbi.nlm.nih.gov/pubmed/22373257
http://dx.doi.org/10.1186/1471-2105-12-S15-S5
work_keys_str_mv AT goddardanthony datahostinginfrastructureforprimarybiodiversitydata
AT wilsonnathan datahostinginfrastructureforprimarybiodiversitydata
AT cryerphil datahostinginfrastructureforprimarybiodiversitydata
AT yamashitagrant datahostinginfrastructureforprimarybiodiversitydata