Cargando…

A data management workflow of biodiversity data from the field to data users

PREMISE: Heterogeneity of biodiversity data from the collections, research, and management communities presents challenges for data findability, accessibility, interoperability, and reusability. Workflows designed with data collection, standards, dissemination, and reuse in mind will generate better...

Descripción completa

Detalles Bibliográficos
Autores principales: Hackett, Rachel A., Belitz, Michael W., Gilbert, Edward E., Monfils, Anna K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6923704/
https://www.ncbi.nlm.nih.gov/pubmed/31890356
http://dx.doi.org/10.1002/aps3.11310
_version_ 1783481576231796736
author Hackett, Rachel A.
Belitz, Michael W.
Gilbert, Edward E.
Monfils, Anna K.
author_facet Hackett, Rachel A.
Belitz, Michael W.
Gilbert, Edward E.
Monfils, Anna K.
author_sort Hackett, Rachel A.
collection PubMed
description PREMISE: Heterogeneity of biodiversity data from the collections, research, and management communities presents challenges for data findability, accessibility, interoperability, and reusability. Workflows designed with data collection, standards, dissemination, and reuse in mind will generate better information across geopolitical, administrative, and institutional boundaries. Here, we present our data workflow as a case study of how we collected, shared, and used data from multiple sources. METHODS: In 2012, we initiated the collection of biodiversity data relating to Michigan prairie fens, including data on plant communities and the federally endangered Poweshiek skipperling (Oarisma poweshiek). RESULTS: Over 23,000 occurrence records were compiled in a database following Darwin Core standards. The records were linked with media and biological, chemical, and geometric measurements. We published the data as Global Biodiversity Information Facility data sets and in Symbiota SEINet portals. DISCUSSION: We highlight data collection techniques that optimized transcription time, including the use of predetermined and controlled vocabulary, Darwin Core terms, and data dictionaries. The validity and longevity of our data were supported by voucher specimens, metadata with measurement records, and published manuscripts detailing methods and data sets. Key to our data dissemination was cooperation among partners and the utilization of dynamic tools. To increase data interoperability, we need flexible and customizable data collection templates, coding, and enhanced communication among communities using biodiversity data.
format Online
Article
Text
id pubmed-6923704
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-69237042019-12-30 A data management workflow of biodiversity data from the field to data users Hackett, Rachel A. Belitz, Michael W. Gilbert, Edward E. Monfils, Anna K. Appl Plant Sci Application Article PREMISE: Heterogeneity of biodiversity data from the collections, research, and management communities presents challenges for data findability, accessibility, interoperability, and reusability. Workflows designed with data collection, standards, dissemination, and reuse in mind will generate better information across geopolitical, administrative, and institutional boundaries. Here, we present our data workflow as a case study of how we collected, shared, and used data from multiple sources. METHODS: In 2012, we initiated the collection of biodiversity data relating to Michigan prairie fens, including data on plant communities and the federally endangered Poweshiek skipperling (Oarisma poweshiek). RESULTS: Over 23,000 occurrence records were compiled in a database following Darwin Core standards. The records were linked with media and biological, chemical, and geometric measurements. We published the data as Global Biodiversity Information Facility data sets and in Symbiota SEINet portals. DISCUSSION: We highlight data collection techniques that optimized transcription time, including the use of predetermined and controlled vocabulary, Darwin Core terms, and data dictionaries. The validity and longevity of our data were supported by voucher specimens, metadata with measurement records, and published manuscripts detailing methods and data sets. Key to our data dissemination was cooperation among partners and the utilization of dynamic tools. To increase data interoperability, we need flexible and customizable data collection templates, coding, and enhanced communication among communities using biodiversity data. John Wiley and Sons Inc. 2019-12-20 /pmc/articles/PMC6923704/ /pubmed/31890356 http://dx.doi.org/10.1002/aps3.11310 Text en © 2019 Hackett et al. Applications in Plant Sciences is published by Wiley Periodicals, Inc. on behalf of the Botanical Society of America This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Application Article
Hackett, Rachel A.
Belitz, Michael W.
Gilbert, Edward E.
Monfils, Anna K.
A data management workflow of biodiversity data from the field to data users
title A data management workflow of biodiversity data from the field to data users
title_full A data management workflow of biodiversity data from the field to data users
title_fullStr A data management workflow of biodiversity data from the field to data users
title_full_unstemmed A data management workflow of biodiversity data from the field to data users
title_short A data management workflow of biodiversity data from the field to data users
title_sort data management workflow of biodiversity data from the field to data users
topic Application Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6923704/
https://www.ncbi.nlm.nih.gov/pubmed/31890356
http://dx.doi.org/10.1002/aps3.11310
work_keys_str_mv AT hackettrachela adatamanagementworkflowofbiodiversitydatafromthefieldtodatausers
AT belitzmichaelw adatamanagementworkflowofbiodiversitydatafromthefieldtodatausers
AT gilbertedwarde adatamanagementworkflowofbiodiversitydatafromthefieldtodatausers
AT monfilsannak adatamanagementworkflowofbiodiversitydatafromthefieldtodatausers
AT hackettrachela datamanagementworkflowofbiodiversitydatafromthefieldtodatausers
AT belitzmichaelw datamanagementworkflowofbiodiversitydatafromthefieldtodatausers
AT gilbertedwarde datamanagementworkflowofbiodiversitydatafromthefieldtodatausers
AT monfilsannak datamanagementworkflowofbiodiversitydatafromthefieldtodatausers