Cargando…

Fishing for data and sorting the catch: assessing the data quality, completeness and fitness for use of data in marine biogeographic databases

Being able to assess the quality and level of completeness of data has become indispensable in marine biodiversity research, especially when dealing with large databases that typically compile data from a variety of sources. Very few integrated databases offer quality flags on the level of the indiv...

Descripción completa

Detalles Bibliográficos
Autores principales: Vandepitte, Leen, Bosch, Samuel, Tyberghein, Lennert, Waumans, Filip, Vanhoorne, Bart, Hernandez, Francisco, De Clerck, Olivier, Mees, Jan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4309024/
https://www.ncbi.nlm.nih.gov/pubmed/25632106
http://dx.doi.org/10.1093/database/bau125
_version_ 1782354619995783168
author Vandepitte, Leen
Bosch, Samuel
Tyberghein, Lennert
Waumans, Filip
Vanhoorne, Bart
Hernandez, Francisco
De Clerck, Olivier
Mees, Jan
author_facet Vandepitte, Leen
Bosch, Samuel
Tyberghein, Lennert
Waumans, Filip
Vanhoorne, Bart
Hernandez, Francisco
De Clerck, Olivier
Mees, Jan
author_sort Vandepitte, Leen
collection PubMed
description Being able to assess the quality and level of completeness of data has become indispensable in marine biodiversity research, especially when dealing with large databases that typically compile data from a variety of sources. Very few integrated databases offer quality flags on the level of the individual record, making it hard for users to easily extract the data that are fit for their specific purposes. This article describes the different steps that were developed to analyse the quality and completeness of the distribution records within the European and international Ocean Biogeographic Information Systems (EurOBIS and OBIS). Records are checked on data format, completeness and validity of information, quality and detail of the used taxonomy and geographic indications and whether or not the record is a putative outlier. The corresponding quality control (QC) flags will not only help users with their data selection, they will also help the data management team and the data custodians to identify possible gaps and errors in the submitted data, providing scope to improve data quality. The results of these quality control procedures are as of now available on both the EurOBIS and OBIS databases. Through the Biology portal of the European Marine Observation and Data Network (EMODnet Biology), a subset of EurOBIS records—passing a specific combination of these QC steps—is offered to the users. In the future, EMODnet Biology will offer a wide range of filter options through its portal, allowing users to make specific selections themselves. Through LifeWatch, users can already upload their own data and check them against a selection of the here described quality control procedures. Database URL: www.eurobis.org (www.iobis.org; www.emodnet-biology.eu/)
format Online
Article
Text
id pubmed-4309024
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-43090242015-02-24 Fishing for data and sorting the catch: assessing the data quality, completeness and fitness for use of data in marine biogeographic databases Vandepitte, Leen Bosch, Samuel Tyberghein, Lennert Waumans, Filip Vanhoorne, Bart Hernandez, Francisco De Clerck, Olivier Mees, Jan Database (Oxford) Original Article Being able to assess the quality and level of completeness of data has become indispensable in marine biodiversity research, especially when dealing with large databases that typically compile data from a variety of sources. Very few integrated databases offer quality flags on the level of the individual record, making it hard for users to easily extract the data that are fit for their specific purposes. This article describes the different steps that were developed to analyse the quality and completeness of the distribution records within the European and international Ocean Biogeographic Information Systems (EurOBIS and OBIS). Records are checked on data format, completeness and validity of information, quality and detail of the used taxonomy and geographic indications and whether or not the record is a putative outlier. The corresponding quality control (QC) flags will not only help users with their data selection, they will also help the data management team and the data custodians to identify possible gaps and errors in the submitted data, providing scope to improve data quality. The results of these quality control procedures are as of now available on both the EurOBIS and OBIS databases. Through the Biology portal of the European Marine Observation and Data Network (EMODnet Biology), a subset of EurOBIS records—passing a specific combination of these QC steps—is offered to the users. In the future, EMODnet Biology will offer a wide range of filter options through its portal, allowing users to make specific selections themselves. Through LifeWatch, users can already upload their own data and check them against a selection of the here described quality control procedures. Database URL: www.eurobis.org (www.iobis.org; www.emodnet-biology.eu/) Oxford University Press 2015-01-28 /pmc/articles/PMC4309024/ /pubmed/25632106 http://dx.doi.org/10.1093/database/bau125 Text en © The Author(s) 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Vandepitte, Leen
Bosch, Samuel
Tyberghein, Lennert
Waumans, Filip
Vanhoorne, Bart
Hernandez, Francisco
De Clerck, Olivier
Mees, Jan
Fishing for data and sorting the catch: assessing the data quality, completeness and fitness for use of data in marine biogeographic databases
title Fishing for data and sorting the catch: assessing the data quality, completeness and fitness for use of data in marine biogeographic databases
title_full Fishing for data and sorting the catch: assessing the data quality, completeness and fitness for use of data in marine biogeographic databases
title_fullStr Fishing for data and sorting the catch: assessing the data quality, completeness and fitness for use of data in marine biogeographic databases
title_full_unstemmed Fishing for data and sorting the catch: assessing the data quality, completeness and fitness for use of data in marine biogeographic databases
title_short Fishing for data and sorting the catch: assessing the data quality, completeness and fitness for use of data in marine biogeographic databases
title_sort fishing for data and sorting the catch: assessing the data quality, completeness and fitness for use of data in marine biogeographic databases
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4309024/
https://www.ncbi.nlm.nih.gov/pubmed/25632106
http://dx.doi.org/10.1093/database/bau125
work_keys_str_mv AT vandepitteleen fishingfordataandsortingthecatchassessingthedataqualitycompletenessandfitnessforuseofdatainmarinebiogeographicdatabases
AT boschsamuel fishingfordataandsortingthecatchassessingthedataqualitycompletenessandfitnessforuseofdatainmarinebiogeographicdatabases
AT tybergheinlennert fishingfordataandsortingthecatchassessingthedataqualitycompletenessandfitnessforuseofdatainmarinebiogeographicdatabases
AT waumansfilip fishingfordataandsortingthecatchassessingthedataqualitycompletenessandfitnessforuseofdatainmarinebiogeographicdatabases
AT vanhoornebart fishingfordataandsortingthecatchassessingthedataqualitycompletenessandfitnessforuseofdatainmarinebiogeographicdatabases
AT hernandezfrancisco fishingfordataandsortingthecatchassessingthedataqualitycompletenessandfitnessforuseofdatainmarinebiogeographicdatabases
AT declerckolivier fishingfordataandsortingthecatchassessingthedataqualitycompletenessandfitnessforuseofdatainmarinebiogeographicdatabases
AT meesjan fishingfordataandsortingthecatchassessingthedataqualitycompletenessandfitnessforuseofdatainmarinebiogeographicdatabases