Cargando…

Displaying bias in sampling effort of data accessed from biodiversity databases using ignorance maps

Abstract. BACKGROUND: Open-access biodiversity databases including mainly citizen science data make temporally and spatially extensive species’ observation data available to a wide range of users. Such data have limitations however, which include: sampling bias in favour of recorder distribution, la...

Descripción completa

Detalles Bibliográficos
Autor principal:	Ruete, Alejandro
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Pensoft Publishers 2015
Materias:	Software Description
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4549634/ https://www.ncbi.nlm.nih.gov/pubmed/26312050 http://dx.doi.org/10.3897/BDJ.3.e5361

_version_	1782387340996509696
author	Ruete, Alejandro
author_facet	Ruete, Alejandro
author_sort	Ruete, Alejandro
collection	PubMed
description	Abstract. BACKGROUND: Open-access biodiversity databases including mainly citizen science data make temporally and spatially extensive species’ observation data available to a wide range of users. Such data have limitations however, which include: sampling bias in favour of recorder distribution, lack of survey effort assessment, and lack of coverage of the distribution of all organisms. These limitations are not always recorded, while any technical assessment or scientific research based on such data should include an evaluation of the uncertainty of its source data and researchers should acknowledge this information in their analysis. The here proposed maps of ignorance are a critical and easy way to implement a tool to not only visually explore the quality of the data, but also to filter out unreliable results. NEW INFORMATION: I present simple algorithms to display ignorance maps as a tool to report the spatial distribution of the bias and lack of sampling effort across a study region. Ignorance scores are expressed solely based on raw data in order to rely on the fewest assumptions possible. Therefore there is no prediction or estimation involved. The rationale is based on the assumption that it is appropriate to use species groups as a surrogate for sampling effort because it is likely that an entire group of species observed by similar methods will share similar bias. Simple algorithms are then used to transform raw data into ignorance scores scaled 0-1 that are easily comparable and scalable. Because of the need to perform calculations over big datasets, simplicity is crucial for web-based implementations on infrastructures for biodiversity information. With these algorithms, any infrastructure for biodiversity information can offer a quality report of the observations accessed through them. Users can specify a reference taxonomic group and a time frame according to the research question. The potential of this tool lies in the simplicity of its algorithms and in the lack of assumptions made about the bias distribution, giving the user the freedom to tailor analyses to their specific needs.
format	Online Article Text
id	pubmed-4549634
institution	National Center for Biotechnology Information
language	English
publishDate	2015
publisher	Pensoft Publishers
record_format	MEDLINE/PubMed
spelling	pubmed-45496342015-08-26 Displaying bias in sampling effort of data accessed from biodiversity databases using ignorance maps Ruete, Alejandro Biodivers Data J Software Description Abstract. BACKGROUND: Open-access biodiversity databases including mainly citizen science data make temporally and spatially extensive species’ observation data available to a wide range of users. Such data have limitations however, which include: sampling bias in favour of recorder distribution, lack of survey effort assessment, and lack of coverage of the distribution of all organisms. These limitations are not always recorded, while any technical assessment or scientific research based on such data should include an evaluation of the uncertainty of its source data and researchers should acknowledge this information in their analysis. The here proposed maps of ignorance are a critical and easy way to implement a tool to not only visually explore the quality of the data, but also to filter out unreliable results. NEW INFORMATION: I present simple algorithms to display ignorance maps as a tool to report the spatial distribution of the bias and lack of sampling effort across a study region. Ignorance scores are expressed solely based on raw data in order to rely on the fewest assumptions possible. Therefore there is no prediction or estimation involved. The rationale is based on the assumption that it is appropriate to use species groups as a surrogate for sampling effort because it is likely that an entire group of species observed by similar methods will share similar bias. Simple algorithms are then used to transform raw data into ignorance scores scaled 0-1 that are easily comparable and scalable. Because of the need to perform calculations over big datasets, simplicity is crucial for web-based implementations on infrastructures for biodiversity information. With these algorithms, any infrastructure for biodiversity information can offer a quality report of the observations accessed through them. Users can specify a reference taxonomic group and a time frame according to the research question. The potential of this tool lies in the simplicity of its algorithms and in the lack of assumptions made about the bias distribution, giving the user the freedom to tailor analyses to their specific needs. Pensoft Publishers 2015-07-28 /pmc/articles/PMC4549634/ /pubmed/26312050 http://dx.doi.org/10.3897/BDJ.3.e5361 Text en Alejandro Ruete http://creativecommons.org/licenses/by/4.0 This is an open access article distributed under the terms of the Creative Commons Attribution License 4.0 (CC-BY), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Software Description Ruete, Alejandro Displaying bias in sampling effort of data accessed from biodiversity databases using ignorance maps
title	Displaying bias in sampling effort of data accessed from biodiversity databases using ignorance maps
title_full	Displaying bias in sampling effort of data accessed from biodiversity databases using ignorance maps
title_fullStr	Displaying bias in sampling effort of data accessed from biodiversity databases using ignorance maps
title_full_unstemmed	Displaying bias in sampling effort of data accessed from biodiversity databases using ignorance maps
title_short	Displaying bias in sampling effort of data accessed from biodiversity databases using ignorance maps
title_sort	displaying bias in sampling effort of data accessed from biodiversity databases using ignorance maps
topic	Software Description
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4549634/ https://www.ncbi.nlm.nih.gov/pubmed/26312050 http://dx.doi.org/10.3897/BDJ.3.e5361
work_keys_str_mv	AT ruetealejandro displayingbiasinsamplingeffortofdataaccessedfrombiodiversitydatabasesusingignorancemaps

Displaying bias in sampling effort of data accessed from biodiversity databases using ignorance maps

Ejemplares similares