Cargando…

Taking a ‘Big Data’ approach to data quality in a citizen science project

Data from well-designed experiments provide the strongest evidence of causation in biodiversity studies. However, for many species the collection of these data is not scalable to the spatial and temporal extents required to understand patterns at the population level. Only data collected from citize...

Descripción completa

Detalles Bibliográficos
Autores principales: Kelling, Steve, Fink, Daniel, La Sorte, Frank A., Johnston, Alison, Bruns, Nicholas E., Hochachka, Wesley M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Netherlands 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4623867/
https://www.ncbi.nlm.nih.gov/pubmed/26508347
http://dx.doi.org/10.1007/s13280-015-0710-4
_version_ 1782397755052785664
author Kelling, Steve
Fink, Daniel
La Sorte, Frank A.
Johnston, Alison
Bruns, Nicholas E.
Hochachka, Wesley M.
author_facet Kelling, Steve
Fink, Daniel
La Sorte, Frank A.
Johnston, Alison
Bruns, Nicholas E.
Hochachka, Wesley M.
author_sort Kelling, Steve
collection PubMed
description Data from well-designed experiments provide the strongest evidence of causation in biodiversity studies. However, for many species the collection of these data is not scalable to the spatial and temporal extents required to understand patterns at the population level. Only data collected from citizen science projects can gather sufficient quantities of data, but data collected from volunteers are inherently noisy and heterogeneous. Here we describe a ‘Big Data’ approach to improve the data quality in eBird, a global citizen science project that gathers bird observations. First, eBird’s data submission design ensures that all data meet high standards of completeness and accuracy. Second, we take a ‘sensor calibration’ approach to measure individual variation in eBird participant’s ability to detect and identify birds. Third, we use species distribution models to fill in data gaps. Finally, we provide examples of novel analyses exploring population-level patterns in bird distributions.
format Online
Article
Text
id pubmed-4623867
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Springer Netherlands
record_format MEDLINE/PubMed
spelling pubmed-46238672015-10-30 Taking a ‘Big Data’ approach to data quality in a citizen science project Kelling, Steve Fink, Daniel La Sorte, Frank A. Johnston, Alison Bruns, Nicholas E. Hochachka, Wesley M. Ambio Article Data from well-designed experiments provide the strongest evidence of causation in biodiversity studies. However, for many species the collection of these data is not scalable to the spatial and temporal extents required to understand patterns at the population level. Only data collected from citizen science projects can gather sufficient quantities of data, but data collected from volunteers are inherently noisy and heterogeneous. Here we describe a ‘Big Data’ approach to improve the data quality in eBird, a global citizen science project that gathers bird observations. First, eBird’s data submission design ensures that all data meet high standards of completeness and accuracy. Second, we take a ‘sensor calibration’ approach to measure individual variation in eBird participant’s ability to detect and identify birds. Third, we use species distribution models to fill in data gaps. Finally, we provide examples of novel analyses exploring population-level patterns in bird distributions. Springer Netherlands 2015-10-27 2015-11 /pmc/articles/PMC4623867/ /pubmed/26508347 http://dx.doi.org/10.1007/s13280-015-0710-4 Text en © The Author(s) 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Article
Kelling, Steve
Fink, Daniel
La Sorte, Frank A.
Johnston, Alison
Bruns, Nicholas E.
Hochachka, Wesley M.
Taking a ‘Big Data’ approach to data quality in a citizen science project
title Taking a ‘Big Data’ approach to data quality in a citizen science project
title_full Taking a ‘Big Data’ approach to data quality in a citizen science project
title_fullStr Taking a ‘Big Data’ approach to data quality in a citizen science project
title_full_unstemmed Taking a ‘Big Data’ approach to data quality in a citizen science project
title_short Taking a ‘Big Data’ approach to data quality in a citizen science project
title_sort taking a ‘big data’ approach to data quality in a citizen science project
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4623867/
https://www.ncbi.nlm.nih.gov/pubmed/26508347
http://dx.doi.org/10.1007/s13280-015-0710-4
work_keys_str_mv AT kellingsteve takingabigdataapproachtodataqualityinacitizenscienceproject
AT finkdaniel takingabigdataapproachtodataqualityinacitizenscienceproject
AT lasortefranka takingabigdataapproachtodataqualityinacitizenscienceproject
AT johnstonalison takingabigdataapproachtodataqualityinacitizenscienceproject
AT brunsnicholase takingabigdataapproachtodataqualityinacitizenscienceproject
AT hochachkawesleym takingabigdataapproachtodataqualityinacitizenscienceproject