Cargando…
Taking a ‘Big Data’ approach to data quality in a citizen science project
Data from well-designed experiments provide the strongest evidence of causation in biodiversity studies. However, for many species the collection of these data is not scalable to the spatial and temporal extents required to understand patterns at the population level. Only data collected from citize...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Netherlands
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4623867/ https://www.ncbi.nlm.nih.gov/pubmed/26508347 http://dx.doi.org/10.1007/s13280-015-0710-4 |
_version_ | 1782397755052785664 |
---|---|
author | Kelling, Steve Fink, Daniel La Sorte, Frank A. Johnston, Alison Bruns, Nicholas E. Hochachka, Wesley M. |
author_facet | Kelling, Steve Fink, Daniel La Sorte, Frank A. Johnston, Alison Bruns, Nicholas E. Hochachka, Wesley M. |
author_sort | Kelling, Steve |
collection | PubMed |
description | Data from well-designed experiments provide the strongest evidence of causation in biodiversity studies. However, for many species the collection of these data is not scalable to the spatial and temporal extents required to understand patterns at the population level. Only data collected from citizen science projects can gather sufficient quantities of data, but data collected from volunteers are inherently noisy and heterogeneous. Here we describe a ‘Big Data’ approach to improve the data quality in eBird, a global citizen science project that gathers bird observations. First, eBird’s data submission design ensures that all data meet high standards of completeness and accuracy. Second, we take a ‘sensor calibration’ approach to measure individual variation in eBird participant’s ability to detect and identify birds. Third, we use species distribution models to fill in data gaps. Finally, we provide examples of novel analyses exploring population-level patterns in bird distributions. |
format | Online Article Text |
id | pubmed-4623867 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Springer Netherlands |
record_format | MEDLINE/PubMed |
spelling | pubmed-46238672015-10-30 Taking a ‘Big Data’ approach to data quality in a citizen science project Kelling, Steve Fink, Daniel La Sorte, Frank A. Johnston, Alison Bruns, Nicholas E. Hochachka, Wesley M. Ambio Article Data from well-designed experiments provide the strongest evidence of causation in biodiversity studies. However, for many species the collection of these data is not scalable to the spatial and temporal extents required to understand patterns at the population level. Only data collected from citizen science projects can gather sufficient quantities of data, but data collected from volunteers are inherently noisy and heterogeneous. Here we describe a ‘Big Data’ approach to improve the data quality in eBird, a global citizen science project that gathers bird observations. First, eBird’s data submission design ensures that all data meet high standards of completeness and accuracy. Second, we take a ‘sensor calibration’ approach to measure individual variation in eBird participant’s ability to detect and identify birds. Third, we use species distribution models to fill in data gaps. Finally, we provide examples of novel analyses exploring population-level patterns in bird distributions. Springer Netherlands 2015-10-27 2015-11 /pmc/articles/PMC4623867/ /pubmed/26508347 http://dx.doi.org/10.1007/s13280-015-0710-4 Text en © The Author(s) 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Article Kelling, Steve Fink, Daniel La Sorte, Frank A. Johnston, Alison Bruns, Nicholas E. Hochachka, Wesley M. Taking a ‘Big Data’ approach to data quality in a citizen science project |
title | Taking a ‘Big Data’ approach to data quality in a citizen science project |
title_full | Taking a ‘Big Data’ approach to data quality in a citizen science project |
title_fullStr | Taking a ‘Big Data’ approach to data quality in a citizen science project |
title_full_unstemmed | Taking a ‘Big Data’ approach to data quality in a citizen science project |
title_short | Taking a ‘Big Data’ approach to data quality in a citizen science project |
title_sort | taking a ‘big data’ approach to data quality in a citizen science project |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4623867/ https://www.ncbi.nlm.nih.gov/pubmed/26508347 http://dx.doi.org/10.1007/s13280-015-0710-4 |
work_keys_str_mv | AT kellingsteve takingabigdataapproachtodataqualityinacitizenscienceproject AT finkdaniel takingabigdataapproachtodataqualityinacitizenscienceproject AT lasortefranka takingabigdataapproachtodataqualityinacitizenscienceproject AT johnstonalison takingabigdataapproachtodataqualityinacitizenscienceproject AT brunsnicholase takingabigdataapproachtodataqualityinacitizenscienceproject AT hochachkawesleym takingabigdataapproachtodataqualityinacitizenscienceproject |