Cargando…
Exploring the impact of data curation criteria on the observed geographical distribution of mosses
Biodiversity data records contain inaccuracies and biases. To overcome this limitation and establish robust geographic patterns, ecologists often curate records keeping those that are most suitable for their analyses. Yet, this choice is not straightforward and the outcome of the analysis may vary d...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10694387/ http://dx.doi.org/10.1002/ece3.10786 |
_version_ | 1785153365663023104 |
---|---|
author | Ronquillo, Cristina Stropp, Juliana Medina, Nagore G. Hortal, Joaquin |
author_facet | Ronquillo, Cristina Stropp, Juliana Medina, Nagore G. Hortal, Joaquin |
author_sort | Ronquillo, Cristina |
collection | PubMed |
description | Biodiversity data records contain inaccuracies and biases. To overcome this limitation and establish robust geographic patterns, ecologists often curate records keeping those that are most suitable for their analyses. Yet, this choice is not straightforward and the outcome of the analysis may vary due to a trade‐off between data quality and volume. This problem is particularly recurrent for less‐studied groups with patchy sampling effort. The latitudinal pattern of mosses richness remains inconsistent across studies and these may emerge purely from sampling artefacts. Our main objective here is to assess the effect of different curation criteria on this spatial pattern in the Temperate Northern Hemisphere (above 20° latitude). We contrasted the geographical distribution of moss species records and the latitude‐species richness relation obtained under different data curation scenarios. These scenarios comprehend five sources of taxonomical standardisations and eight data cleaning filters. The analyses are based on the selection of well‐surveyed cells at 100 km cell resolution. The application of some ‘data curation scenarios’ severely affects the number of records selected for analysis and substantially changes the proportion of richness per cell. The sensitivity to data curation becomes detectable at regional and at the cell scales showing a large shift in the latitudinal richness peak in Europe, from 60° N to 45° N latitude, when only preserved specimens are selected and duplicates based on date of collection and coordinates are excluded. Our results stress the importance of justifying the criteria used for filtering biodiversity data retrieved from biodiversity databases to avoid detecting misleading patterns. Curating records under particular criteria compromises the information in some areas displaying different spatial information of mosses. This problem can be ameliorated if data filtering is combined with identifying well‐surveyed cells, render relatively constant results under different combinations of filtering even for less well‐known groups such as mosses. |
format | Online Article Text |
id | pubmed-10694387 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-106943872023-12-05 Exploring the impact of data curation criteria on the observed geographical distribution of mosses Ronquillo, Cristina Stropp, Juliana Medina, Nagore G. Hortal, Joaquin Ecol Evol Research Articles Biodiversity data records contain inaccuracies and biases. To overcome this limitation and establish robust geographic patterns, ecologists often curate records keeping those that are most suitable for their analyses. Yet, this choice is not straightforward and the outcome of the analysis may vary due to a trade‐off between data quality and volume. This problem is particularly recurrent for less‐studied groups with patchy sampling effort. The latitudinal pattern of mosses richness remains inconsistent across studies and these may emerge purely from sampling artefacts. Our main objective here is to assess the effect of different curation criteria on this spatial pattern in the Temperate Northern Hemisphere (above 20° latitude). We contrasted the geographical distribution of moss species records and the latitude‐species richness relation obtained under different data curation scenarios. These scenarios comprehend five sources of taxonomical standardisations and eight data cleaning filters. The analyses are based on the selection of well‐surveyed cells at 100 km cell resolution. The application of some ‘data curation scenarios’ severely affects the number of records selected for analysis and substantially changes the proportion of richness per cell. The sensitivity to data curation becomes detectable at regional and at the cell scales showing a large shift in the latitudinal richness peak in Europe, from 60° N to 45° N latitude, when only preserved specimens are selected and duplicates based on date of collection and coordinates are excluded. Our results stress the importance of justifying the criteria used for filtering biodiversity data retrieved from biodiversity databases to avoid detecting misleading patterns. Curating records under particular criteria compromises the information in some areas displaying different spatial information of mosses. This problem can be ameliorated if data filtering is combined with identifying well‐surveyed cells, render relatively constant results under different combinations of filtering even for less well‐known groups such as mosses. John Wiley and Sons Inc. 2023-12-03 /pmc/articles/PMC10694387/ http://dx.doi.org/10.1002/ece3.10786 Text en © 2023 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Articles Ronquillo, Cristina Stropp, Juliana Medina, Nagore G. Hortal, Joaquin Exploring the impact of data curation criteria on the observed geographical distribution of mosses |
title | Exploring the impact of data curation criteria on the observed geographical distribution of mosses |
title_full | Exploring the impact of data curation criteria on the observed geographical distribution of mosses |
title_fullStr | Exploring the impact of data curation criteria on the observed geographical distribution of mosses |
title_full_unstemmed | Exploring the impact of data curation criteria on the observed geographical distribution of mosses |
title_short | Exploring the impact of data curation criteria on the observed geographical distribution of mosses |
title_sort | exploring the impact of data curation criteria on the observed geographical distribution of mosses |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10694387/ http://dx.doi.org/10.1002/ece3.10786 |
work_keys_str_mv | AT ronquillocristina exploringtheimpactofdatacurationcriteriaontheobservedgeographicaldistributionofmosses AT stroppjuliana exploringtheimpactofdatacurationcriteriaontheobservedgeographicaldistributionofmosses AT medinanagoreg exploringtheimpactofdatacurationcriteriaontheobservedgeographicaldistributionofmosses AT hortaljoaquin exploringtheimpactofdatacurationcriteriaontheobservedgeographicaldistributionofmosses |