
Exploring the impact of data curation criteria on the observed geographical distribution of mosses

Biodiversity data records contain inaccuracies and biases. To overcome this limitation and establish robust geographic patterns, ecologists often curate records keeping those that are most suitable for their analyses. Yet, this choice is not straightforward and the outcome of the analysis may vary d...

Bibliographic Details
Main Authors: Ronquillo, Cristina; Stropp, Juliana; Medina, Nagore G.; Hortal, Joaquin
Format: Online Article Text
Language: English
Published: John Wiley and Sons Inc. 2023
Subjects: Research Articles
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10694387/
http://dx.doi.org/10.1002/ece3.10786
author Ronquillo, Cristina
Stropp, Juliana
Medina, Nagore G.
Hortal, Joaquin
author_sort Ronquillo, Cristina
collection PubMed
description Biodiversity data records contain inaccuracies and biases. To overcome this limitation and establish robust geographic patterns, ecologists often curate records, keeping those that are most suitable for their analyses. Yet this choice is not straightforward, and the outcome of the analysis may vary due to a trade‐off between data quality and volume. This problem is particularly recurrent for less‐studied groups with patchy sampling effort. The latitudinal pattern of moss richness remains inconsistent across studies, and the reported patterns may emerge purely from sampling artefacts. Our main objective here is to assess the effect of different curation criteria on this spatial pattern in the Temperate Northern Hemisphere (above 20° latitude). We contrasted the geographical distribution of moss species records and the latitude‐species richness relation obtained under different data curation scenarios. These scenarios comprise five sources of taxonomic standardisation and eight data cleaning filters. The analyses are based on the selection of well‐surveyed cells at 100 km cell resolution. The application of some ‘data curation scenarios’ severely affects the number of records selected for analysis and substantially changes the proportion of richness per cell. The sensitivity to data curation becomes detectable at the regional and cell scales, showing a large shift in the latitudinal richness peak in Europe, from 60° N to 45° N latitude, when only preserved specimens are selected and duplicates based on date of collection and coordinates are excluded. Our results stress the importance of justifying the criteria used for filtering biodiversity data retrieved from biodiversity databases to avoid detecting misleading patterns. Curating records under particular criteria compromises the information in some areas, producing different spatial information for mosses. This problem can be ameliorated if data filtering is combined with identifying well‐surveyed cells, which renders results relatively constant under different combinations of filtering, even for less well‐known groups such as mosses.
format Online
Article
Text
id pubmed-10694387
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-10694387 2023-12-05 Exploring the impact of data curation criteria on the observed geographical distribution of mosses Ronquillo, Cristina Stropp, Juliana Medina, Nagore G. Hortal, Joaquin Ecol Evol Research Articles Biodiversity data records contain inaccuracies and biases. To overcome this limitation and establish robust geographic patterns, ecologists often curate records, keeping those that are most suitable for their analyses. Yet this choice is not straightforward, and the outcome of the analysis may vary due to a trade‐off between data quality and volume. This problem is particularly recurrent for less‐studied groups with patchy sampling effort. The latitudinal pattern of moss richness remains inconsistent across studies, and the reported patterns may emerge purely from sampling artefacts. Our main objective here is to assess the effect of different curation criteria on this spatial pattern in the Temperate Northern Hemisphere (above 20° latitude). We contrasted the geographical distribution of moss species records and the latitude‐species richness relation obtained under different data curation scenarios. These scenarios comprise five sources of taxonomic standardisation and eight data cleaning filters. The analyses are based on the selection of well‐surveyed cells at 100 km cell resolution. The application of some ‘data curation scenarios’ severely affects the number of records selected for analysis and substantially changes the proportion of richness per cell. The sensitivity to data curation becomes detectable at the regional and cell scales, showing a large shift in the latitudinal richness peak in Europe, from 60° N to 45° N latitude, when only preserved specimens are selected and duplicates based on date of collection and coordinates are excluded. Our results stress the importance of justifying the criteria used for filtering biodiversity data retrieved from biodiversity databases to avoid detecting misleading patterns. Curating records under particular criteria compromises the information in some areas, producing different spatial information for mosses. This problem can be ameliorated if data filtering is combined with identifying well‐surveyed cells, which renders results relatively constant under different combinations of filtering, even for less well‐known groups such as mosses. John Wiley and Sons Inc. 2023-12-03 /pmc/articles/PMC10694387/ http://dx.doi.org/10.1002/ece3.10786 Text en © 2023 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/ This is an open access article under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
title Exploring the impact of data curation criteria on the observed geographical distribution of mosses
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10694387/
http://dx.doi.org/10.1002/ece3.10786