
An audit of some processing effects in aggregated occurrence records

Abstract. A total of ca 800,000 occurrence records from the Australian Museum (AM), Museums Victoria (MV) and the New Zealand Arthropod Collection (NZAC) were audited for changes in selected Darwin Core fields after processing by the Atlas of Living Australia (ALA; for AM and MV records) and the Glo...


Bibliographic Details
Main author: Mesibov, Robert
Format: Online Article Text
Language: English
Published: Pensoft Publishers 2018
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5923217/
https://www.ncbi.nlm.nih.gov/pubmed/29713234
http://dx.doi.org/10.3897/zookeys.751.24791
author Mesibov, Robert
collection PubMed
description Abstract. A total of ca 800,000 occurrence records from the Australian Museum (AM), Museums Victoria (MV) and the New Zealand Arthropod Collection (NZAC) were audited for changes in selected Darwin Core fields after processing by the Atlas of Living Australia (ALA; for AM and MV records) and the Global Biodiversity Information Facility (GBIF; for AM, MV and NZAC records). Formal taxon names in the genus- and species-groups were changed in 13–21% of AM and MV records, depending on dataset and aggregator. There was little agreement between the two aggregators on processed names, with names changed in two to three times as many records by one aggregator alone compared to records with names changed by both aggregators. The type status of specimen records did not change with name changes, resulting in confusion as to the name with which a type was associated. Data losses of up to 100% were found after processing in some fields, apparently due to programming errors. The taxonomic usefulness of occurrence records could be improved if aggregators included both original and the processed taxonomic data items for each record. It is recommended that end-users check original and processed records for data loss and name replacements after processing by aggregators.
format Online
Article
Text
id pubmed-5923217
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Pensoft Publishers
record_format MEDLINE/PubMed
spelling pubmed-5923217 2018-04-30 An audit of some processing effects in aggregated occurrence records Mesibov, Robert ZooKeys Data Paper Pensoft Publishers 2018-04-20 /pmc/articles/PMC5923217/ /pubmed/29713234 http://dx.doi.org/10.3897/zookeys.751.24791 Text en Robert Mesibov http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title An audit of some processing effects in aggregated occurrence records
topic Data Paper