Cargando…

The disambiguation of people names in biological collections

Scientific collections have been built by people. For hundreds of years, people have collected, studied, identified, preserved, documented and curated collection specimens. Understanding who those people are is of interest to historians, but much more can be made of these data by other stakeholders...

Descripción completa

Detalles Bibliográficos
Autores principales: Groom, Quentin, Bräuchler, Christian, Cubey, Robert W. N., Dillen, Mathias, Huybrechts, Pieter, Kearney, Nicole, Klazenga, Niels, Leachman, Siobhan, Paul, Deborah L, Rogers, Heather, Santos, Joaquim, Shorthouse, David Peter, Vaughan, Alison, von Mering, Sabine, Haston, Elspeth M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Pensoft Publishers 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9836581/
https://www.ncbi.nlm.nih.gov/pubmed/36761559
http://dx.doi.org/10.3897/BDJ.10.e86089
_version_ 1784868900031168512
author Groom, Quentin
Bräuchler, Christian
Cubey, Robert W. N.
Dillen, Mathias
Huybrechts, Pieter
Kearney, Nicole
Klazenga, Niels
Leachman, Siobhan
Paul, Deborah L
Rogers, Heather
Santos, Joaquim
Shorthouse, David Peter
Vaughan, Alison
von Mering, Sabine
Haston, Elspeth M
author_facet Groom, Quentin
Bräuchler, Christian
Cubey, Robert W. N.
Dillen, Mathias
Huybrechts, Pieter
Kearney, Nicole
Klazenga, Niels
Leachman, Siobhan
Paul, Deborah L
Rogers, Heather
Santos, Joaquim
Shorthouse, David Peter
Vaughan, Alison
von Mering, Sabine
Haston, Elspeth M
author_sort Groom, Quentin
collection PubMed
description Scientific collections have been built by people. For hundreds of years, people have collected, studied, identified, preserved, documented and curated collection specimens. Understanding who those people are is of interest to historians, but much more can be made of these data by other stakeholders once they have been linked to the people’s identities and their biographies. Knowing who people are helps us attribute work correctly, validate data and understand the scientific contribution of people and institutions. We can evaluate the work they have done, the interests they have, the places they have worked and what they have created from the specimens they have collected. The problem is that all we know about most of the people associated with collections are their names written on specimens. Disambiguating these people is the challenge that this paper addresses. Disambiguation of people often proves difficult in isolation and can result in staff or researchers independently trying to determine the identity of specific individuals over and over again. By sharing biographical data and building an open, collectively maintained dataset with shared knowledge, expertise and resources, it is possible to collectively deduce the identities of individuals, aggregate biographical information for each person, reduce duplication of effort and share the information locally and globally. The authors of this paper aspire to disambiguate all person names efficiently and fully in all their variations across the entirety of the biological sciences, starting with collections. Towards that vision, this paper has three key aims: to improve the linking, validation, enhancement and valorisation of person-related information within and between collections, databases and publications; to suggest good practice for identifying people involved in biological collections; and to promote coordination amongst all stakeholders, including individuals, natural history collections, institutions, learned societies, government agencies and data aggregators.
format Online
Article
Text
id pubmed-9836581
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Pensoft Publishers
record_format MEDLINE/PubMed
spelling pubmed-98365812023-02-08 The disambiguation of people names in biological collections Groom, Quentin Bräuchler, Christian Cubey, Robert W. N. Dillen, Mathias Huybrechts, Pieter Kearney, Nicole Klazenga, Niels Leachman, Siobhan Paul, Deborah L Rogers, Heather Santos, Joaquim Shorthouse, David Peter Vaughan, Alison von Mering, Sabine Haston, Elspeth M Biodivers Data J Methods Scientific collections have been built by people. For hundreds of years, people have collected, studied, identified, preserved, documented and curated collection specimens. Understanding who those people are is of interest to historians, but much more can be made of these data by other stakeholders once they have been linked to the people’s identities and their biographies. Knowing who people are helps us attribute work correctly, validate data and understand the scientific contribution of people and institutions. We can evaluate the work they have done, the interests they have, the places they have worked and what they have created from the specimens they have collected. The problem is that all we know about most of the people associated with collections are their names written on specimens. Disambiguating these people is the challenge that this paper addresses. Disambiguation of people often proves difficult in isolation and can result in staff or researchers independently trying to determine the identity of specific individuals over and over again. By sharing biographical data and building an open, collectively maintained dataset with shared knowledge, expertise and resources, it is possible to collectively deduce the identities of individuals, aggregate biographical information for each person, reduce duplication of effort and share the information locally and globally. The authors of this paper aspire to disambiguate all person names efficiently and fully in all their variations across the entirety of the biological sciences, starting with collections. Towards that vision, this paper has three key aims: to improve the linking, validation, enhancement and valorisation of person-related information within and between collections, databases and publications; to suggest good practice for identifying people involved in biological collections; and to promote coordination amongst all stakeholders, including individuals, natural history collections, institutions, learned societies, government agencies and data aggregators. Pensoft Publishers 2022-10-10 /pmc/articles/PMC9836581/ /pubmed/36761559 http://dx.doi.org/10.3897/BDJ.10.e86089 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article distributed under the terms of the CC0 Public Domain Dedication.
spellingShingle Methods
Groom, Quentin
Bräuchler, Christian
Cubey, Robert W. N.
Dillen, Mathias
Huybrechts, Pieter
Kearney, Nicole
Klazenga, Niels
Leachman, Siobhan
Paul, Deborah L
Rogers, Heather
Santos, Joaquim
Shorthouse, David Peter
Vaughan, Alison
von Mering, Sabine
Haston, Elspeth M
The disambiguation of people names in biological collections
title The disambiguation of people names in biological collections
title_full The disambiguation of people names in biological collections
title_fullStr The disambiguation of people names in biological collections
title_full_unstemmed The disambiguation of people names in biological collections
title_short The disambiguation of people names in biological collections
title_sort disambiguation of people names in biological collections
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9836581/
https://www.ncbi.nlm.nih.gov/pubmed/36761559
http://dx.doi.org/10.3897/BDJ.10.e86089
work_keys_str_mv AT groomquentin thedisambiguationofpeoplenamesinbiologicalcollections
AT brauchlerchristian thedisambiguationofpeoplenamesinbiologicalcollections
AT cubeyrobertwn thedisambiguationofpeoplenamesinbiologicalcollections
AT dillenmathias thedisambiguationofpeoplenamesinbiologicalcollections
AT huybrechtspieter thedisambiguationofpeoplenamesinbiologicalcollections
AT kearneynicole thedisambiguationofpeoplenamesinbiologicalcollections
AT klazenganiels thedisambiguationofpeoplenamesinbiologicalcollections
AT leachmansiobhan thedisambiguationofpeoplenamesinbiologicalcollections
AT pauldeborahl thedisambiguationofpeoplenamesinbiologicalcollections
AT rogersheather thedisambiguationofpeoplenamesinbiologicalcollections
AT santosjoaquim thedisambiguationofpeoplenamesinbiologicalcollections
AT shorthousedavidpeter thedisambiguationofpeoplenamesinbiologicalcollections
AT vaughanalison thedisambiguationofpeoplenamesinbiologicalcollections
AT vonmeringsabine thedisambiguationofpeoplenamesinbiologicalcollections
AT hastonelspethm thedisambiguationofpeoplenamesinbiologicalcollections
AT groomquentin disambiguationofpeoplenamesinbiologicalcollections
AT brauchlerchristian disambiguationofpeoplenamesinbiologicalcollections
AT cubeyrobertwn disambiguationofpeoplenamesinbiologicalcollections
AT dillenmathias disambiguationofpeoplenamesinbiologicalcollections
AT huybrechtspieter disambiguationofpeoplenamesinbiologicalcollections
AT kearneynicole disambiguationofpeoplenamesinbiologicalcollections
AT klazenganiels disambiguationofpeoplenamesinbiologicalcollections
AT leachmansiobhan disambiguationofpeoplenamesinbiologicalcollections
AT pauldeborahl disambiguationofpeoplenamesinbiologicalcollections
AT rogersheather disambiguationofpeoplenamesinbiologicalcollections
AT santosjoaquim disambiguationofpeoplenamesinbiologicalcollections
AT shorthousedavidpeter disambiguationofpeoplenamesinbiologicalcollections
AT vaughanalison disambiguationofpeoplenamesinbiologicalcollections
AT vonmeringsabine disambiguationofpeoplenamesinbiologicalcollections
AT hastonelspethm disambiguationofpeoplenamesinbiologicalcollections