Cargando…

Authormagic – An Approach to Author Disambiguation in Large-Scale Digital Libraries

A collaboration of leading research centers in the field of High Energy Physics (HEP) has built INSPIRE, a novel information infrastructure, which comprises the entire corpus of about one million documents produced within the discipline, including a rich set of metadata, citation information and hal...

Descripción completa

Detalles Bibliográficos
Autores principales: Weiler, Henning, Meyer-Wegener, Klaus, Mele, Salvatore
Lenguaje:eng
Publicado: 2011
Materias:
Acceso en línea:https://dx.doi.org/10.1145/2063576.2063949
http://cds.cern.ch/record/1698253
_version_ 1780936208284647424
author Weiler, Henning
Meyer-Wegener, Klaus
Mele, Salvatore
author_facet Weiler, Henning
Meyer-Wegener, Klaus
Mele, Salvatore
author_sort Weiler, Henning
collection CERN
description A collaboration of leading research centers in the field of High Energy Physics (HEP) has built INSPIRE, a novel information infrastructure, which comprises the entire corpus of about one million documents produced within the discipline, including a rich set of metadata, citation information and half a million full-text documents, and offers a unique opportunity for author disambiguation strategies. The presented approach features extended metadata comparison metrics and a three-step unsupervised graph clustering technique. The algorithm aided in identifying 200'000 individuals from 6'500'000 author signatures. Preliminary tests based on knowledge of external experts and a pilot of a crowd-sourcing system show a success rate of more than 96% within the selected test cases. The obtained author clusters serve as a recommendation for INSPIRE users to further clean the publication list in a crowd-sourced approach.
id cern-1698253
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2011
record_format invenio
spelling cern-16982532019-09-30T06:29:59Zdoi:10.1145/2063576.2063949http://cds.cern.ch/record/1698253engWeiler, HenningMeyer-Wegener, KlausMele, SalvatoreAuthormagic – An Approach to Author Disambiguation in Large-Scale Digital LibrariesInformation Transfer and ManagementA collaboration of leading research centers in the field of High Energy Physics (HEP) has built INSPIRE, a novel information infrastructure, which comprises the entire corpus of about one million documents produced within the discipline, including a rich set of metadata, citation information and half a million full-text documents, and offers a unique opportunity for author disambiguation strategies. The presented approach features extended metadata comparison metrics and a three-step unsupervised graph clustering technique. The algorithm aided in identifying 200'000 individuals from 6'500'000 author signatures. Preliminary tests based on knowledge of external experts and a pilot of a crowd-sourcing system show a success rate of more than 96% within the selected test cases. The obtained author clusters serve as a recommendation for INSPIRE users to further clean the publication list in a crowd-sourced approach.oai:cds.cern.ch:16982532011
spellingShingle Information Transfer and Management
Weiler, Henning
Meyer-Wegener, Klaus
Mele, Salvatore
Authormagic – An Approach to Author Disambiguation in Large-Scale Digital Libraries
title Authormagic – An Approach to Author Disambiguation in Large-Scale Digital Libraries
title_full Authormagic – An Approach to Author Disambiguation in Large-Scale Digital Libraries
title_fullStr Authormagic – An Approach to Author Disambiguation in Large-Scale Digital Libraries
title_full_unstemmed Authormagic – An Approach to Author Disambiguation in Large-Scale Digital Libraries
title_short Authormagic – An Approach to Author Disambiguation in Large-Scale Digital Libraries
title_sort authormagic – an approach to author disambiguation in large-scale digital libraries
topic Information Transfer and Management
url https://dx.doi.org/10.1145/2063576.2063949
http://cds.cern.ch/record/1698253
work_keys_str_mv AT weilerhenning authormagicanapproachtoauthordisambiguationinlargescaledigitallibraries
AT meyerwegenerklaus authormagicanapproachtoauthordisambiguationinlargescaledigitallibraries
AT melesalvatore authormagicanapproachtoauthordisambiguationinlargescaledigitallibraries