Authormagic – An Approach to Author Disambiguation in Large-Scale Digital Libraries
A collaboration of leading research centers in the field of High Energy Physics (HEP) has built INSPIRE, a novel information infrastructure, which comprises the entire corpus of about one million documents produced within the discipline, including a rich set of metadata, citation information and hal...
Main authors: | Weiler, Henning; Meyer-Wegener, Klaus; Mele, Salvatore |
---|---|
Language: | eng |
Published: | 2011 |
Subjects: | Information Transfer and Management |
Online access: | https://dx.doi.org/10.1145/2063576.2063949 http://cds.cern.ch/record/1698253 |
_version_ | 1780936208284647424 |
---|---|
author | Weiler, Henning; Meyer-Wegener, Klaus; Mele, Salvatore |
author_facet | Weiler, Henning; Meyer-Wegener, Klaus; Mele, Salvatore |
author_sort | Weiler, Henning |
collection | CERN |
description | A collaboration of leading research centers in the field of High Energy Physics (HEP) has built INSPIRE, a novel information infrastructure, which comprises the entire corpus of about one million documents produced within the discipline, including a rich set of metadata, citation information and half a million full-text documents, and offers a unique opportunity for author disambiguation strategies. The presented approach features extended metadata comparison metrics and a three-step unsupervised graph clustering technique. The algorithm aided in identifying 200'000 individuals from 6'500'000 author signatures. Preliminary tests based on knowledge of external experts and a pilot of a crowd-sourcing system show a success rate of more than 96% within the selected test cases. The obtained author clusters serve as a recommendation for INSPIRE users to further clean the publication list in a crowd-sourced approach. |
id | cern-1698253 |
institution | European Organization for Nuclear Research (CERN) |
language | eng |
publishDate | 2011 |
record_format | invenio |
spelling | cern-1698253; 2019-09-30T06:29:59Z; doi:10.1145/2063576.2063949; http://cds.cern.ch/record/1698253; eng; Weiler, Henning; Meyer-Wegener, Klaus; Mele, Salvatore; Authormagic – An Approach to Author Disambiguation in Large-Scale Digital Libraries; Information Transfer and Management; A collaboration of leading research centers in the field of High Energy Physics (HEP) has built INSPIRE, a novel information infrastructure, which comprises the entire corpus of about one million documents produced within the discipline, including a rich set of metadata, citation information and half a million full-text documents, and offers a unique opportunity for author disambiguation strategies. The presented approach features extended metadata comparison metrics and a three-step unsupervised graph clustering technique. The algorithm aided in identifying 200'000 individuals from 6'500'000 author signatures. Preliminary tests based on knowledge of external experts and a pilot of a crowd-sourcing system show a success rate of more than 96% within the selected test cases. The obtained author clusters serve as a recommendation for INSPIRE users to further clean the publication list in a crowd-sourced approach.; oai:cds.cern.ch:1698253; 2011 |
spellingShingle | Information Transfer and Management Weiler, Henning Meyer-Wegener, Klaus Mele, Salvatore Authormagic – An Approach to Author Disambiguation in Large-Scale Digital Libraries |
title | Authormagic – An Approach to Author Disambiguation in Large-Scale Digital Libraries |
title_full | Authormagic – An Approach to Author Disambiguation in Large-Scale Digital Libraries |
title_fullStr | Authormagic – An Approach to Author Disambiguation in Large-Scale Digital Libraries |
title_full_unstemmed | Authormagic – An Approach to Author Disambiguation in Large-Scale Digital Libraries |
title_short | Authormagic – An Approach to Author Disambiguation in Large-Scale Digital Libraries |
title_sort | authormagic – an approach to author disambiguation in large-scale digital libraries |
topic | Information Transfer and Management |
url | https://dx.doi.org/10.1145/2063576.2063949 http://cds.cern.ch/record/1698253 |
work_keys_str_mv | AT weilerhenning authormagicanapproachtoauthordisambiguationinlargescaledigitallibraries AT meyerwegenerklaus authormagicanapproachtoauthordisambiguationinlargescaledigitallibraries AT melesalvatore authormagicanapproachtoauthordisambiguationinlargescaledigitallibraries |