Author disambiguation using multi-aspect similarity indicators

Key to accurate bibliometric analyses is the ability to correctly link individuals to their corpus of work, with an optimal balance between precision and recall. We have developed an algorithm that does this disambiguation task with a very high recall and precision. The method addresses the issues o...

Bibliographic Details
Main Authors: Gurney, Thomas; Horlings, Edwin; van den Besselaar, Peter
Format: Online Article Text
Language: English
Published: Springer Netherlands 2011
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3319899/
https://www.ncbi.nlm.nih.gov/pubmed/22485059
http://dx.doi.org/10.1007/s11192-011-0589-1
author Gurney, Thomas
Horlings, Edwin
van den Besselaar, Peter
collection PubMed
description Key to accurate bibliometric analyses is the ability to correctly link individuals to their corpus of work, with an optimal balance between precision and recall. We have developed an algorithm that performs this disambiguation task with very high recall and precision. The method addresses the problem of records discarded because of null data fields and the resulting effect on recall, precision and F-measure. We have implemented a dynamic approach to similarity calculations based on all available data fields. We have also included differences in author contribution and the age difference between publications, both of which have meaningful effects on overall similarity measurements, resulting in significantly higher recall and precision of returned records. The results are presented for a test dataset of heterogeneous catalysis publications. They demonstrate high average F-measure scores and substantial improvements over previous and stand-alone techniques.
format Online
Article
Text
id pubmed-3319899
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Springer Netherlands
record_format MEDLINE/PubMed
spelling pubmed-3319899; 2012-04-05; Author disambiguation using multi-aspect similarity indicators; Gurney, Thomas; Horlings, Edwin; van den Besselaar, Peter; Scientometrics; Article; Springer Netherlands; 2011-12-30; 2012; /pmc/articles/PMC3319899/; /pubmed/22485059; http://dx.doi.org/10.1007/s11192-011-0589-1; Text; en; © The Author(s) 2011; https://creativecommons.org/licenses/by-nc/4.0/; This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
title Author disambiguation using multi-aspect similarity indicators
topic Article