Cargando…
Author disambiguation using multi-aspect similarity indicators
Key to accurate bibliometric analyses is the ability to correctly link individuals to their corpus of work, with an optimal balance between precision and recall. We have developed an algorithm that does this disambiguation task with a very high recall and precision. The method addresses the issues o...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Netherlands
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3319899/ https://www.ncbi.nlm.nih.gov/pubmed/22485059 http://dx.doi.org/10.1007/s11192-011-0589-1 |
_version_ | 1782228760463933440 |
---|---|
author | Gurney, Thomas Horlings, Edwin van den Besselaar, Peter |
author_facet | Gurney, Thomas Horlings, Edwin van den Besselaar, Peter |
author_sort | Gurney, Thomas |
collection | PubMed |
description | Key to accurate bibliometric analyses is the ability to correctly link individuals to their corpus of work, with an optimal balance between precision and recall. We have developed an algorithm that does this disambiguation task with a very high recall and precision. The method addresses the issues of discarded records due to null data fields and their resultant effect on recall, precision and F-measure results. We have implemented a dynamic approach to similarity calculations based on all available data fields. We have also included differences in author contribution and age difference between publications, both of which have meaningful effects on overall similarity measurements, resulting in significantly higher recall and precision of returned records. The results are presented from a test dataset of heterogeneous catalysis publications. Results demonstrate significantly high average F-measure scores and substantial improvements on previous and stand-alone techniques. |
format | Online Article Text |
id | pubmed-3319899 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Springer Netherlands |
record_format | MEDLINE/PubMed |
spelling | pubmed-33198992012-04-05 Author disambiguation using multi-aspect similarity indicators Gurney, Thomas Horlings, Edwin van den Besselaar, Peter Scientometrics Article Key to accurate bibliometric analyses is the ability to correctly link individuals to their corpus of work, with an optimal balance between precision and recall. We have developed an algorithm that does this disambiguation task with a very high recall and precision. The method addresses the issues of discarded records due to null data fields and their resultant effect on recall, precision and F-measure results. We have implemented a dynamic approach to similarity calculations based on all available data fields. We have also included differences in author contribution and age difference between publications, both of which have meaningful effects on overall similarity measurements, resulting in significantly higher recall and precision of returned records. The results are presented from a test dataset of heterogeneous catalysis publications. Results demonstrate significantly high average F-measure scores and substantial improvements on previous and stand-alone techniques. Springer Netherlands 2011-12-30 2012 /pmc/articles/PMC3319899/ /pubmed/22485059 http://dx.doi.org/10.1007/s11192-011-0589-1 Text en © The Author(s) 2011 https://creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited. |
spellingShingle | Article Gurney, Thomas Horlings, Edwin van den Besselaar, Peter Author disambiguation using multi-aspect similarity indicators |
title | Author disambiguation using multi-aspect similarity indicators |
title_full | Author disambiguation using multi-aspect similarity indicators |
title_fullStr | Author disambiguation using multi-aspect similarity indicators |
title_full_unstemmed | Author disambiguation using multi-aspect similarity indicators |
title_short | Author disambiguation using multi-aspect similarity indicators |
title_sort | author disambiguation using multi-aspect similarity indicators |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3319899/ https://www.ncbi.nlm.nih.gov/pubmed/22485059 http://dx.doi.org/10.1007/s11192-011-0589-1 |
work_keys_str_mv | AT gurneythomas authordisambiguationusingmultiaspectsimilarityindicators AT horlingsedwin authordisambiguationusingmultiaspectsimilarityindicators AT vandenbesselaarpeter authordisambiguationusingmultiaspectsimilarityindicators |