Cargando…

Man versus machine? Self-reports versus algorithmic measurement of publications

This paper uses newly available data from Web of Science on publications matched to researchers in Survey of Doctorate Recipients to compare the quality of scientific publication data collected by surveys versus algorithmic approaches. We illustrate the different types of measurement errors in self-...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Xuan, Chang, Wan-Ying, Weinberg, Bruce A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8480886/
https://www.ncbi.nlm.nih.gov/pubmed/34587169
http://dx.doi.org/10.1371/journal.pone.0257309
_version_ 1784576563101040640
author Jiang, Xuan
Chang, Wan-Ying
Weinberg, Bruce A.
author_facet Jiang, Xuan
Chang, Wan-Ying
Weinberg, Bruce A.
author_sort Jiang, Xuan
collection PubMed
description This paper uses newly available data from Web of Science on publications matched to researchers in Survey of Doctorate Recipients to compare the quality of scientific publication data collected by surveys versus algorithmic approaches. We illustrate the different types of measurement errors in self-reported and machine-generated data by estimating how publication measures from the two approaches are related to career outcomes (e.g., salaries and faculty rankings). We find that the potential biases in the self-reports are smaller relative to the algorithmic data. Moreover, the errors in the two approaches are quite intuitive: the measurement errors in algorithmic data are mainly due to the accuracy of matching, which primarily depends on the frequency of names and the data that was available to make matches, while the noise in self reports increases over the career as researchers’ publication records become more complex, harder to recall, and less immediately relevant for career progress. At a methodological level, we show how the approaches can be evaluated using accepted statistical methods without gold standard data. We also provide guidance on how to use the new linked data.
format Online
Article
Text
id pubmed-8480886
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-84808862021-09-30 Man versus machine? Self-reports versus algorithmic measurement of publications Jiang, Xuan Chang, Wan-Ying Weinberg, Bruce A. PLoS One Research Article This paper uses newly available data from Web of Science on publications matched to researchers in Survey of Doctorate Recipients to compare the quality of scientific publication data collected by surveys versus algorithmic approaches. We illustrate the different types of measurement errors in self-reported and machine-generated data by estimating how publication measures from the two approaches are related to career outcomes (e.g., salaries and faculty rankings). We find that the potential biases in the self-reports are smaller relative to the algorithmic data. Moreover, the errors in the two approaches are quite intuitive: the measurement errors in algorithmic data are mainly due to the accuracy of matching, which primarily depends on the frequency of names and the data that was available to make matches, while the noise in self reports increases over the career as researchers’ publication records become more complex, harder to recall, and less immediately relevant for career progress. At a methodological level, we show how the approaches can be evaluated using accepted statistical methods without gold standard data. We also provide guidance on how to use the new linked data. Public Library of Science 2021-09-29 /pmc/articles/PMC8480886/ /pubmed/34587169 http://dx.doi.org/10.1371/journal.pone.0257309 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Jiang, Xuan
Chang, Wan-Ying
Weinberg, Bruce A.
Man versus machine? Self-reports versus algorithmic measurement of publications
title Man versus machine? Self-reports versus algorithmic measurement of publications
title_full Man versus machine? Self-reports versus algorithmic measurement of publications
title_fullStr Man versus machine? Self-reports versus algorithmic measurement of publications
title_full_unstemmed Man versus machine? Self-reports versus algorithmic measurement of publications
title_short Man versus machine? Self-reports versus algorithmic measurement of publications
title_sort man versus machine? self-reports versus algorithmic measurement of publications
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8480886/
https://www.ncbi.nlm.nih.gov/pubmed/34587169
http://dx.doi.org/10.1371/journal.pone.0257309
work_keys_str_mv AT jiangxuan manversusmachineselfreportsversusalgorithmicmeasurementofpublications
AT changwanying manversusmachineselfreportsversusalgorithmicmeasurementofpublications
AT weinbergbrucea manversusmachineselfreportsversusalgorithmicmeasurementofpublications