Cargando…

Predicting substantive biomedical citations without full text

Insights from biomedical citation networks can be used to identify promising avenues for accelerating research and its downstream bench-to-bedside translation. Citation analysis generally assumes that each citation documents substantive knowledge transfer that informed the conception, design, or exe...

Descripción completa

Detalles Bibliográficos
Autores principales: Hoppe, Travis A., Arabi, Salsabil, Hutchins, B. Ian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10372685/
https://www.ncbi.nlm.nih.gov/pubmed/37463199
http://dx.doi.org/10.1073/pnas.2213697120
_version_ 1785078418990170112
author Hoppe, Travis A.
Arabi, Salsabil
Hutchins, B. Ian
author_facet Hoppe, Travis A.
Arabi, Salsabil
Hutchins, B. Ian
author_sort Hoppe, Travis A.
collection PubMed
description Insights from biomedical citation networks can be used to identify promising avenues for accelerating research and its downstream bench-to-bedside translation. Citation analysis generally assumes that each citation documents substantive knowledge transfer that informed the conception, design, or execution of the main experiments. Citations may exist for other reasons. In this paper, we take advantage of late-stage citations added during peer review because these are less likely to represent substantive knowledge flow. Using a large, comprehensive feature set of open access data, we train a predictive model to identify late-stage citations. The model relies only on the title, abstract, and citations to previous articles but not the full-text or future citations patterns, making it suitable for publications as soon as they are released, or those behind a paywall (the vast majority). We find that high prediction scores identify late-stage citations that were likely added during the peer review process as well as those more likely to be rhetorical, such as journal self-citations added during review. Our model conversely gives low prediction scores to early-stage citations and citation classes that are known to represent substantive knowledge transfer. Using this model, we find that US federally funded biomedical research publications represent 30% of the predicted early-stage (and more likely to be substantive) knowledge transfer from basic studies to clinical research, even though these comprise only 10% of the literature. This is a threefold overrepresentation in this important type of knowledge flow.
format Online
Article
Text
id pubmed-10372685
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-103726852023-07-28 Predicting substantive biomedical citations without full text Hoppe, Travis A. Arabi, Salsabil Hutchins, B. Ian Proc Natl Acad Sci U S A Social Sciences Insights from biomedical citation networks can be used to identify promising avenues for accelerating research and its downstream bench-to-bedside translation. Citation analysis generally assumes that each citation documents substantive knowledge transfer that informed the conception, design, or execution of the main experiments. Citations may exist for other reasons. In this paper, we take advantage of late-stage citations added during peer review because these are less likely to represent substantive knowledge flow. Using a large, comprehensive feature set of open access data, we train a predictive model to identify late-stage citations. The model relies only on the title, abstract, and citations to previous articles but not the full-text or future citations patterns, making it suitable for publications as soon as they are released, or those behind a paywall (the vast majority). We find that high prediction scores identify late-stage citations that were likely added during the peer review process as well as those more likely to be rhetorical, such as journal self-citations added during review. Our model conversely gives low prediction scores to early-stage citations and citation classes that are known to represent substantive knowledge transfer. Using this model, we find that US federally funded biomedical research publications represent 30% of the predicted early-stage (and more likely to be substantive) knowledge transfer from basic studies to clinical research, even though these comprise only 10% of the literature. This is a threefold overrepresentation in this important type of knowledge flow. National Academy of Sciences 2023-07-18 2023-07-25 /pmc/articles/PMC10372685/ /pubmed/37463199 http://dx.doi.org/10.1073/pnas.2213697120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Social Sciences
Hoppe, Travis A.
Arabi, Salsabil
Hutchins, B. Ian
Predicting substantive biomedical citations without full text
title Predicting substantive biomedical citations without full text
title_full Predicting substantive biomedical citations without full text
title_fullStr Predicting substantive biomedical citations without full text
title_full_unstemmed Predicting substantive biomedical citations without full text
title_short Predicting substantive biomedical citations without full text
title_sort predicting substantive biomedical citations without full text
topic Social Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10372685/
https://www.ncbi.nlm.nih.gov/pubmed/37463199
http://dx.doi.org/10.1073/pnas.2213697120
work_keys_str_mv AT hoppetravisa predictingsubstantivebiomedicalcitationswithoutfulltext
AT arabisalsabil predictingsubstantivebiomedicalcitationswithoutfulltext
AT hutchinsbian predictingsubstantivebiomedicalcitationswithoutfulltext