Cargando…

Pre2Pub—Tracking the Path From Preprint to Journal Article: Algorithm Development and Validation

BACKGROUND: The current COVID-19 crisis underscores the importance of preprints, as they allow for rapid communication of research results without delay in review. To fully integrate this type of publication into library information systems, we developed preview: a publicly available, central search...

Descripción completa

Detalles Bibliográficos
Autores principales: Langnickel, Lisa, Podorskaja, Daria, Fluck, Juliane
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8998365/
https://www.ncbi.nlm.nih.gov/pubmed/35285808
http://dx.doi.org/10.2196/34072
_version_ 1784684925769744384
author Langnickel, Lisa
Podorskaja, Daria
Fluck, Juliane
author_facet Langnickel, Lisa
Podorskaja, Daria
Fluck, Juliane
author_sort Langnickel, Lisa
collection PubMed
description BACKGROUND: The current COVID-19 crisis underscores the importance of preprints, as they allow for rapid communication of research results without delay in review. To fully integrate this type of publication into library information systems, we developed preview: a publicly available, central search engine for COVID-19–related preprints, which clearly distinguishes this source from peer-reviewed publications. The relationship between the preprint version and its corresponding journal version should be stored as metadata in both versions so that duplicates can be easily identified and information overload for researchers is reduced. OBJECTIVE: In this work, we investigated the extent to which the relationship information between preprint and corresponding journal publication is present in the published metadata, how it can be further completed, and how it can be used in preVIEW to identify already republished preprints and filter those duplicates in search results. METHODS: We first analyzed the information content available at the preprint servers themselves and the information that can be retrieved via Crossref. Moreover, we developed the algorithm Pre2Pub to find the corresponding reviewed article for each preprint. We integrated the results of those different resources into our search engine preVIEW, presented the information in the result set overview, and added filter options accordingly. RESULTS: Preprints have found their place in publication workflows; however, the link from a preprint to its corresponding journal publication is not completely covered in the metadata of the preprint servers or in Crossref. Our algorithm Pre2Pub is able to find approximately 16% more related journal articles with a precision of 99.27%. We also integrate this information in a transparent way within preVIEW so that researchers can use it in their search. CONCLUSIONS: Relationships between the preprint version and its journal version is valuable information that can help researchers finding only previously unknown information in preprints. As long as there is no transparent and complete way to store this relationship in metadata, the Pre2Pub algorithm is a suitable extension to retrieve this information.
format Online
Article
Text
id pubmed-8998365
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-89983652022-04-12 Pre2Pub—Tracking the Path From Preprint to Journal Article: Algorithm Development and Validation Langnickel, Lisa Podorskaja, Daria Fluck, Juliane J Med Internet Res Original Paper BACKGROUND: The current COVID-19 crisis underscores the importance of preprints, as they allow for rapid communication of research results without delay in review. To fully integrate this type of publication into library information systems, we developed preview: a publicly available, central search engine for COVID-19–related preprints, which clearly distinguishes this source from peer-reviewed publications. The relationship between the preprint version and its corresponding journal version should be stored as metadata in both versions so that duplicates can be easily identified and information overload for researchers is reduced. OBJECTIVE: In this work, we investigated the extent to which the relationship information between preprint and corresponding journal publication is present in the published metadata, how it can be further completed, and how it can be used in preVIEW to identify already republished preprints and filter those duplicates in search results. METHODS: We first analyzed the information content available at the preprint servers themselves and the information that can be retrieved via Crossref. Moreover, we developed the algorithm Pre2Pub to find the corresponding reviewed article for each preprint. We integrated the results of those different resources into our search engine preVIEW, presented the information in the result set overview, and added filter options accordingly. RESULTS: Preprints have found their place in publication workflows; however, the link from a preprint to its corresponding journal publication is not completely covered in the metadata of the preprint servers or in Crossref. Our algorithm Pre2Pub is able to find approximately 16% more related journal articles with a precision of 99.27%. We also integrate this information in a transparent way within preVIEW so that researchers can use it in their search. CONCLUSIONS: Relationships between the preprint version and its journal version is valuable information that can help researchers finding only previously unknown information in preprints. As long as there is no transparent and complete way to store this relationship in metadata, the Pre2Pub algorithm is a suitable extension to retrieve this information. JMIR Publications 2022-04-08 /pmc/articles/PMC8998365/ /pubmed/35285808 http://dx.doi.org/10.2196/34072 Text en ©Lisa Langnickel, Daria Podorskaja, Juliane Fluck. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 08.04.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Langnickel, Lisa
Podorskaja, Daria
Fluck, Juliane
Pre2Pub—Tracking the Path From Preprint to Journal Article: Algorithm Development and Validation
title Pre2Pub—Tracking the Path From Preprint to Journal Article: Algorithm Development and Validation
title_full Pre2Pub—Tracking the Path From Preprint to Journal Article: Algorithm Development and Validation
title_fullStr Pre2Pub—Tracking the Path From Preprint to Journal Article: Algorithm Development and Validation
title_full_unstemmed Pre2Pub—Tracking the Path From Preprint to Journal Article: Algorithm Development and Validation
title_short Pre2Pub—Tracking the Path From Preprint to Journal Article: Algorithm Development and Validation
title_sort pre2pub—tracking the path from preprint to journal article: algorithm development and validation
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8998365/
https://www.ncbi.nlm.nih.gov/pubmed/35285808
http://dx.doi.org/10.2196/34072
work_keys_str_mv AT langnickellisa pre2pubtrackingthepathfrompreprinttojournalarticlealgorithmdevelopmentandvalidation
AT podorskajadaria pre2pubtrackingthepathfrompreprinttojournalarticlealgorithmdevelopmentandvalidation
AT fluckjuliane pre2pubtrackingthepathfrompreprinttojournalarticlealgorithmdevelopmentandvalidation