Cargando…
Pre2Pub—Tracking the Path From Preprint to Journal Article: Algorithm Development and Validation
BACKGROUND: The current COVID-19 crisis underscores the importance of preprints, as they allow for rapid communication of research results without delay in review. To fully integrate this type of publication into library information systems, we developed preview: a publicly available, central search...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8998365/ https://www.ncbi.nlm.nih.gov/pubmed/35285808 http://dx.doi.org/10.2196/34072 |
_version_ | 1784684925769744384 |
---|---|
author | Langnickel, Lisa Podorskaja, Daria Fluck, Juliane |
author_facet | Langnickel, Lisa Podorskaja, Daria Fluck, Juliane |
author_sort | Langnickel, Lisa |
collection | PubMed |
description | BACKGROUND: The current COVID-19 crisis underscores the importance of preprints, as they allow for rapid communication of research results without delay in review. To fully integrate this type of publication into library information systems, we developed preview: a publicly available, central search engine for COVID-19–related preprints, which clearly distinguishes this source from peer-reviewed publications. The relationship between the preprint version and its corresponding journal version should be stored as metadata in both versions so that duplicates can be easily identified and information overload for researchers is reduced. OBJECTIVE: In this work, we investigated the extent to which the relationship information between preprint and corresponding journal publication is present in the published metadata, how it can be further completed, and how it can be used in preVIEW to identify already republished preprints and filter those duplicates in search results. METHODS: We first analyzed the information content available at the preprint servers themselves and the information that can be retrieved via Crossref. Moreover, we developed the algorithm Pre2Pub to find the corresponding reviewed article for each preprint. We integrated the results of those different resources into our search engine preVIEW, presented the information in the result set overview, and added filter options accordingly. RESULTS: Preprints have found their place in publication workflows; however, the link from a preprint to its corresponding journal publication is not completely covered in the metadata of the preprint servers or in Crossref. Our algorithm Pre2Pub is able to find approximately 16% more related journal articles with a precision of 99.27%. We also integrate this information in a transparent way within preVIEW so that researchers can use it in their search. CONCLUSIONS: Relationships between the preprint version and its journal version is valuable information that can help researchers finding only previously unknown information in preprints. As long as there is no transparent and complete way to store this relationship in metadata, the Pre2Pub algorithm is a suitable extension to retrieve this information. |
format | Online Article Text |
id | pubmed-8998365 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-89983652022-04-12 Pre2Pub—Tracking the Path From Preprint to Journal Article: Algorithm Development and Validation Langnickel, Lisa Podorskaja, Daria Fluck, Juliane J Med Internet Res Original Paper BACKGROUND: The current COVID-19 crisis underscores the importance of preprints, as they allow for rapid communication of research results without delay in review. To fully integrate this type of publication into library information systems, we developed preview: a publicly available, central search engine for COVID-19–related preprints, which clearly distinguishes this source from peer-reviewed publications. The relationship between the preprint version and its corresponding journal version should be stored as metadata in both versions so that duplicates can be easily identified and information overload for researchers is reduced. OBJECTIVE: In this work, we investigated the extent to which the relationship information between preprint and corresponding journal publication is present in the published metadata, how it can be further completed, and how it can be used in preVIEW to identify already republished preprints and filter those duplicates in search results. METHODS: We first analyzed the information content available at the preprint servers themselves and the information that can be retrieved via Crossref. Moreover, we developed the algorithm Pre2Pub to find the corresponding reviewed article for each preprint. We integrated the results of those different resources into our search engine preVIEW, presented the information in the result set overview, and added filter options accordingly. RESULTS: Preprints have found their place in publication workflows; however, the link from a preprint to its corresponding journal publication is not completely covered in the metadata of the preprint servers or in Crossref. Our algorithm Pre2Pub is able to find approximately 16% more related journal articles with a precision of 99.27%. We also integrate this information in a transparent way within preVIEW so that researchers can use it in their search. CONCLUSIONS: Relationships between the preprint version and its journal version is valuable information that can help researchers finding only previously unknown information in preprints. As long as there is no transparent and complete way to store this relationship in metadata, the Pre2Pub algorithm is a suitable extension to retrieve this information. JMIR Publications 2022-04-08 /pmc/articles/PMC8998365/ /pubmed/35285808 http://dx.doi.org/10.2196/34072 Text en ©Lisa Langnickel, Daria Podorskaja, Juliane Fluck. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 08.04.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Langnickel, Lisa Podorskaja, Daria Fluck, Juliane Pre2Pub—Tracking the Path From Preprint to Journal Article: Algorithm Development and Validation |
title | Pre2Pub—Tracking the Path From Preprint to Journal Article: Algorithm Development and Validation |
title_full | Pre2Pub—Tracking the Path From Preprint to Journal Article: Algorithm Development and Validation |
title_fullStr | Pre2Pub—Tracking the Path From Preprint to Journal Article: Algorithm Development and Validation |
title_full_unstemmed | Pre2Pub—Tracking the Path From Preprint to Journal Article: Algorithm Development and Validation |
title_short | Pre2Pub—Tracking the Path From Preprint to Journal Article: Algorithm Development and Validation |
title_sort | pre2pub—tracking the path from preprint to journal article: algorithm development and validation |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8998365/ https://www.ncbi.nlm.nih.gov/pubmed/35285808 http://dx.doi.org/10.2196/34072 |
work_keys_str_mv | AT langnickellisa pre2pubtrackingthepathfrompreprinttojournalarticlealgorithmdevelopmentandvalidation AT podorskajadaria pre2pubtrackingthepathfrompreprinttojournalarticlealgorithmdevelopmentandvalidation AT fluckjuliane pre2pubtrackingthepathfrompreprinttojournalarticlealgorithmdevelopmentandvalidation |