
Long-term availability of data associated with articles in PLOS ONE

The adoption of journal policies requiring authors to include a Data Availability Statement has helped to increase the availability of research data associated with research articles. However, having a Data Availability Statement is not a guarantee that readers will be able to locate the data; even...


Bibliographic Details
Main Author: Federer, Lisa M.
Format: Online Article Text
Language: English
Published: Public Library of Science 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9401135/
https://www.ncbi.nlm.nih.gov/pubmed/36001577
http://dx.doi.org/10.1371/journal.pone.0272845
author Federer, Lisa M.
collection PubMed
description The adoption of journal policies requiring authors to include a Data Availability Statement has helped to increase the availability of research data associated with research articles. However, having a Data Availability Statement is not a guarantee that readers will be able to locate the data; even if provided with an identifier like a uniform resource locator (URL) or a digital object identifier (DOI), the data may become unavailable due to link rot and content drift. To explore the long-term availability of resources including data, code, and other digital research objects associated with papers, this study extracted 8,503 URLs and DOIs from a corpus of nearly 50,000 Data Availability Statements from papers published in PLOS ONE between 2014 and 2016. These URLs and DOIs were used to attempt to retrieve the data through both automated and manual means. Overall, 80% of the resources could be retrieved automatically, compared to much lower retrieval rates of 10–40% found in previous papers that relied on contacting authors to locate data. Because a URL or DOI might be valid but still not point to the resource, a subset of 350 URLs and 350 DOIs were manually tested, with 78% and 98% of resources, respectively, successfully retrieved. Having a DOI and being shared in a repository were both positively associated with availability. Although resources associated with older papers were slightly less likely to be available, this difference was not statistically significant, suggesting that URLs and DOIs may be an effective means for accessing data over time. These findings point to the value of including URLs and DOIs in Data Availability Statements to ensure access to data on a long-term basis.
format Online
Article
Text
id pubmed-9401135
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-9401135 2022-08-25 Long-term availability of data associated with articles in PLOS ONE Federer, Lisa M. PLoS One Research Article
Public Library of Science 2022-08-24 /pmc/articles/PMC9401135/ /pubmed/36001577 http://dx.doi.org/10.1371/journal.pone.0272845 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
title Long-term availability of data associated with articles in PLOS ONE
topic Research Article