Cargando…

Use it or lose it: citations predict the continued online availability of published bioinformatics resources

Scientific Data Analysis Resources (SDARs) such as bioinformatics programs, web servers and databases are integral to modern science, but previous studies have shown that the Uniform Resource Locators (URLs) linking to them decay in a time-dependent manner, with ∼27% decayed to date. Because SDARs a...

Descripción completa

Detalles Bibliográficos
Autores principales: Wren, Jonathan D., Georgescu, Constantin, Giles, Cory B., Hennessey, Jason
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5397159/
https://www.ncbi.nlm.nih.gov/pubmed/28334982
http://dx.doi.org/10.1093/nar/gkx182
_version_ 1783230213474222080
author Wren, Jonathan D.
Georgescu, Constantin
Giles, Cory B.
Hennessey, Jason
author_facet Wren, Jonathan D.
Georgescu, Constantin
Giles, Cory B.
Hennessey, Jason
author_sort Wren, Jonathan D.
collection PubMed
description Scientific Data Analysis Resources (SDARs) such as bioinformatics programs, web servers and databases are integral to modern science, but previous studies have shown that the Uniform Resource Locators (URLs) linking to them decay in a time-dependent manner, with ∼27% decayed to date. Because SDARs are overrepresented among science's most cited papers over the past 20 years, loss of widely used SDARs could be particularly disruptive to scientific research. We identified URLs in MEDLINE abstracts and used crowdsourcing to identify which reported the creation of SDARs. We used the Internet Archive's Wayback Machine to approximate ‘death dates’ and calculate citations/year over each SDAR's lifespan. At first glance, decayed SDARs did not significantly differ from available SDARs in their average citations per year over their lifespan or journal impact factor (JIF). But the most cited SDARs were 94% likely to be relocated to another URL versus only 34% of uncited ones. Taking relocation into account, we find that citations are the strongest predictors of current online availability after time since publication, and JIF modestly predictive. This suggests that URL decay is a general, persistent phenomenon affecting all URLs, but the most useful/recognized SDARs are more likely to persist.
format Online
Article
Text
id pubmed-5397159
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-53971592017-04-24 Use it or lose it: citations predict the continued online availability of published bioinformatics resources Wren, Jonathan D. Georgescu, Constantin Giles, Cory B. Hennessey, Jason Nucleic Acids Res Survey and Summary Scientific Data Analysis Resources (SDARs) such as bioinformatics programs, web servers and databases are integral to modern science, but previous studies have shown that the Uniform Resource Locators (URLs) linking to them decay in a time-dependent manner, with ∼27% decayed to date. Because SDARs are overrepresented among science's most cited papers over the past 20 years, loss of widely used SDARs could be particularly disruptive to scientific research. We identified URLs in MEDLINE abstracts and used crowdsourcing to identify which reported the creation of SDARs. We used the Internet Archive's Wayback Machine to approximate ‘death dates’ and calculate citations/year over each SDAR's lifespan. At first glance, decayed SDARs did not significantly differ from available SDARs in their average citations per year over their lifespan or journal impact factor (JIF). But the most cited SDARs were 94% likely to be relocated to another URL versus only 34% of uncited ones. Taking relocation into account, we find that citations are the strongest predictors of current online availability after time since publication, and JIF modestly predictive. This suggests that URL decay is a general, persistent phenomenon affecting all URLs, but the most useful/recognized SDARs are more likely to persist. Oxford University Press 2017-04-20 2017-03-15 /pmc/articles/PMC5397159/ /pubmed/28334982 http://dx.doi.org/10.1093/nar/gkx182 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Survey and Summary
Wren, Jonathan D.
Georgescu, Constantin
Giles, Cory B.
Hennessey, Jason
Use it or lose it: citations predict the continued online availability of published bioinformatics resources
title Use it or lose it: citations predict the continued online availability of published bioinformatics resources
title_full Use it or lose it: citations predict the continued online availability of published bioinformatics resources
title_fullStr Use it or lose it: citations predict the continued online availability of published bioinformatics resources
title_full_unstemmed Use it or lose it: citations predict the continued online availability of published bioinformatics resources
title_short Use it or lose it: citations predict the continued online availability of published bioinformatics resources
title_sort use it or lose it: citations predict the continued online availability of published bioinformatics resources
topic Survey and Summary
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5397159/
https://www.ncbi.nlm.nih.gov/pubmed/28334982
http://dx.doi.org/10.1093/nar/gkx182
work_keys_str_mv AT wrenjonathand useitorloseitcitationspredictthecontinuedonlineavailabilityofpublishedbioinformaticsresources
AT georgescuconstantin useitorloseitcitationspredictthecontinuedonlineavailabilityofpublishedbioinformaticsresources
AT gilescoryb useitorloseitcitationspredictthecontinuedonlineavailabilityofpublishedbioinformaticsresources
AT hennesseyjason useitorloseitcitationspredictthecontinuedonlineavailabilityofpublishedbioinformaticsresources