The NIH Open Citation Collection: A public access, broad coverage resource

Citation data have remained hidden behind proprietary, restrictive licensing agreements, which raises barriers to entry for analysts wishing to use the data, increases the expense of performing large-scale analyses, and reduces the robustness and reproducibility of the conclusions. For the past seve...

Bibliographic Details
Main Authors: Hutchins, B. Ian, Baker, Kirk L., Davis, Matthew T., Diwersy, Mario A., Haque, Ehsanul, Harriman, Robert M., Hoppe, Travis A., Leicht, Stephen A., Meyer, Payam, Santangelo, George M.
Format: Online Article Text
Language: English
Published: Public Library of Science 2019
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6786512/
https://www.ncbi.nlm.nih.gov/pubmed/31600197
http://dx.doi.org/10.1371/journal.pbio.3000385
author Hutchins, B. Ian
Baker, Kirk L.
Davis, Matthew T.
Diwersy, Mario A.
Haque, Ehsanul
Harriman, Robert M.
Hoppe, Travis A.
Leicht, Stephen A.
Meyer, Payam
Santangelo, George M.
author_sort Hutchins, B. Ian
collection PubMed
description Citation data have remained hidden behind proprietary, restrictive licensing agreements, which raises barriers to entry for analysts wishing to use the data, increases the expense of performing large-scale analyses, and reduces the robustness and reproducibility of the conclusions. For the past several years, the National Institutes of Health (NIH) Office of Portfolio Analysis (OPA) has been aggregating and enhancing citation data that can be shared publicly. Here, we describe the NIH Open Citation Collection (NIH-OCC), a public access database for biomedical research that is made freely available to the community. This dataset, which has been carefully generated from unrestricted data sources such as MedLine, PubMed Central (PMC), and CrossRef, now underlies the citation statistics delivered in the NIH iCite analytic platform. We have also included data from a machine learning pipeline that identifies, extracts, resolves, and disambiguates references from full-text articles available on the internet. Open citation links are available to the public in a major update of iCite (https://icite.od.nih.gov).
format Online
Article
Text
id pubmed-6786512
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-6786512
2019-10-20
The NIH Open Citation Collection: A public access, broad coverage resource
Hutchins, B. Ian
Baker, Kirk L.
Davis, Matthew T.
Diwersy, Mario A.
Haque, Ehsanul
Harriman, Robert M.
Hoppe, Travis A.
Leicht, Stephen A.
Meyer, Payam
Santangelo, George M.
PLoS Biol
Community Page
Citation data have remained hidden behind proprietary, restrictive licensing agreements, which raises barriers to entry for analysts wishing to use the data, increases the expense of performing large-scale analyses, and reduces the robustness and reproducibility of the conclusions. For the past several years, the National Institutes of Health (NIH) Office of Portfolio Analysis (OPA) has been aggregating and enhancing citation data that can be shared publicly. Here, we describe the NIH Open Citation Collection (NIH-OCC), a public access database for biomedical research that is made freely available to the community. This dataset, which has been carefully generated from unrestricted data sources such as MedLine, PubMed Central (PMC), and CrossRef, now underlies the citation statistics delivered in the NIH iCite analytic platform. We have also included data from a machine learning pipeline that identifies, extracts, resolves, and disambiguates references from full-text articles available on the internet. Open citation links are available to the public in a major update of iCite (https://icite.od.nih.gov).
Public Library of Science
2019-10-10
/pmc/articles/PMC6786512/
/pubmed/31600197
http://dx.doi.org/10.1371/journal.pbio.3000385
Text
en
https://creativecommons.org/publicdomain/zero/1.0/
This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
title The NIH Open Citation Collection: A public access, broad coverage resource
topic Community Page
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6786512/
https://www.ncbi.nlm.nih.gov/pubmed/31600197
http://dx.doi.org/10.1371/journal.pbio.3000385