Cargando…

A detailed open access model of the PubMed literature

Portfolio analysis is a fundamental practice of organizational leadership and is a necessary precursor of strategic planning. Successful application requires a highly detailed model of research options. We have constructed a model, the first of its kind, that accurately characterizes these options f...

Descripción completa

Detalles Bibliográficos
Autores principales: Boyack, Kevin W., Smith, Caleb, Klavans, Richard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7680135/
https://www.ncbi.nlm.nih.gov/pubmed/33219227
http://dx.doi.org/10.1038/s41597-020-00749-y
_version_ 1783612404286881792
author Boyack, Kevin W.
Smith, Caleb
Klavans, Richard
author_facet Boyack, Kevin W.
Smith, Caleb
Klavans, Richard
author_sort Boyack, Kevin W.
collection PubMed
description Portfolio analysis is a fundamental practice of organizational leadership and is a necessary precursor of strategic planning. Successful application requires a highly detailed model of research options. We have constructed a model, the first of its kind, that accurately characterizes these options for the biomedical literature. The model comprises over 18 million PubMed documents from 1996–2019. Document relatedness was measured using a hybrid citation analysis + text similarity approach. The resulting 606.6 million document-to-document links were used to create 28,743 document clusters and an associated visual map. Clusters are characterized using metadata (e.g., phrases, MeSH) and over 20 indicators (e.g., funding, patent activity). The map and cluster-level data are embedded in Tableau to provide an interactive model enabling in-depth exploration of a research portfolio. Two example usage cases are provided, one to identify specific research opportunities related to coronavirus, and the second to identify research strengths of a large cohort of African American and Native American researchers at the University of Michigan Medical School.
format Online
Article
Text
id pubmed-7680135
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-76801352020-11-24 A detailed open access model of the PubMed literature Boyack, Kevin W. Smith, Caleb Klavans, Richard Sci Data Data Descriptor Portfolio analysis is a fundamental practice of organizational leadership and is a necessary precursor of strategic planning. Successful application requires a highly detailed model of research options. We have constructed a model, the first of its kind, that accurately characterizes these options for the biomedical literature. The model comprises over 18 million PubMed documents from 1996–2019. Document relatedness was measured using a hybrid citation analysis + text similarity approach. The resulting 606.6 million document-to-document links were used to create 28,743 document clusters and an associated visual map. Clusters are characterized using metadata (e.g., phrases, MeSH) and over 20 indicators (e.g., funding, patent activity). The map and cluster-level data are embedded in Tableau to provide an interactive model enabling in-depth exploration of a research portfolio. Two example usage cases are provided, one to identify specific research opportunities related to coronavirus, and the second to identify research strengths of a large cohort of African American and Native American researchers at the University of Michigan Medical School. Nature Publishing Group UK 2020-11-20 /pmc/articles/PMC7680135/ /pubmed/33219227 http://dx.doi.org/10.1038/s41597-020-00749-y Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.
spellingShingle Data Descriptor
Boyack, Kevin W.
Smith, Caleb
Klavans, Richard
A detailed open access model of the PubMed literature
title A detailed open access model of the PubMed literature
title_full A detailed open access model of the PubMed literature
title_fullStr A detailed open access model of the PubMed literature
title_full_unstemmed A detailed open access model of the PubMed literature
title_short A detailed open access model of the PubMed literature
title_sort detailed open access model of the pubmed literature
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7680135/
https://www.ncbi.nlm.nih.gov/pubmed/33219227
http://dx.doi.org/10.1038/s41597-020-00749-y
work_keys_str_mv AT boyackkevinw adetailedopenaccessmodelofthepubmedliterature
AT smithcaleb adetailedopenaccessmodelofthepubmedliterature
AT klavansrichard adetailedopenaccessmodelofthepubmedliterature
AT boyackkevinw detailedopenaccessmodelofthepubmedliterature
AT smithcaleb detailedopenaccessmodelofthepubmedliterature
AT klavansrichard detailedopenaccessmodelofthepubmedliterature