Cargando…

Demystifying COVID-19 publications: institutions, journals, concepts, and topics

OBJECTIVE: We analyzed the COVID-19 Open Research Dataset (CORD-19) to understand leading research institutions, collaborations among institutions, major publication venues, key research concepts, and topics covered by pandemic-related research. METHODS: We conducted a descriptive analysis of author...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Haihua, Chen, Jiangping, Nguyen, Huyen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: University Library System, University of Pittsburgh 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8485960/
https://www.ncbi.nlm.nih.gov/pubmed/34629968
http://dx.doi.org/10.5195/jmla.2021.1141
_version_ 1784577639505199104
author Chen, Haihua
Chen, Jiangping
Nguyen, Huyen
author_facet Chen, Haihua
Chen, Jiangping
Nguyen, Huyen
author_sort Chen, Haihua
collection PubMed
description OBJECTIVE: We analyzed the COVID-19 Open Research Dataset (CORD-19) to understand leading research institutions, collaborations among institutions, major publication venues, key research concepts, and topics covered by pandemic-related research. METHODS: We conducted a descriptive analysis of authors' institutions and relationships, automatic content extraction of key words and phrases from titles and abstracts, and topic modeling and evolution. Data visualization techniques were applied to present the results of the analysis. RESULTS: We found that leading research institutions on COVID-19 included the Chinese Academy of Sciences, the US National Institutes of Health, and the University of California. Research studies mostly involved collaboration among different institutions at national and international levels. In addition to bioRxiv, major publication venues included journals such as The BMJ, PLOS One, Journal of Virology, and The Lancet. Key research concepts included the coronavirus, acute respiratory impairments, health care, and social distancing. The ten most popular topics were identified through topic modeling and included human metapneumovirus and livestock, clinical outcomes of severe patients, and risk factors for higher mortality rate. CONCLUSION: Data analytics is a powerful approach for quickly processing and understanding large-scale datasets like CORD-19. This approach could help medical librarians, researchers, and the public understand important characteristics of COVID-19 research and could be applied to the analysis of other large datasets.
format Online
Article
Text
id pubmed-8485960
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher University Library System, University of Pittsburgh
record_format MEDLINE/PubMed
spelling pubmed-84859602021-10-08 Demystifying COVID-19 publications: institutions, journals, concepts, and topics Chen, Haihua Chen, Jiangping Nguyen, Huyen J Med Libr Assoc Original Investigation OBJECTIVE: We analyzed the COVID-19 Open Research Dataset (CORD-19) to understand leading research institutions, collaborations among institutions, major publication venues, key research concepts, and topics covered by pandemic-related research. METHODS: We conducted a descriptive analysis of authors' institutions and relationships, automatic content extraction of key words and phrases from titles and abstracts, and topic modeling and evolution. Data visualization techniques were applied to present the results of the analysis. RESULTS: We found that leading research institutions on COVID-19 included the Chinese Academy of Sciences, the US National Institutes of Health, and the University of California. Research studies mostly involved collaboration among different institutions at national and international levels. In addition to bioRxiv, major publication venues included journals such as The BMJ, PLOS One, Journal of Virology, and The Lancet. Key research concepts included the coronavirus, acute respiratory impairments, health care, and social distancing. The ten most popular topics were identified through topic modeling and included human metapneumovirus and livestock, clinical outcomes of severe patients, and risk factors for higher mortality rate. CONCLUSION: Data analytics is a powerful approach for quickly processing and understanding large-scale datasets like CORD-19. This approach could help medical librarians, researchers, and the public understand important characteristics of COVID-19 research and could be applied to the analysis of other large datasets. University Library System, University of Pittsburgh 2021-07-01 2021-07-01 /pmc/articles/PMC8485960/ /pubmed/34629968 http://dx.doi.org/10.5195/jmla.2021.1141 Text en Copyright © 2021 Haihua Chen, Jiangping Chen, Huyen Nguyen https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Original Investigation
Chen, Haihua
Chen, Jiangping
Nguyen, Huyen
Demystifying COVID-19 publications: institutions, journals, concepts, and topics
title Demystifying COVID-19 publications: institutions, journals, concepts, and topics
title_full Demystifying COVID-19 publications: institutions, journals, concepts, and topics
title_fullStr Demystifying COVID-19 publications: institutions, journals, concepts, and topics
title_full_unstemmed Demystifying COVID-19 publications: institutions, journals, concepts, and topics
title_short Demystifying COVID-19 publications: institutions, journals, concepts, and topics
title_sort demystifying covid-19 publications: institutions, journals, concepts, and topics
topic Original Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8485960/
https://www.ncbi.nlm.nih.gov/pubmed/34629968
http://dx.doi.org/10.5195/jmla.2021.1141
work_keys_str_mv AT chenhaihua demystifyingcovid19publicationsinstitutionsjournalsconceptsandtopics
AT chenjiangping demystifyingcovid19publicationsinstitutionsjournalsconceptsandtopics
AT nguyenhuyen demystifyingcovid19publicationsinstitutionsjournalsconceptsandtopics