Cargando…

Global Research on Coronaviruses: An R Package

BACKGROUND: In these trying times, we developed an R package about bibliographic references on coronaviruses. Working with reproducible research principles based on open science, disseminating scientific information, providing easy access to scientific production on this particular issue, and offeri...

Descripción completa

Detalles Bibliográficos
Autor principal: Warin, Thierry
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7423387/
https://www.ncbi.nlm.nih.gov/pubmed/32730218
http://dx.doi.org/10.2196/19615
_version_ 1783570160018259968
author Warin, Thierry
author_facet Warin, Thierry
author_sort Warin, Thierry
collection PubMed
description BACKGROUND: In these trying times, we developed an R package about bibliographic references on coronaviruses. Working with reproducible research principles based on open science, disseminating scientific information, providing easy access to scientific production on this particular issue, and offering a rapid integration in researchers’ workflows may help save time in this race against the virus, notably in terms of public health. OBJECTIVE: The goal is to simplify the workflow of interested researchers, with multidisciplinary research in mind. With more than 60,500 medical bibliographic references at the time of publication, this package is among the largest about coronaviruses. METHODS: This package could be of interest to epidemiologists, researchers in scientometrics, biostatisticians, as well as data scientists broadly defined. This package collects references from PubMed and organizes the data in a data frame. We then built functions to sort through this collection of references. Researchers can also integrate the data into their pipeline and implement them in R within their code libraries. RESULTS: We provide a short use case in this paper based on a bibliometric analysis of the references made available by this package. Classification techniques can also be used to go through the large volume of references and allow researchers to save time on this part of their research. Network analysis can be used to filter the data set. Text mining techniques can also help researchers calculate similarity indices and help them focus on the parts of the literature that are relevant for their research. CONCLUSIONS: This package aims at accelerating research on coronaviruses. Epidemiologists can integrate this package into their workflow. It is also possible to add a machine learning layer on top of this package to model the latest advances in research about coronaviruses, as we update this package daily. It is also the only one of this size, to the best of our knowledge, to be built in the R language.
format Online
Article
Text
id pubmed-7423387
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-74233872020-08-20 Global Research on Coronaviruses: An R Package Warin, Thierry J Med Internet Res Original Paper BACKGROUND: In these trying times, we developed an R package about bibliographic references on coronaviruses. Working with reproducible research principles based on open science, disseminating scientific information, providing easy access to scientific production on this particular issue, and offering a rapid integration in researchers’ workflows may help save time in this race against the virus, notably in terms of public health. OBJECTIVE: The goal is to simplify the workflow of interested researchers, with multidisciplinary research in mind. With more than 60,500 medical bibliographic references at the time of publication, this package is among the largest about coronaviruses. METHODS: This package could be of interest to epidemiologists, researchers in scientometrics, biostatisticians, as well as data scientists broadly defined. This package collects references from PubMed and organizes the data in a data frame. We then built functions to sort through this collection of references. Researchers can also integrate the data into their pipeline and implement them in R within their code libraries. RESULTS: We provide a short use case in this paper based on a bibliometric analysis of the references made available by this package. Classification techniques can also be used to go through the large volume of references and allow researchers to save time on this part of their research. Network analysis can be used to filter the data set. Text mining techniques can also help researchers calculate similarity indices and help them focus on the parts of the literature that are relevant for their research. CONCLUSIONS: This package aims at accelerating research on coronaviruses. Epidemiologists can integrate this package into their workflow. It is also possible to add a machine learning layer on top of this package to model the latest advances in research about coronaviruses, as we update this package daily. It is also the only one of this size, to the best of our knowledge, to be built in the R language. JMIR Publications 2020-08-11 /pmc/articles/PMC7423387/ /pubmed/32730218 http://dx.doi.org/10.2196/19615 Text en ©Thierry Warin. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 11.08.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Warin, Thierry
Global Research on Coronaviruses: An R Package
title Global Research on Coronaviruses: An R Package
title_full Global Research on Coronaviruses: An R Package
title_fullStr Global Research on Coronaviruses: An R Package
title_full_unstemmed Global Research on Coronaviruses: An R Package
title_short Global Research on Coronaviruses: An R Package
title_sort global research on coronaviruses: an r package
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7423387/
https://www.ncbi.nlm.nih.gov/pubmed/32730218
http://dx.doi.org/10.2196/19615
work_keys_str_mv AT warinthierry globalresearchoncoronavirusesanrpackage