Cargando…
Global Research on Coronaviruses: An R Package
BACKGROUND: In these trying times, we developed an R package about bibliographic references on coronaviruses. Working with reproducible research principles based on open science, disseminating scientific information, providing easy access to scientific production on this particular issue, and offeri...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7423387/ https://www.ncbi.nlm.nih.gov/pubmed/32730218 http://dx.doi.org/10.2196/19615 |
_version_ | 1783570160018259968 |
---|---|
author | Warin, Thierry |
author_facet | Warin, Thierry |
author_sort | Warin, Thierry |
collection | PubMed |
description | BACKGROUND: In these trying times, we developed an R package about bibliographic references on coronaviruses. Working with reproducible research principles based on open science, disseminating scientific information, providing easy access to scientific production on this particular issue, and offering a rapid integration in researchers’ workflows may help save time in this race against the virus, notably in terms of public health. OBJECTIVE: The goal is to simplify the workflow of interested researchers, with multidisciplinary research in mind. With more than 60,500 medical bibliographic references at the time of publication, this package is among the largest about coronaviruses. METHODS: This package could be of interest to epidemiologists, researchers in scientometrics, biostatisticians, as well as data scientists broadly defined. This package collects references from PubMed and organizes the data in a data frame. We then built functions to sort through this collection of references. Researchers can also integrate the data into their pipeline and implement them in R within their code libraries. RESULTS: We provide a short use case in this paper based on a bibliometric analysis of the references made available by this package. Classification techniques can also be used to go through the large volume of references and allow researchers to save time on this part of their research. Network analysis can be used to filter the data set. Text mining techniques can also help researchers calculate similarity indices and help them focus on the parts of the literature that are relevant for their research. CONCLUSIONS: This package aims at accelerating research on coronaviruses. Epidemiologists can integrate this package into their workflow. It is also possible to add a machine learning layer on top of this package to model the latest advances in research about coronaviruses, as we update this package daily. It is also the only one of this size, to the best of our knowledge, to be built in the R language. |
format | Online Article Text |
id | pubmed-7423387 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-74233872020-08-20 Global Research on Coronaviruses: An R Package Warin, Thierry J Med Internet Res Original Paper BACKGROUND: In these trying times, we developed an R package about bibliographic references on coronaviruses. Working with reproducible research principles based on open science, disseminating scientific information, providing easy access to scientific production on this particular issue, and offering a rapid integration in researchers’ workflows may help save time in this race against the virus, notably in terms of public health. OBJECTIVE: The goal is to simplify the workflow of interested researchers, with multidisciplinary research in mind. With more than 60,500 medical bibliographic references at the time of publication, this package is among the largest about coronaviruses. METHODS: This package could be of interest to epidemiologists, researchers in scientometrics, biostatisticians, as well as data scientists broadly defined. This package collects references from PubMed and organizes the data in a data frame. We then built functions to sort through this collection of references. Researchers can also integrate the data into their pipeline and implement them in R within their code libraries. RESULTS: We provide a short use case in this paper based on a bibliometric analysis of the references made available by this package. Classification techniques can also be used to go through the large volume of references and allow researchers to save time on this part of their research. Network analysis can be used to filter the data set. Text mining techniques can also help researchers calculate similarity indices and help them focus on the parts of the literature that are relevant for their research. CONCLUSIONS: This package aims at accelerating research on coronaviruses. Epidemiologists can integrate this package into their workflow. It is also possible to add a machine learning layer on top of this package to model the latest advances in research about coronaviruses, as we update this package daily. It is also the only one of this size, to the best of our knowledge, to be built in the R language. JMIR Publications 2020-08-11 /pmc/articles/PMC7423387/ /pubmed/32730218 http://dx.doi.org/10.2196/19615 Text en ©Thierry Warin. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 11.08.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Warin, Thierry Global Research on Coronaviruses: An R Package |
title | Global Research on Coronaviruses: An R Package |
title_full | Global Research on Coronaviruses: An R Package |
title_fullStr | Global Research on Coronaviruses: An R Package |
title_full_unstemmed | Global Research on Coronaviruses: An R Package |
title_short | Global Research on Coronaviruses: An R Package |
title_sort | global research on coronaviruses: an r package |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7423387/ https://www.ncbi.nlm.nih.gov/pubmed/32730218 http://dx.doi.org/10.2196/19615 |
work_keys_str_mv | AT warinthierry globalresearchoncoronavirusesanrpackage |