Cargando…

A comparative dataset: Bridging COVID-19 and other diseases through epistemonikos and CORD-19 evidence

The COVID-19 pandemic has underlined the need for reliable information for clinical decision-making and public health policies. As such, evidence-based medicine (EBM) is essential in identifying and evaluating scientific documents pertinent to novel diseases, and the accurate classification of biome...

Descripción completa

Detalles Bibliográficos
Autores principales: Carvallo, Andrés, Parra, Denis, Lobel, Hans, Rada, Gabriel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10641139/
https://www.ncbi.nlm.nih.gov/pubmed/37965606
http://dx.doi.org/10.1016/j.dib.2023.109720
_version_ 1785146709159968768
author Carvallo, Andrés
Parra, Denis
Lobel, Hans
Rada, Gabriel
author_facet Carvallo, Andrés
Parra, Denis
Lobel, Hans
Rada, Gabriel
author_sort Carvallo, Andrés
collection PubMed
description The COVID-19 pandemic has underlined the need for reliable information for clinical decision-making and public health policies. As such, evidence-based medicine (EBM) is essential in identifying and evaluating scientific documents pertinent to novel diseases, and the accurate classification of biomedical text is integral to this process. Given this context, we introduce a comprehensive, curated dataset composed of COVID-19-related documents. This dataset includes 20,047 labeled documents that were meticulously classified into five distinct categories: systematic reviews (SR), primary study randomized controlled trials (PS-RCT), primary study non-randomized controlled trials (PS-NRCT), broad synthesis (BS), and excluded (EXC). The documents, labeled by collaborators from the Epistemonikos Foundation, incorporate information such as document type, title, abstract, and metadata, including PubMed id, authors, journal, and publication date. Uniquely, this dataset has been curated by the Epistemonikos Foundation and is not readily accessible through conventional web-scraping methods, thereby attesting to its distinctive value in this field of research. In addition to this, the dataset also includes a vast evidence repository comprising 427,870 non-COVID-19 documents, also categorized into SR, PS-RCT, PS-NRCT, BS, and EXC. This additional collection can serve as a valuable benchmark for subsequent research. The comprehensive nature of this open-access dataset and its accompanying resources is poised to significantly advance evidence-based medicine and facilitate further research in the domain.
format Online
Article
Text
id pubmed-10641139
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-106411392023-11-14 A comparative dataset: Bridging COVID-19 and other diseases through epistemonikos and CORD-19 evidence Carvallo, Andrés Parra, Denis Lobel, Hans Rada, Gabriel Data Brief Data Article The COVID-19 pandemic has underlined the need for reliable information for clinical decision-making and public health policies. As such, evidence-based medicine (EBM) is essential in identifying and evaluating scientific documents pertinent to novel diseases, and the accurate classification of biomedical text is integral to this process. Given this context, we introduce a comprehensive, curated dataset composed of COVID-19-related documents. This dataset includes 20,047 labeled documents that were meticulously classified into five distinct categories: systematic reviews (SR), primary study randomized controlled trials (PS-RCT), primary study non-randomized controlled trials (PS-NRCT), broad synthesis (BS), and excluded (EXC). The documents, labeled by collaborators from the Epistemonikos Foundation, incorporate information such as document type, title, abstract, and metadata, including PubMed id, authors, journal, and publication date. Uniquely, this dataset has been curated by the Epistemonikos Foundation and is not readily accessible through conventional web-scraping methods, thereby attesting to its distinctive value in this field of research. In addition to this, the dataset also includes a vast evidence repository comprising 427,870 non-COVID-19 documents, also categorized into SR, PS-RCT, PS-NRCT, BS, and EXC. This additional collection can serve as a valuable benchmark for subsequent research. The comprehensive nature of this open-access dataset and its accompanying resources is poised to significantly advance evidence-based medicine and facilitate further research in the domain. Elsevier 2023-10-24 /pmc/articles/PMC10641139/ /pubmed/37965606 http://dx.doi.org/10.1016/j.dib.2023.109720 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Data Article
Carvallo, Andrés
Parra, Denis
Lobel, Hans
Rada, Gabriel
A comparative dataset: Bridging COVID-19 and other diseases through epistemonikos and CORD-19 evidence
title A comparative dataset: Bridging COVID-19 and other diseases through epistemonikos and CORD-19 evidence
title_full A comparative dataset: Bridging COVID-19 and other diseases through epistemonikos and CORD-19 evidence
title_fullStr A comparative dataset: Bridging COVID-19 and other diseases through epistemonikos and CORD-19 evidence
title_full_unstemmed A comparative dataset: Bridging COVID-19 and other diseases through epistemonikos and CORD-19 evidence
title_short A comparative dataset: Bridging COVID-19 and other diseases through epistemonikos and CORD-19 evidence
title_sort comparative dataset: bridging covid-19 and other diseases through epistemonikos and cord-19 evidence
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10641139/
https://www.ncbi.nlm.nih.gov/pubmed/37965606
http://dx.doi.org/10.1016/j.dib.2023.109720
work_keys_str_mv AT carvalloandres acomparativedatasetbridgingcovid19andotherdiseasesthroughepistemonikosandcord19evidence
AT parradenis acomparativedatasetbridgingcovid19andotherdiseasesthroughepistemonikosandcord19evidence
AT lobelhans acomparativedatasetbridgingcovid19andotherdiseasesthroughepistemonikosandcord19evidence
AT radagabriel acomparativedatasetbridgingcovid19andotherdiseasesthroughepistemonikosandcord19evidence
AT carvalloandres comparativedatasetbridgingcovid19andotherdiseasesthroughepistemonikosandcord19evidence
AT parradenis comparativedatasetbridgingcovid19andotherdiseasesthroughepistemonikosandcord19evidence
AT lobelhans comparativedatasetbridgingcovid19andotherdiseasesthroughepistemonikosandcord19evidence
AT radagabriel comparativedatasetbridgingcovid19andotherdiseasesthroughepistemonikosandcord19evidence