Cargando…

A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases

This short paper describes a web resource—the NIST CORD-19 Web Resource—for community explorations of the COVID-19 Open Research Dataset (CORD-19). The tools for exploration in the web resource make use of the NIST-developed Root- and Rule-based method, which exploits underlying linguistic structure...

Descripción completa

Detalles Bibliográficos
Autores principales: Collard, Jacob, Bhat, Talapady, Subrahmanian, Eswaran, Monarch, Ira, Tash, Jonah, Sriram, Ram, Elliot, John
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer India 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7523249/
https://www.ncbi.nlm.nih.gov/pubmed/33013023
http://dx.doi.org/10.1007/s41745-020-00193-2
_version_ 1783588349905207296
author Collard, Jacob
Bhat, Talapady
Subrahmanian, Eswaran
Monarch, Ira
Tash, Jonah
Sriram, Ram
Elliot, John
author_facet Collard, Jacob
Bhat, Talapady
Subrahmanian, Eswaran
Monarch, Ira
Tash, Jonah
Sriram, Ram
Elliot, John
author_sort Collard, Jacob
collection PubMed
description This short paper describes a web resource—the NIST CORD-19 Web Resource—for community explorations of the COVID-19 Open Research Dataset (CORD-19). The tools for exploration in the web resource make use of the NIST-developed Root- and Rule-based method, which exploits underlying linguistic structures to create terms that represent phrases in a corpus. The method allows for auto-suggesting-related terms to discover terms to refine the search of a COVID-19 heterogenous document base. The method also produces taxonomic structures in the target domain as well as providing semantic information about the relationships between terms. This term structure can serve as a basis for creating topic modeling and trend analysis tools. In this paper, we describe use of a novel search engine to demonstrate some of the capabilities above.
format Online
Article
Text
id pubmed-7523249
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Springer India
record_format MEDLINE/PubMed
spelling pubmed-75232492020-09-29 A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases Collard, Jacob Bhat, Talapady Subrahmanian, Eswaran Monarch, Ira Tash, Jonah Sriram, Ram Elliot, John J Indian Inst Sci Review Article This short paper describes a web resource—the NIST CORD-19 Web Resource—for community explorations of the COVID-19 Open Research Dataset (CORD-19). The tools for exploration in the web resource make use of the NIST-developed Root- and Rule-based method, which exploits underlying linguistic structures to create terms that represent phrases in a corpus. The method allows for auto-suggesting-related terms to discover terms to refine the search of a COVID-19 heterogenous document base. The method also produces taxonomic structures in the target domain as well as providing semantic information about the relationships between terms. This term structure can serve as a basis for creating topic modeling and trend analysis tools. In this paper, we describe use of a novel search engine to demonstrate some of the capabilities above. Springer India 2020-09-29 2020 /pmc/articles/PMC7523249/ /pubmed/33013023 http://dx.doi.org/10.1007/s41745-020-00193-2 Text en © Indian Institute of Science 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Review Article
Collard, Jacob
Bhat, Talapady
Subrahmanian, Eswaran
Monarch, Ira
Tash, Jonah
Sriram, Ram
Elliot, John
A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases
title A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases
title_full A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases
title_fullStr A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases
title_full_unstemmed A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases
title_short A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases
title_sort web resource for exploring the cord-19 dataset using root- and rule-based phrases
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7523249/
https://www.ncbi.nlm.nih.gov/pubmed/33013023
http://dx.doi.org/10.1007/s41745-020-00193-2
work_keys_str_mv AT collardjacob awebresourceforexploringthecord19datasetusingrootandrulebasedphrases
AT bhattalapady awebresourceforexploringthecord19datasetusingrootandrulebasedphrases
AT subrahmanianeswaran awebresourceforexploringthecord19datasetusingrootandrulebasedphrases
AT monarchira awebresourceforexploringthecord19datasetusingrootandrulebasedphrases
AT tashjonah awebresourceforexploringthecord19datasetusingrootandrulebasedphrases
AT sriramram awebresourceforexploringthecord19datasetusingrootandrulebasedphrases
AT elliotjohn awebresourceforexploringthecord19datasetusingrootandrulebasedphrases
AT collardjacob webresourceforexploringthecord19datasetusingrootandrulebasedphrases
AT bhattalapady webresourceforexploringthecord19datasetusingrootandrulebasedphrases
AT subrahmanianeswaran webresourceforexploringthecord19datasetusingrootandrulebasedphrases
AT monarchira webresourceforexploringthecord19datasetusingrootandrulebasedphrases
AT tashjonah webresourceforexploringthecord19datasetusingrootandrulebasedphrases
AT sriramram webresourceforexploringthecord19datasetusingrootandrulebasedphrases
AT elliotjohn webresourceforexploringthecord19datasetusingrootandrulebasedphrases