Cargando…
A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases
This short paper describes a web resource—the NIST CORD-19 Web Resource—for community explorations of the COVID-19 Open Research Dataset (CORD-19). The tools for exploration in the web resource make use of the NIST-developed Root- and Rule-based method, which exploits underlying linguistic structure...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer India
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7523249/ https://www.ncbi.nlm.nih.gov/pubmed/33013023 http://dx.doi.org/10.1007/s41745-020-00193-2 |
_version_ | 1783588349905207296 |
---|---|
author | Collard, Jacob Bhat, Talapady Subrahmanian, Eswaran Monarch, Ira Tash, Jonah Sriram, Ram Elliot, John |
author_facet | Collard, Jacob Bhat, Talapady Subrahmanian, Eswaran Monarch, Ira Tash, Jonah Sriram, Ram Elliot, John |
author_sort | Collard, Jacob |
collection | PubMed |
description | This short paper describes a web resource—the NIST CORD-19 Web Resource—for community explorations of the COVID-19 Open Research Dataset (CORD-19). The tools for exploration in the web resource make use of the NIST-developed Root- and Rule-based method, which exploits underlying linguistic structures to create terms that represent phrases in a corpus. The method allows for auto-suggesting-related terms to discover terms to refine the search of a COVID-19 heterogenous document base. The method also produces taxonomic structures in the target domain as well as providing semantic information about the relationships between terms. This term structure can serve as a basis for creating topic modeling and trend analysis tools. In this paper, we describe use of a novel search engine to demonstrate some of the capabilities above. |
format | Online Article Text |
id | pubmed-7523249 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Springer India |
record_format | MEDLINE/PubMed |
spelling | pubmed-75232492020-09-29 A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases Collard, Jacob Bhat, Talapady Subrahmanian, Eswaran Monarch, Ira Tash, Jonah Sriram, Ram Elliot, John J Indian Inst Sci Review Article This short paper describes a web resource—the NIST CORD-19 Web Resource—for community explorations of the COVID-19 Open Research Dataset (CORD-19). The tools for exploration in the web resource make use of the NIST-developed Root- and Rule-based method, which exploits underlying linguistic structures to create terms that represent phrases in a corpus. The method allows for auto-suggesting-related terms to discover terms to refine the search of a COVID-19 heterogenous document base. The method also produces taxonomic structures in the target domain as well as providing semantic information about the relationships between terms. This term structure can serve as a basis for creating topic modeling and trend analysis tools. In this paper, we describe use of a novel search engine to demonstrate some of the capabilities above. Springer India 2020-09-29 2020 /pmc/articles/PMC7523249/ /pubmed/33013023 http://dx.doi.org/10.1007/s41745-020-00193-2 Text en © Indian Institute of Science 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Review Article Collard, Jacob Bhat, Talapady Subrahmanian, Eswaran Monarch, Ira Tash, Jonah Sriram, Ram Elliot, John A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases |
title | A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases |
title_full | A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases |
title_fullStr | A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases |
title_full_unstemmed | A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases |
title_short | A Web Resource for Exploring the CORD-19 Dataset Using Root- and Rule-Based Phrases |
title_sort | web resource for exploring the cord-19 dataset using root- and rule-based phrases |
topic | Review Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7523249/ https://www.ncbi.nlm.nih.gov/pubmed/33013023 http://dx.doi.org/10.1007/s41745-020-00193-2 |
work_keys_str_mv | AT collardjacob awebresourceforexploringthecord19datasetusingrootandrulebasedphrases AT bhattalapady awebresourceforexploringthecord19datasetusingrootandrulebasedphrases AT subrahmanianeswaran awebresourceforexploringthecord19datasetusingrootandrulebasedphrases AT monarchira awebresourceforexploringthecord19datasetusingrootandrulebasedphrases AT tashjonah awebresourceforexploringthecord19datasetusingrootandrulebasedphrases AT sriramram awebresourceforexploringthecord19datasetusingrootandrulebasedphrases AT elliotjohn awebresourceforexploringthecord19datasetusingrootandrulebasedphrases AT collardjacob webresourceforexploringthecord19datasetusingrootandrulebasedphrases AT bhattalapady webresourceforexploringthecord19datasetusingrootandrulebasedphrases AT subrahmanianeswaran webresourceforexploringthecord19datasetusingrootandrulebasedphrases AT monarchira webresourceforexploringthecord19datasetusingrootandrulebasedphrases AT tashjonah webresourceforexploringthecord19datasetusingrootandrulebasedphrases AT sriramram webresourceforexploringthecord19datasetusingrootandrulebasedphrases AT elliotjohn webresourceforexploringthecord19datasetusingrootandrulebasedphrases |