Cargando…

COBERT: COVID-19 Question Answering System Using BERT

In the current situation of worldwide pandemic COVID-19, which has infected 62.5 Million people and caused nearly 1.46 Million deaths worldwide as of Nov 2020. The profoundly powerful and quickly advancing circumstance with COVID-19 has made it hard to get precise, on-request latest data with respec...

Descripción completa

Detalles Bibliográficos
Autores principales: Alzubi, Jafar A., Jain, Rachna, Singh, Anubhav, Parwekar, Pritee, Gupta, Meenu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8220121/
https://www.ncbi.nlm.nih.gov/pubmed/34178569
http://dx.doi.org/10.1007/s13369-021-05810-5
_version_ 1783711079591837696
author Alzubi, Jafar A.
Jain, Rachna
Singh, Anubhav
Parwekar, Pritee
Gupta, Meenu
author_facet Alzubi, Jafar A.
Jain, Rachna
Singh, Anubhav
Parwekar, Pritee
Gupta, Meenu
author_sort Alzubi, Jafar A.
collection PubMed
description In the current situation of worldwide pandemic COVID-19, which has infected 62.5 Million people and caused nearly 1.46 Million deaths worldwide as of Nov 2020. The profoundly powerful and quickly advancing circumstance with COVID-19 has made it hard to get precise, on-request latest data with respect to the virus. Especially, the frontline workers of the battle medical services experts, policymakers, clinical scientists, and so on will require expert specific methods to stay aware of this literature for getting scientific knowledge of the latest research findings. The risks are most certainly not trivial, as decisions made on fallacious, answers may endanger trust or general well being and security of the public. But, with thousands of research papers being dispensed on the topic, making it more difficult to keep track of the latest research. Taking these challenges into account we have proposed COBERT: a retriever-reader dual algorithmic system that answers the complex queries by searching a document of 59K corona virus-related literature made accessible through the Coronavirus Open Research Dataset Challenge (CORD-19). The retriever is composed of a TF-IDF vectorizer capturing the top 500 documents with optimal scores. The reader which is pre-trained Bidirectional Encoder Representations from Transformers (BERT) on SQuAD 1.1 dev dataset built on top of the HuggingFace BERT transformers, refines the sentences from the filtered documents, which are then passed into ranker which compares the logits scores to produce a short answer, title of the paper and source article of extraction. The proposed DistilBERT version has outperformed previous pre-trained models obtaining an Exact Match(EM)/F1 score of 80.6/87.3 respectively.
format Online
Article
Text
id pubmed-8220121
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-82201212021-06-23 COBERT: COVID-19 Question Answering System Using BERT Alzubi, Jafar A. Jain, Rachna Singh, Anubhav Parwekar, Pritee Gupta, Meenu Arab J Sci Eng RESEARCH ARTICLE - SPECIAL ISSUE - AI based health-related Computing for COVID-19 (AIHRC) In the current situation of worldwide pandemic COVID-19, which has infected 62.5 Million people and caused nearly 1.46 Million deaths worldwide as of Nov 2020. The profoundly powerful and quickly advancing circumstance with COVID-19 has made it hard to get precise, on-request latest data with respect to the virus. Especially, the frontline workers of the battle medical services experts, policymakers, clinical scientists, and so on will require expert specific methods to stay aware of this literature for getting scientific knowledge of the latest research findings. The risks are most certainly not trivial, as decisions made on fallacious, answers may endanger trust or general well being and security of the public. But, with thousands of research papers being dispensed on the topic, making it more difficult to keep track of the latest research. Taking these challenges into account we have proposed COBERT: a retriever-reader dual algorithmic system that answers the complex queries by searching a document of 59K corona virus-related literature made accessible through the Coronavirus Open Research Dataset Challenge (CORD-19). The retriever is composed of a TF-IDF vectorizer capturing the top 500 documents with optimal scores. The reader which is pre-trained Bidirectional Encoder Representations from Transformers (BERT) on SQuAD 1.1 dev dataset built on top of the HuggingFace BERT transformers, refines the sentences from the filtered documents, which are then passed into ranker which compares the logits scores to produce a short answer, title of the paper and source article of extraction. The proposed DistilBERT version has outperformed previous pre-trained models obtaining an Exact Match(EM)/F1 score of 80.6/87.3 respectively. Springer Berlin Heidelberg 2021-06-23 /pmc/articles/PMC8220121/ /pubmed/34178569 http://dx.doi.org/10.1007/s13369-021-05810-5 Text en © King Fahd University of Petroleum & Minerals 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle RESEARCH ARTICLE - SPECIAL ISSUE - AI based health-related Computing for COVID-19 (AIHRC)
Alzubi, Jafar A.
Jain, Rachna
Singh, Anubhav
Parwekar, Pritee
Gupta, Meenu
COBERT: COVID-19 Question Answering System Using BERT
title COBERT: COVID-19 Question Answering System Using BERT
title_full COBERT: COVID-19 Question Answering System Using BERT
title_fullStr COBERT: COVID-19 Question Answering System Using BERT
title_full_unstemmed COBERT: COVID-19 Question Answering System Using BERT
title_short COBERT: COVID-19 Question Answering System Using BERT
title_sort cobert: covid-19 question answering system using bert
topic RESEARCH ARTICLE - SPECIAL ISSUE - AI based health-related Computing for COVID-19 (AIHRC)
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8220121/
https://www.ncbi.nlm.nih.gov/pubmed/34178569
http://dx.doi.org/10.1007/s13369-021-05810-5
work_keys_str_mv AT alzubijafara cobertcovid19questionansweringsystemusingbert
AT jainrachna cobertcovid19questionansweringsystemusingbert
AT singhanubhav cobertcovid19questionansweringsystemusingbert
AT parwekarpritee cobertcovid19questionansweringsystemusingbert
AT guptameenu cobertcovid19questionansweringsystemusingbert