Cargando…
COBERT: COVID-19 Question Answering System Using BERT
In the current situation of worldwide pandemic COVID-19, which has infected 62.5 Million people and caused nearly 1.46 Million deaths worldwide as of Nov 2020. The profoundly powerful and quickly advancing circumstance with COVID-19 has made it hard to get precise, on-request latest data with respec...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8220121/ https://www.ncbi.nlm.nih.gov/pubmed/34178569 http://dx.doi.org/10.1007/s13369-021-05810-5 |
_version_ | 1783711079591837696 |
---|---|
author | Alzubi, Jafar A. Jain, Rachna Singh, Anubhav Parwekar, Pritee Gupta, Meenu |
author_facet | Alzubi, Jafar A. Jain, Rachna Singh, Anubhav Parwekar, Pritee Gupta, Meenu |
author_sort | Alzubi, Jafar A. |
collection | PubMed |
description | In the current situation of worldwide pandemic COVID-19, which has infected 62.5 Million people and caused nearly 1.46 Million deaths worldwide as of Nov 2020. The profoundly powerful and quickly advancing circumstance with COVID-19 has made it hard to get precise, on-request latest data with respect to the virus. Especially, the frontline workers of the battle medical services experts, policymakers, clinical scientists, and so on will require expert specific methods to stay aware of this literature for getting scientific knowledge of the latest research findings. The risks are most certainly not trivial, as decisions made on fallacious, answers may endanger trust or general well being and security of the public. But, with thousands of research papers being dispensed on the topic, making it more difficult to keep track of the latest research. Taking these challenges into account we have proposed COBERT: a retriever-reader dual algorithmic system that answers the complex queries by searching a document of 59K corona virus-related literature made accessible through the Coronavirus Open Research Dataset Challenge (CORD-19). The retriever is composed of a TF-IDF vectorizer capturing the top 500 documents with optimal scores. The reader which is pre-trained Bidirectional Encoder Representations from Transformers (BERT) on SQuAD 1.1 dev dataset built on top of the HuggingFace BERT transformers, refines the sentences from the filtered documents, which are then passed into ranker which compares the logits scores to produce a short answer, title of the paper and source article of extraction. The proposed DistilBERT version has outperformed previous pre-trained models obtaining an Exact Match(EM)/F1 score of 80.6/87.3 respectively. |
format | Online Article Text |
id | pubmed-8220121 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-82201212021-06-23 COBERT: COVID-19 Question Answering System Using BERT Alzubi, Jafar A. Jain, Rachna Singh, Anubhav Parwekar, Pritee Gupta, Meenu Arab J Sci Eng RESEARCH ARTICLE - SPECIAL ISSUE - AI based health-related Computing for COVID-19 (AIHRC) In the current situation of worldwide pandemic COVID-19, which has infected 62.5 Million people and caused nearly 1.46 Million deaths worldwide as of Nov 2020. The profoundly powerful and quickly advancing circumstance with COVID-19 has made it hard to get precise, on-request latest data with respect to the virus. Especially, the frontline workers of the battle medical services experts, policymakers, clinical scientists, and so on will require expert specific methods to stay aware of this literature for getting scientific knowledge of the latest research findings. The risks are most certainly not trivial, as decisions made on fallacious, answers may endanger trust or general well being and security of the public. But, with thousands of research papers being dispensed on the topic, making it more difficult to keep track of the latest research. Taking these challenges into account we have proposed COBERT: a retriever-reader dual algorithmic system that answers the complex queries by searching a document of 59K corona virus-related literature made accessible through the Coronavirus Open Research Dataset Challenge (CORD-19). The retriever is composed of a TF-IDF vectorizer capturing the top 500 documents with optimal scores. The reader which is pre-trained Bidirectional Encoder Representations from Transformers (BERT) on SQuAD 1.1 dev dataset built on top of the HuggingFace BERT transformers, refines the sentences from the filtered documents, which are then passed into ranker which compares the logits scores to produce a short answer, title of the paper and source article of extraction. The proposed DistilBERT version has outperformed previous pre-trained models obtaining an Exact Match(EM)/F1 score of 80.6/87.3 respectively. Springer Berlin Heidelberg 2021-06-23 /pmc/articles/PMC8220121/ /pubmed/34178569 http://dx.doi.org/10.1007/s13369-021-05810-5 Text en © King Fahd University of Petroleum & Minerals 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | RESEARCH ARTICLE - SPECIAL ISSUE - AI based health-related Computing for COVID-19 (AIHRC) Alzubi, Jafar A. Jain, Rachna Singh, Anubhav Parwekar, Pritee Gupta, Meenu COBERT: COVID-19 Question Answering System Using BERT |
title | COBERT: COVID-19 Question Answering System Using BERT |
title_full | COBERT: COVID-19 Question Answering System Using BERT |
title_fullStr | COBERT: COVID-19 Question Answering System Using BERT |
title_full_unstemmed | COBERT: COVID-19 Question Answering System Using BERT |
title_short | COBERT: COVID-19 Question Answering System Using BERT |
title_sort | cobert: covid-19 question answering system using bert |
topic | RESEARCH ARTICLE - SPECIAL ISSUE - AI based health-related Computing for COVID-19 (AIHRC) |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8220121/ https://www.ncbi.nlm.nih.gov/pubmed/34178569 http://dx.doi.org/10.1007/s13369-021-05810-5 |
work_keys_str_mv | AT alzubijafara cobertcovid19questionansweringsystemusingbert AT jainrachna cobertcovid19questionansweringsystemusingbert AT singhanubhav cobertcovid19questionansweringsystemusingbert AT parwekarpritee cobertcovid19questionansweringsystemusingbert AT guptameenu cobertcovid19questionansweringsystemusingbert |