Cargando…

Information retrieval and question answering: A case study on COVID-19 scientific literature

Biosanitary experts around the world are directing their efforts towards the study of COVID-19. This effort generates a large volume of scientific publications at a speed that makes the effective acquisition of new knowledge difficult. Therefore, Information Systems are needed to assist biosanitary...

Descripción completa

Detalles Bibliográficos
Autores principales: Otegi, Arantxa, San Vicente, Iñaki, Saralegi, Xabier, Peñas, Anselmo, Lozano, Borja, Agirre, Eneko
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Published by Elsevier B.V. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8719365/
https://www.ncbi.nlm.nih.gov/pubmed/35002094
http://dx.doi.org/10.1016/j.knosys.2021.108072
_version_ 1784624921955008512
author Otegi, Arantxa
San Vicente, Iñaki
Saralegi, Xabier
Peñas, Anselmo
Lozano, Borja
Agirre, Eneko
author_facet Otegi, Arantxa
San Vicente, Iñaki
Saralegi, Xabier
Peñas, Anselmo
Lozano, Borja
Agirre, Eneko
author_sort Otegi, Arantxa
collection PubMed
description Biosanitary experts around the world are directing their efforts towards the study of COVID-19. This effort generates a large volume of scientific publications at a speed that makes the effective acquisition of new knowledge difficult. Therefore, Information Systems are needed to assist biosanitary experts in accessing, consulting and analyzing these publications. In this work we develop a study of the variables involved in the development of a Question Answering system that receives a set of questions asked by experts about the disease COVID-19 and its causal virus SARS-CoV-2, and provides a ranked list of expert-level answers to each question. In particular, we address the interrelation of the Information Retrieval and the Answer Extraction steps. We found that a recall based document retrieval that leaves to a neural answer extraction module the scanning of the whole documents to find the best answer is a better strategy than relying in a precise passage retrieval before extracting the answer span.
format Online
Article
Text
id pubmed-8719365
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Published by Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-87193652022-01-03 Information retrieval and question answering: A case study on COVID-19 scientific literature Otegi, Arantxa San Vicente, Iñaki Saralegi, Xabier Peñas, Anselmo Lozano, Borja Agirre, Eneko Knowl Based Syst Article Biosanitary experts around the world are directing their efforts towards the study of COVID-19. This effort generates a large volume of scientific publications at a speed that makes the effective acquisition of new knowledge difficult. Therefore, Information Systems are needed to assist biosanitary experts in accessing, consulting and analyzing these publications. In this work we develop a study of the variables involved in the development of a Question Answering system that receives a set of questions asked by experts about the disease COVID-19 and its causal virus SARS-CoV-2, and provides a ranked list of expert-level answers to each question. In particular, we address the interrelation of the Information Retrieval and the Answer Extraction steps. We found that a recall based document retrieval that leaves to a neural answer extraction module the scanning of the whole documents to find the best answer is a better strategy than relying in a precise passage retrieval before extracting the answer span. Published by Elsevier B.V. 2022-03-15 2021-12-31 /pmc/articles/PMC8719365/ /pubmed/35002094 http://dx.doi.org/10.1016/j.knosys.2021.108072 Text en © 2021 Published by Elsevier B.V. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Otegi, Arantxa
San Vicente, Iñaki
Saralegi, Xabier
Peñas, Anselmo
Lozano, Borja
Agirre, Eneko
Information retrieval and question answering: A case study on COVID-19 scientific literature
title Information retrieval and question answering: A case study on COVID-19 scientific literature
title_full Information retrieval and question answering: A case study on COVID-19 scientific literature
title_fullStr Information retrieval and question answering: A case study on COVID-19 scientific literature
title_full_unstemmed Information retrieval and question answering: A case study on COVID-19 scientific literature
title_short Information retrieval and question answering: A case study on COVID-19 scientific literature
title_sort information retrieval and question answering: a case study on covid-19 scientific literature
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8719365/
https://www.ncbi.nlm.nih.gov/pubmed/35002094
http://dx.doi.org/10.1016/j.knosys.2021.108072
work_keys_str_mv AT otegiarantxa informationretrievalandquestionansweringacasestudyoncovid19scientificliterature
AT sanvicenteinaki informationretrievalandquestionansweringacasestudyoncovid19scientificliterature
AT saralegixabier informationretrievalandquestionansweringacasestudyoncovid19scientificliterature
AT penasanselmo informationretrievalandquestionansweringacasestudyoncovid19scientificliterature
AT lozanoborja informationretrievalandquestionansweringacasestudyoncovid19scientificliterature
AT agirreeneko informationretrievalandquestionansweringacasestudyoncovid19scientificliterature