Cargando…

Automatic question answering for multiple stakeholders, the epidemic question answering dataset

One of the effects of COVID-19 pandemic is a rapidly growing and changing stream of publications to inform clinicians, researchers, policy makers, and patients about the health, socio-economic, and cultural consequences of the pandemic. Managing this information stream manually is not feasible. Auto...

Descripción completa

Detalles Bibliográficos
Autores principales: Goodwin, Travis R., Demner-Fushman, Dina, Lo, Kyle, Wang, Lucy Lu, Dang, Hoa T., Soboroff, Ian M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9302224/
https://www.ncbi.nlm.nih.gov/pubmed/35864125
http://dx.doi.org/10.1038/s41597-022-01533-w
_version_ 1784751587566026752
author Goodwin, Travis R.
Demner-Fushman, Dina
Lo, Kyle
Wang, Lucy Lu
Dang, Hoa T.
Soboroff, Ian M.
author_facet Goodwin, Travis R.
Demner-Fushman, Dina
Lo, Kyle
Wang, Lucy Lu
Dang, Hoa T.
Soboroff, Ian M.
author_sort Goodwin, Travis R.
collection PubMed
description One of the effects of COVID-19 pandemic is a rapidly growing and changing stream of publications to inform clinicians, researchers, policy makers, and patients about the health, socio-economic, and cultural consequences of the pandemic. Managing this information stream manually is not feasible. Automatic Question Answering can quickly bring the most salient points to the user’s attention. Leveraging a collection of scientific articles, government websites, relevant news articles, curated social media posts, and questions asked by researchers, clinicians, and the general public, we developed a dataset to explore automatic Question Answering for multiple stakeholders. Analysis of questions asked by various stakeholders shows that while information needs of experts and the public may overlap, satisfactory answers to these questions often originate from different information sources or benefit from different approaches to answer generation. We believe that this dataset has the potential to support the development of question answering systems not only for epidemic questions, but for other domains with varying expertise such as legal or finance.
format Online
Article
Text
id pubmed-9302224
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-93022242022-07-22 Automatic question answering for multiple stakeholders, the epidemic question answering dataset Goodwin, Travis R. Demner-Fushman, Dina Lo, Kyle Wang, Lucy Lu Dang, Hoa T. Soboroff, Ian M. Sci Data Data Descriptor One of the effects of COVID-19 pandemic is a rapidly growing and changing stream of publications to inform clinicians, researchers, policy makers, and patients about the health, socio-economic, and cultural consequences of the pandemic. Managing this information stream manually is not feasible. Automatic Question Answering can quickly bring the most salient points to the user’s attention. Leveraging a collection of scientific articles, government websites, relevant news articles, curated social media posts, and questions asked by researchers, clinicians, and the general public, we developed a dataset to explore automatic Question Answering for multiple stakeholders. Analysis of questions asked by various stakeholders shows that while information needs of experts and the public may overlap, satisfactory answers to these questions often originate from different information sources or benefit from different approaches to answer generation. We believe that this dataset has the potential to support the development of question answering systems not only for epidemic questions, but for other domains with varying expertise such as legal or finance. Nature Publishing Group UK 2022-07-21 /pmc/articles/PMC9302224/ /pubmed/35864125 http://dx.doi.org/10.1038/s41597-022-01533-w Text en © This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Data Descriptor
Goodwin, Travis R.
Demner-Fushman, Dina
Lo, Kyle
Wang, Lucy Lu
Dang, Hoa T.
Soboroff, Ian M.
Automatic question answering for multiple stakeholders, the epidemic question answering dataset
title Automatic question answering for multiple stakeholders, the epidemic question answering dataset
title_full Automatic question answering for multiple stakeholders, the epidemic question answering dataset
title_fullStr Automatic question answering for multiple stakeholders, the epidemic question answering dataset
title_full_unstemmed Automatic question answering for multiple stakeholders, the epidemic question answering dataset
title_short Automatic question answering for multiple stakeholders, the epidemic question answering dataset
title_sort automatic question answering for multiple stakeholders, the epidemic question answering dataset
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9302224/
https://www.ncbi.nlm.nih.gov/pubmed/35864125
http://dx.doi.org/10.1038/s41597-022-01533-w
work_keys_str_mv AT goodwintravisr automaticquestionansweringformultiplestakeholderstheepidemicquestionansweringdataset
AT demnerfushmandina automaticquestionansweringformultiplestakeholderstheepidemicquestionansweringdataset
AT lokyle automaticquestionansweringformultiplestakeholderstheepidemicquestionansweringdataset
AT wanglucylu automaticquestionansweringformultiplestakeholderstheepidemicquestionansweringdataset
AT danghoat automaticquestionansweringformultiplestakeholderstheepidemicquestionansweringdataset
AT soboroffianm automaticquestionansweringformultiplestakeholderstheepidemicquestionansweringdataset