Cargando…
Automatic question answering for multiple stakeholders, the epidemic question answering dataset
One of the effects of COVID-19 pandemic is a rapidly growing and changing stream of publications to inform clinicians, researchers, policy makers, and patients about the health, socio-economic, and cultural consequences of the pandemic. Managing this information stream manually is not feasible. Auto...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9302224/ https://www.ncbi.nlm.nih.gov/pubmed/35864125 http://dx.doi.org/10.1038/s41597-022-01533-w |
_version_ | 1784751587566026752 |
---|---|
author | Goodwin, Travis R. Demner-Fushman, Dina Lo, Kyle Wang, Lucy Lu Dang, Hoa T. Soboroff, Ian M. |
author_facet | Goodwin, Travis R. Demner-Fushman, Dina Lo, Kyle Wang, Lucy Lu Dang, Hoa T. Soboroff, Ian M. |
author_sort | Goodwin, Travis R. |
collection | PubMed |
description | One of the effects of COVID-19 pandemic is a rapidly growing and changing stream of publications to inform clinicians, researchers, policy makers, and patients about the health, socio-economic, and cultural consequences of the pandemic. Managing this information stream manually is not feasible. Automatic Question Answering can quickly bring the most salient points to the user’s attention. Leveraging a collection of scientific articles, government websites, relevant news articles, curated social media posts, and questions asked by researchers, clinicians, and the general public, we developed a dataset to explore automatic Question Answering for multiple stakeholders. Analysis of questions asked by various stakeholders shows that while information needs of experts and the public may overlap, satisfactory answers to these questions often originate from different information sources or benefit from different approaches to answer generation. We believe that this dataset has the potential to support the development of question answering systems not only for epidemic questions, but for other domains with varying expertise such as legal or finance. |
format | Online Article Text |
id | pubmed-9302224 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-93022242022-07-22 Automatic question answering for multiple stakeholders, the epidemic question answering dataset Goodwin, Travis R. Demner-Fushman, Dina Lo, Kyle Wang, Lucy Lu Dang, Hoa T. Soboroff, Ian M. Sci Data Data Descriptor One of the effects of COVID-19 pandemic is a rapidly growing and changing stream of publications to inform clinicians, researchers, policy makers, and patients about the health, socio-economic, and cultural consequences of the pandemic. Managing this information stream manually is not feasible. Automatic Question Answering can quickly bring the most salient points to the user’s attention. Leveraging a collection of scientific articles, government websites, relevant news articles, curated social media posts, and questions asked by researchers, clinicians, and the general public, we developed a dataset to explore automatic Question Answering for multiple stakeholders. Analysis of questions asked by various stakeholders shows that while information needs of experts and the public may overlap, satisfactory answers to these questions often originate from different information sources or benefit from different approaches to answer generation. We believe that this dataset has the potential to support the development of question answering systems not only for epidemic questions, but for other domains with varying expertise such as legal or finance. Nature Publishing Group UK 2022-07-21 /pmc/articles/PMC9302224/ /pubmed/35864125 http://dx.doi.org/10.1038/s41597-022-01533-w Text en © This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Data Descriptor Goodwin, Travis R. Demner-Fushman, Dina Lo, Kyle Wang, Lucy Lu Dang, Hoa T. Soboroff, Ian M. Automatic question answering for multiple stakeholders, the epidemic question answering dataset |
title | Automatic question answering for multiple stakeholders, the epidemic question answering dataset |
title_full | Automatic question answering for multiple stakeholders, the epidemic question answering dataset |
title_fullStr | Automatic question answering for multiple stakeholders, the epidemic question answering dataset |
title_full_unstemmed | Automatic question answering for multiple stakeholders, the epidemic question answering dataset |
title_short | Automatic question answering for multiple stakeholders, the epidemic question answering dataset |
title_sort | automatic question answering for multiple stakeholders, the epidemic question answering dataset |
topic | Data Descriptor |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9302224/ https://www.ncbi.nlm.nih.gov/pubmed/35864125 http://dx.doi.org/10.1038/s41597-022-01533-w |
work_keys_str_mv | AT goodwintravisr automaticquestionansweringformultiplestakeholderstheepidemicquestionansweringdataset AT demnerfushmandina automaticquestionansweringformultiplestakeholderstheepidemicquestionansweringdataset AT lokyle automaticquestionansweringformultiplestakeholderstheepidemicquestionansweringdataset AT wanglucylu automaticquestionansweringformultiplestakeholderstheepidemicquestionansweringdataset AT danghoat automaticquestionansweringformultiplestakeholderstheepidemicquestionansweringdataset AT soboroffianm automaticquestionansweringformultiplestakeholderstheepidemicquestionansweringdataset |