VQuAnDa: Verbalization QUestion ANswering DAtaset

Question Answering (QA) systems over Knowledge Graphs (KGs) aim to provide a concise answer to a given natural language question. Despite the significant evolution of QA methods over the past years, there are still some core lines of work, which are lagging behind. This is especially true for method...

Bibliographic Details
Main authors:	Kacupaj, Endri; Zafar, Hamid; Lehmann, Jens; Maleshkova, Maria
Format:	Online Article Text
Language:	English
Published:	2020
Subjects:	Article
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7250603/ http://dx.doi.org/10.1007/978-3-030-49461-2_31

_version_	1783538794017849344
author	Kacupaj, Endri Zafar, Hamid Lehmann, Jens Maleshkova, Maria
author_sort	Kacupaj, Endri
collection	PubMed
description	Question Answering (QA) systems over Knowledge Graphs (KGs) aim to provide a concise answer to a given natural language question. Despite the significant evolution of QA methods over the past years, there are still some core lines of work, which are lagging behind. This is especially true for methods and datasets that support the verbalization of answers in natural language. Specifically, to the best of our knowledge, none of the existing Question Answering datasets provide any verbalization data for the question-query pairs. Hence, we aim to fill this gap by providing the first QA dataset VQuAnDa that includes the verbalization of each answer. We base VQuAnDa on a commonly used large-scale QA dataset – LC-QuAD, in order to support compatibility and continuity of previous work. We complement the dataset with baseline scores for measuring future training and evaluation work, by using a set of standard sequence to sequence models and sharing the results of the experiments. This resource empowers researchers to train and evaluate a variety of models to generate answer verbalizations.
format	Online Article Text
id	pubmed-7250603
institution	National Center for Biotechnology Information
language	English
publishDate	2020
record_format	MEDLINE/PubMed
spelling	pubmed-72506032020-05-27 VQuAnDa: Verbalization QUestion ANswering DAtaset Kacupaj, Endri Zafar, Hamid Lehmann, Jens Maleshkova, Maria The Semantic Web Article Question Answering (QA) systems over Knowledge Graphs (KGs) aim to provide a concise answer to a given natural language question. Despite the significant evolution of QA methods over the past years, there are still some core lines of work, which are lagging behind. This is especially true for methods and datasets that support the verbalization of answers in natural language. Specifically, to the best of our knowledge, none of the existing Question Answering datasets provide any verbalization data for the question-query pairs. Hence, we aim to fill this gap by providing the first QA dataset VQuAnDa that includes the verbalization of each answer. We base VQuAnDa on a commonly used large-scale QA dataset – LC-QuAD, in order to support compatibility and continuity of previous work. We complement the dataset with baseline scores for measuring future training and evaluation work, by using a set of standard sequence to sequence models and sharing the results of the experiments. This resource empowers researchers to train and evaluate a variety of models to generate answer verbalizations. 2020-05-07 /pmc/articles/PMC7250603/ http://dx.doi.org/10.1007/978-3-030-49461-2_31 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle	Article Kacupaj, Endri Zafar, Hamid Lehmann, Jens Maleshkova, Maria VQuAnDa: Verbalization QUestion ANswering DAtaset
title	VQuAnDa: Verbalization QUestion ANswering DAtaset
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7250603/ http://dx.doi.org/10.1007/978-3-030-49461-2_31
