Cargando…

Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs

Today, several attempts to manage question answering (QA) have been made in three separate areas: (1) knowledge-based (KB), (2) text-based and (3) hybrid, which takes advantage of both prior areas in extracting the response. On the other hand, in question answering on a large number of sources, sour...

Descripción completa

Detalles Bibliográficos
Autores principales:	Asadifar, Somayeh, Kahani, Mohsen, Shekarpour, Saeedeh
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	PeerJ Inc. 2022
Materias:	Data Mining and Machine Learning
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044320/ https://www.ncbi.nlm.nih.gov/pubmed/35494835 http://dx.doi.org/10.7717/peerj-cs.846

_version_	1784695081063677952
author	Asadifar, Somayeh Kahani, Mohsen Shekarpour, Saeedeh
author_facet	Asadifar, Somayeh Kahani, Mohsen Shekarpour, Saeedeh
author_sort	Asadifar, Somayeh
collection	PubMed
description	Today, several attempts to manage question answering (QA) have been made in three separate areas: (1) knowledge-based (KB), (2) text-based and (3) hybrid, which takes advantage of both prior areas in extracting the response. On the other hand, in question answering on a large number of sources, source prediction to ensure scalability is very important. In this paper, a method for source prediction is presented in hybrid QA, involving several KB sources and a text source. In a few hybrid methods for source selection, including only one KB source in addition to the textual source, prioritization or heuristics have been used that have not been evaluated so far. Most methods available in source selection services are based on general metadata or triple instances. These methods are not suitable due to the unstructured source in hybrid QA. In this research, we need data details to predict the source. In addition, unlike KB federated methods that are based on triple instances, we use the behind idea of mediated schema to ensure data integration and scalability. Results from evaluations that consider word, triple, and question level information, show that the proposed approach performs well against a few benchmarks. In addition, the comparison of the proposed method with the existing approaches in hybrid and KB source prediction and also QA tasks has shown a significant reduction in response time and increased accuracy.
format	Online Article Text
id	pubmed-9044320
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	PeerJ Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-90443202022-04-28 Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs Asadifar, Somayeh Kahani, Mohsen Shekarpour, Saeedeh PeerJ Comput Sci Data Mining and Machine Learning Today, several attempts to manage question answering (QA) have been made in three separate areas: (1) knowledge-based (KB), (2) text-based and (3) hybrid, which takes advantage of both prior areas in extracting the response. On the other hand, in question answering on a large number of sources, source prediction to ensure scalability is very important. In this paper, a method for source prediction is presented in hybrid QA, involving several KB sources and a text source. In a few hybrid methods for source selection, including only one KB source in addition to the textual source, prioritization or heuristics have been used that have not been evaluated so far. Most methods available in source selection services are based on general metadata or triple instances. These methods are not suitable due to the unstructured source in hybrid QA. In this research, we need data details to predict the source. In addition, unlike KB federated methods that are based on triple instances, we use the behind idea of mediated schema to ensure data integration and scalability. Results from evaluations that consider word, triple, and question level information, show that the proposed approach performs well against a few benchmarks. In addition, the comparison of the proposed method with the existing approaches in hybrid and KB source prediction and also QA tasks has shown a significant reduction in response time and increased accuracy. PeerJ Inc. 2022-03-03 /pmc/articles/PMC9044320/ /pubmed/35494835 http://dx.doi.org/10.7717/peerj-cs.846 Text en © 2022 Asadifar et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle	Data Mining and Machine Learning Asadifar, Somayeh Kahani, Mohsen Shekarpour, Saeedeh Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs
title	Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs
title_full	Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs
title_fullStr	Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs
title_full_unstemmed	Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs
title_short	Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs
title_sort	schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs
topic	Data Mining and Machine Learning
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044320/ https://www.ncbi.nlm.nih.gov/pubmed/35494835 http://dx.doi.org/10.7717/peerj-cs.846
work_keys_str_mv	AT asadifarsomayeh schemaandcontentawareclassificationforpredictingthesourcescontainingananswerovercorpusandknowledgegraphs AT kahanimohsen schemaandcontentawareclassificationforpredictingthesourcescontainingananswerovercorpusandknowledgegraphs AT shekarpoursaeedeh schemaandcontentawareclassificationforpredictingthesourcescontainingananswerovercorpusandknowledgegraphs

Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs

Ejemplares similares