Cargando…

Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs

Today, several attempts to manage question answering (QA) have been made in three separate areas: (1) knowledge-based (KB), (2) text-based and (3) hybrid, which takes advantage of both prior areas in extracting the response. On the other hand, in question answering on a large number of sources, sour...

Descripción completa

Detalles Bibliográficos
Autores principales: Asadifar, Somayeh, Kahani, Mohsen, Shekarpour, Saeedeh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044320/
https://www.ncbi.nlm.nih.gov/pubmed/35494835
http://dx.doi.org/10.7717/peerj-cs.846
_version_ 1784695081063677952
author Asadifar, Somayeh
Kahani, Mohsen
Shekarpour, Saeedeh
author_facet Asadifar, Somayeh
Kahani, Mohsen
Shekarpour, Saeedeh
author_sort Asadifar, Somayeh
collection PubMed
description Today, several attempts to manage question answering (QA) have been made in three separate areas: (1) knowledge-based (KB), (2) text-based and (3) hybrid, which takes advantage of both prior areas in extracting the response. On the other hand, in question answering on a large number of sources, source prediction to ensure scalability is very important. In this paper, a method for source prediction is presented in hybrid QA, involving several KB sources and a text source. In a few hybrid methods for source selection, including only one KB source in addition to the textual source, prioritization or heuristics have been used that have not been evaluated so far. Most methods available in source selection services are based on general metadata or triple instances. These methods are not suitable due to the unstructured source in hybrid QA. In this research, we need data details to predict the source. In addition, unlike KB federated methods that are based on triple instances, we use the behind idea of mediated schema to ensure data integration and scalability. Results from evaluations that consider word, triple, and question level information, show that the proposed approach performs well against a few benchmarks. In addition, the comparison of the proposed method with the existing approaches in hybrid and KB source prediction and also QA tasks has shown a significant reduction in response time and increased accuracy.
format Online
Article
Text
id pubmed-9044320
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-90443202022-04-28 Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs Asadifar, Somayeh Kahani, Mohsen Shekarpour, Saeedeh PeerJ Comput Sci Data Mining and Machine Learning Today, several attempts to manage question answering (QA) have been made in three separate areas: (1) knowledge-based (KB), (2) text-based and (3) hybrid, which takes advantage of both prior areas in extracting the response. On the other hand, in question answering on a large number of sources, source prediction to ensure scalability is very important. In this paper, a method for source prediction is presented in hybrid QA, involving several KB sources and a text source. In a few hybrid methods for source selection, including only one KB source in addition to the textual source, prioritization or heuristics have been used that have not been evaluated so far. Most methods available in source selection services are based on general metadata or triple instances. These methods are not suitable due to the unstructured source in hybrid QA. In this research, we need data details to predict the source. In addition, unlike KB federated methods that are based on triple instances, we use the behind idea of mediated schema to ensure data integration and scalability. Results from evaluations that consider word, triple, and question level information, show that the proposed approach performs well against a few benchmarks. In addition, the comparison of the proposed method with the existing approaches in hybrid and KB source prediction and also QA tasks has shown a significant reduction in response time and increased accuracy. PeerJ Inc. 2022-03-03 /pmc/articles/PMC9044320/ /pubmed/35494835 http://dx.doi.org/10.7717/peerj-cs.846 Text en © 2022 Asadifar et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Data Mining and Machine Learning
Asadifar, Somayeh
Kahani, Mohsen
Shekarpour, Saeedeh
Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs
title Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs
title_full Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs
title_fullStr Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs
title_full_unstemmed Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs
title_short Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs
title_sort schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs
topic Data Mining and Machine Learning
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044320/
https://www.ncbi.nlm.nih.gov/pubmed/35494835
http://dx.doi.org/10.7717/peerj-cs.846
work_keys_str_mv AT asadifarsomayeh schemaandcontentawareclassificationforpredictingthesourcescontainingananswerovercorpusandknowledgegraphs
AT kahanimohsen schemaandcontentawareclassificationforpredictingthesourcescontainingananswerovercorpusandknowledgegraphs
AT shekarpoursaeedeh schemaandcontentawareclassificationforpredictingthesourcescontainingananswerovercorpusandknowledgegraphs