Cargando…
Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs
Today, several attempts to manage question answering (QA) have been made in three separate areas: (1) knowledge-based (KB), (2) text-based and (3) hybrid, which takes advantage of both prior areas in extracting the response. On the other hand, in question answering on a large number of sources, sour...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044320/ https://www.ncbi.nlm.nih.gov/pubmed/35494835 http://dx.doi.org/10.7717/peerj-cs.846 |
_version_ | 1784695081063677952 |
---|---|
author | Asadifar, Somayeh Kahani, Mohsen Shekarpour, Saeedeh |
author_facet | Asadifar, Somayeh Kahani, Mohsen Shekarpour, Saeedeh |
author_sort | Asadifar, Somayeh |
collection | PubMed |
description | Today, several attempts to manage question answering (QA) have been made in three separate areas: (1) knowledge-based (KB), (2) text-based and (3) hybrid, which takes advantage of both prior areas in extracting the response. On the other hand, in question answering on a large number of sources, source prediction to ensure scalability is very important. In this paper, a method for source prediction is presented in hybrid QA, involving several KB sources and a text source. In a few hybrid methods for source selection, including only one KB source in addition to the textual source, prioritization or heuristics have been used that have not been evaluated so far. Most methods available in source selection services are based on general metadata or triple instances. These methods are not suitable due to the unstructured source in hybrid QA. In this research, we need data details to predict the source. In addition, unlike KB federated methods that are based on triple instances, we use the behind idea of mediated schema to ensure data integration and scalability. Results from evaluations that consider word, triple, and question level information, show that the proposed approach performs well against a few benchmarks. In addition, the comparison of the proposed method with the existing approaches in hybrid and KB source prediction and also QA tasks has shown a significant reduction in response time and increased accuracy. |
format | Online Article Text |
id | pubmed-9044320 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-90443202022-04-28 Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs Asadifar, Somayeh Kahani, Mohsen Shekarpour, Saeedeh PeerJ Comput Sci Data Mining and Machine Learning Today, several attempts to manage question answering (QA) have been made in three separate areas: (1) knowledge-based (KB), (2) text-based and (3) hybrid, which takes advantage of both prior areas in extracting the response. On the other hand, in question answering on a large number of sources, source prediction to ensure scalability is very important. In this paper, a method for source prediction is presented in hybrid QA, involving several KB sources and a text source. In a few hybrid methods for source selection, including only one KB source in addition to the textual source, prioritization or heuristics have been used that have not been evaluated so far. Most methods available in source selection services are based on general metadata or triple instances. These methods are not suitable due to the unstructured source in hybrid QA. In this research, we need data details to predict the source. In addition, unlike KB federated methods that are based on triple instances, we use the behind idea of mediated schema to ensure data integration and scalability. Results from evaluations that consider word, triple, and question level information, show that the proposed approach performs well against a few benchmarks. In addition, the comparison of the proposed method with the existing approaches in hybrid and KB source prediction and also QA tasks has shown a significant reduction in response time and increased accuracy. PeerJ Inc. 2022-03-03 /pmc/articles/PMC9044320/ /pubmed/35494835 http://dx.doi.org/10.7717/peerj-cs.846 Text en © 2022 Asadifar et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Data Mining and Machine Learning Asadifar, Somayeh Kahani, Mohsen Shekarpour, Saeedeh Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs |
title | Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs |
title_full | Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs |
title_fullStr | Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs |
title_full_unstemmed | Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs |
title_short | Schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs |
title_sort | schema and content aware classification for predicting the sources containing an answer over corpus and knowledge graphs |
topic | Data Mining and Machine Learning |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9044320/ https://www.ncbi.nlm.nih.gov/pubmed/35494835 http://dx.doi.org/10.7717/peerj-cs.846 |
work_keys_str_mv | AT asadifarsomayeh schemaandcontentawareclassificationforpredictingthesourcescontainingananswerovercorpusandknowledgegraphs AT kahanimohsen schemaandcontentawareclassificationforpredictingthesourcescontainingananswerovercorpusandknowledgegraphs AT shekarpoursaeedeh schemaandcontentawareclassificationforpredictingthesourcescontainingananswerovercorpusandknowledgegraphs |