Cargando…
Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction
The success of logic-based methods for comparing entities heavily depends on the axioms that have been described for them in the Knowledge Base (KB). Due to the incompleteness of even large and well engineered KBs, such methods suffer from low recall when applied in real-world use cases. To address...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7250607/ http://dx.doi.org/10.1007/978-3-030-49461-2_5 |
_version_ | 1783538795121999872 |
---|---|
author | Stoilos, Giorgos Juric, Damir Wartak, Szymon Schulz, Claudia Khodadadi, Mohammad |
author_facet | Stoilos, Giorgos Juric, Damir Wartak, Szymon Schulz, Claudia Khodadadi, Mohammad |
author_sort | Stoilos, Giorgos |
collection | PubMed |
description | The success of logic-based methods for comparing entities heavily depends on the axioms that have been described for them in the Knowledge Base (KB). Due to the incompleteness of even large and well engineered KBs, such methods suffer from low recall when applied in real-world use cases. To address this, we designed a reasoning framework that combines logic-based subsumption with statistical methods for on-the-fly knowledge extraction. Statistical methods extract additional (missing) axioms for the compared entities with the goal of tackling the incompleteness of KBs and thus improving recall. Although this can be beneficial, it can also introduce noise (false positives or false negatives). Hence, our framework uses heuristics to assess whether knowledge extraction is likely to be advantageous and only activates the statistical components if this is the case. We instantiate our framework by combining lightweight logic-based reasoning implemented on top of existing triple-stores with an axiom extraction method that is based on the labels of concepts. Our work was motivated by industrial use cases over which we evaluate our instantiated framework, showing that it outperforms approaches that are only based on textual information. Besides the best combination of precision and recall, our implementation is also scalable and is currently used in an industrial production environment. |
format | Online Article Text |
id | pubmed-7250607 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-72506072020-05-27 Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction Stoilos, Giorgos Juric, Damir Wartak, Szymon Schulz, Claudia Khodadadi, Mohammad The Semantic Web Article The success of logic-based methods for comparing entities heavily depends on the axioms that have been described for them in the Knowledge Base (KB). Due to the incompleteness of even large and well engineered KBs, such methods suffer from low recall when applied in real-world use cases. To address this, we designed a reasoning framework that combines logic-based subsumption with statistical methods for on-the-fly knowledge extraction. Statistical methods extract additional (missing) axioms for the compared entities with the goal of tackling the incompleteness of KBs and thus improving recall. Although this can be beneficial, it can also introduce noise (false positives or false negatives). Hence, our framework uses heuristics to assess whether knowledge extraction is likely to be advantageous and only activates the statistical components if this is the case. We instantiate our framework by combining lightweight logic-based reasoning implemented on top of existing triple-stores with an axiom extraction method that is based on the labels of concepts. Our work was motivated by industrial use cases over which we evaluate our instantiated framework, showing that it outperforms approaches that are only based on textual information. Besides the best combination of precision and recall, our implementation is also scalable and is currently used in an industrial production environment. 2020-05-07 /pmc/articles/PMC7250607/ http://dx.doi.org/10.1007/978-3-030-49461-2_5 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Article Stoilos, Giorgos Juric, Damir Wartak, Szymon Schulz, Claudia Khodadadi, Mohammad Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction |
title | Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction |
title_full | Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction |
title_fullStr | Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction |
title_full_unstemmed | Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction |
title_short | Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction |
title_sort | hybrid reasoning over large knowledge bases using on-the-fly knowledge extraction |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7250607/ http://dx.doi.org/10.1007/978-3-030-49461-2_5 |
work_keys_str_mv | AT stoilosgiorgos hybridreasoningoverlargeknowledgebasesusingontheflyknowledgeextraction AT juricdamir hybridreasoningoverlargeknowledgebasesusingontheflyknowledgeextraction AT wartakszymon hybridreasoningoverlargeknowledgebasesusingontheflyknowledgeextraction AT schulzclaudia hybridreasoningoverlargeknowledgebasesusingontheflyknowledgeextraction AT khodadadimohammad hybridreasoningoverlargeknowledgebasesusingontheflyknowledgeextraction |