Cargando…

Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction

The success of logic-based methods for comparing entities heavily depends on the axioms that have been described for them in the Knowledge Base (KB). Due to the incompleteness of even large and well engineered KBs, such methods suffer from low recall when applied in real-world use cases. To address...

Descripción completa

Detalles Bibliográficos
Autores principales: Stoilos, Giorgos, Juric, Damir, Wartak, Szymon, Schulz, Claudia, Khodadadi, Mohammad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7250607/
http://dx.doi.org/10.1007/978-3-030-49461-2_5
_version_ 1783538795121999872
author Stoilos, Giorgos
Juric, Damir
Wartak, Szymon
Schulz, Claudia
Khodadadi, Mohammad
author_facet Stoilos, Giorgos
Juric, Damir
Wartak, Szymon
Schulz, Claudia
Khodadadi, Mohammad
author_sort Stoilos, Giorgos
collection PubMed
description The success of logic-based methods for comparing entities heavily depends on the axioms that have been described for them in the Knowledge Base (KB). Due to the incompleteness of even large and well engineered KBs, such methods suffer from low recall when applied in real-world use cases. To address this, we designed a reasoning framework that combines logic-based subsumption with statistical methods for on-the-fly knowledge extraction. Statistical methods extract additional (missing) axioms for the compared entities with the goal of tackling the incompleteness of KBs and thus improving recall. Although this can be beneficial, it can also introduce noise (false positives or false negatives). Hence, our framework uses heuristics to assess whether knowledge extraction is likely to be advantageous and only activates the statistical components if this is the case. We instantiate our framework by combining lightweight logic-based reasoning implemented on top of existing triple-stores with an axiom extraction method that is based on the labels of concepts. Our work was motivated by industrial use cases over which we evaluate our instantiated framework, showing that it outperforms approaches that are only based on textual information. Besides the best combination of precision and recall, our implementation is also scalable and is currently used in an industrial production environment.
format Online
Article
Text
id pubmed-7250607
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-72506072020-05-27 Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction Stoilos, Giorgos Juric, Damir Wartak, Szymon Schulz, Claudia Khodadadi, Mohammad The Semantic Web Article The success of logic-based methods for comparing entities heavily depends on the axioms that have been described for them in the Knowledge Base (KB). Due to the incompleteness of even large and well engineered KBs, such methods suffer from low recall when applied in real-world use cases. To address this, we designed a reasoning framework that combines logic-based subsumption with statistical methods for on-the-fly knowledge extraction. Statistical methods extract additional (missing) axioms for the compared entities with the goal of tackling the incompleteness of KBs and thus improving recall. Although this can be beneficial, it can also introduce noise (false positives or false negatives). Hence, our framework uses heuristics to assess whether knowledge extraction is likely to be advantageous and only activates the statistical components if this is the case. We instantiate our framework by combining lightweight logic-based reasoning implemented on top of existing triple-stores with an axiom extraction method that is based on the labels of concepts. Our work was motivated by industrial use cases over which we evaluate our instantiated framework, showing that it outperforms approaches that are only based on textual information. Besides the best combination of precision and recall, our implementation is also scalable and is currently used in an industrial production environment. 2020-05-07 /pmc/articles/PMC7250607/ http://dx.doi.org/10.1007/978-3-030-49461-2_5 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Stoilos, Giorgos
Juric, Damir
Wartak, Szymon
Schulz, Claudia
Khodadadi, Mohammad
Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction
title Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction
title_full Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction
title_fullStr Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction
title_full_unstemmed Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction
title_short Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction
title_sort hybrid reasoning over large knowledge bases using on-the-fly knowledge extraction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7250607/
http://dx.doi.org/10.1007/978-3-030-49461-2_5
work_keys_str_mv AT stoilosgiorgos hybridreasoningoverlargeknowledgebasesusingontheflyknowledgeextraction
AT juricdamir hybridreasoningoverlargeknowledgebasesusingontheflyknowledgeextraction
AT wartakszymon hybridreasoningoverlargeknowledgebasesusingontheflyknowledgeextraction
AT schulzclaudia hybridreasoningoverlargeknowledgebasesusingontheflyknowledgeextraction
AT khodadadimohammad hybridreasoningoverlargeknowledgebasesusingontheflyknowledgeextraction