Cargando…

Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction

The success of logic-based methods for comparing entities heavily depends on the axioms that have been described for them in the Knowledge Base (KB). Due to the incompleteness of even large and well engineered KBs, such methods suffer from low recall when applied in real-world use cases. To address...

Descripción completa

Detalles Bibliográficos
Autores principales:	Stoilos, Giorgos, Juric, Damir, Wartak, Szymon, Schulz, Claudia, Khodadadi, Mohammad
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7250607/ http://dx.doi.org/10.1007/978-3-030-49461-2_5

_version_	1783538795121999872
author	Stoilos, Giorgos Juric, Damir Wartak, Szymon Schulz, Claudia Khodadadi, Mohammad
author_facet	Stoilos, Giorgos Juric, Damir Wartak, Szymon Schulz, Claudia Khodadadi, Mohammad
author_sort	Stoilos, Giorgos
collection	PubMed
description	The success of logic-based methods for comparing entities heavily depends on the axioms that have been described for them in the Knowledge Base (KB). Due to the incompleteness of even large and well engineered KBs, such methods suffer from low recall when applied in real-world use cases. To address this, we designed a reasoning framework that combines logic-based subsumption with statistical methods for on-the-fly knowledge extraction. Statistical methods extract additional (missing) axioms for the compared entities with the goal of tackling the incompleteness of KBs and thus improving recall. Although this can be beneficial, it can also introduce noise (false positives or false negatives). Hence, our framework uses heuristics to assess whether knowledge extraction is likely to be advantageous and only activates the statistical components if this is the case. We instantiate our framework by combining lightweight logic-based reasoning implemented on top of existing triple-stores with an axiom extraction method that is based on the labels of concepts. Our work was motivated by industrial use cases over which we evaluate our instantiated framework, showing that it outperforms approaches that are only based on textual information. Besides the best combination of precision and recall, our implementation is also scalable and is currently used in an industrial production environment.
format	Online Article Text
id	pubmed-7250607
institution	National Center for Biotechnology Information
language	English
publishDate	2020
record_format	MEDLINE/PubMed
spelling	pubmed-72506072020-05-27 Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction Stoilos, Giorgos Juric, Damir Wartak, Szymon Schulz, Claudia Khodadadi, Mohammad The Semantic Web Article The success of logic-based methods for comparing entities heavily depends on the axioms that have been described for them in the Knowledge Base (KB). Due to the incompleteness of even large and well engineered KBs, such methods suffer from low recall when applied in real-world use cases. To address this, we designed a reasoning framework that combines logic-based subsumption with statistical methods for on-the-fly knowledge extraction. Statistical methods extract additional (missing) axioms for the compared entities with the goal of tackling the incompleteness of KBs and thus improving recall. Although this can be beneficial, it can also introduce noise (false positives or false negatives). Hence, our framework uses heuristics to assess whether knowledge extraction is likely to be advantageous and only activates the statistical components if this is the case. We instantiate our framework by combining lightweight logic-based reasoning implemented on top of existing triple-stores with an axiom extraction method that is based on the labels of concepts. Our work was motivated by industrial use cases over which we evaluate our instantiated framework, showing that it outperforms approaches that are only based on textual information. Besides the best combination of precision and recall, our implementation is also scalable and is currently used in an industrial production environment. 2020-05-07 /pmc/articles/PMC7250607/ http://dx.doi.org/10.1007/978-3-030-49461-2_5 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle	Article Stoilos, Giorgos Juric, Damir Wartak, Szymon Schulz, Claudia Khodadadi, Mohammad Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction
title	Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction
title_full	Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction
title_fullStr	Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction
title_full_unstemmed	Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction
title_short	Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction
title_sort	hybrid reasoning over large knowledge bases using on-the-fly knowledge extraction
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7250607/ http://dx.doi.org/10.1007/978-3-030-49461-2_5
work_keys_str_mv	AT stoilosgiorgos hybridreasoningoverlargeknowledgebasesusingontheflyknowledgeextraction AT juricdamir hybridreasoningoverlargeknowledgebasesusingontheflyknowledgeextraction AT wartakszymon hybridreasoningoverlargeknowledgebasesusingontheflyknowledgeextraction AT schulzclaudia hybridreasoningoverlargeknowledgebasesusingontheflyknowledgeextraction AT khodadadimohammad hybridreasoningoverlargeknowledgebasesusingontheflyknowledgeextraction

Hybrid Reasoning Over Large Knowledge Bases Using On-The-Fly Knowledge Extraction

Ejemplares similares