
Scalable DB+IR Technology: Processing Probabilistic Datalog with HySpirit

Probabilistic Datalog (PDatalog, proposed in 1995) is a probabilistic variant of Datalog and a nice conceptual idea to model Information Retrieval in a logical, rule-based programming paradigm. Making PDatalog work in real-world applications requires more than probabilistic facts and rules, and the...


Bibliographic Details
Main Authors:	Frommholz, Ingo; Roelleke, Thomas
Format:	Online Article Text
Language:	English
Published:	Springer Berlin Heidelberg 2016
Subjects:	Schwerpunktbeitrag
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5750817/ https://www.ncbi.nlm.nih.gov/pubmed/29368760 http://dx.doi.org/10.1007/s13222-015-0208-z

author	Frommholz, Ingo; Roelleke, Thomas
collection	PubMed
description	Probabilistic Datalog (PDatalog, proposed in 1995) is a probabilistic variant of Datalog and a nice conceptual idea to model Information Retrieval in a logical, rule-based programming paradigm. Making PDatalog work in real-world applications requires more than probabilistic facts and rules, and the semantics associated with the evaluation of the programs. We report in this paper some of the key features of the HySpirit system required to scale the execution of PDatalog programs. Firstly, there is the requirement to express probability estimation in PDatalog. Secondly, fuzzy-like predicates are required to model vague predicates (e.g. vague match of attributes such as age or price). Thirdly, to handle large data sets there are scalability issues to be addressed, and therefore, HySpirit provides probabilistic relational indexes and parallel and distributed processing. The main contribution of this paper is a consolidated view on the methods of the HySpirit system to make PDatalog applicable in real-scale applications that involve a wide range of requirements typical for data (information) management and analysis.
format	Online Article Text
id	pubmed-5750817
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	Springer Berlin Heidelberg
record_format	MEDLINE/PubMed
spelling	pubmed-5750817 2018-01-22. Scalable DB+IR Technology: Processing Probabilistic Datalog with HySpirit. Frommholz, Ingo; Roelleke, Thomas. Datenbank Spektrum. Schwerpunktbeitrag. Springer Berlin Heidelberg, 2016-01-26 (2016). /pmc/articles/PMC5750817/ /pubmed/29368760 http://dx.doi.org/10.1007/s13222-015-0208-z Text, en. © The Author(s) 2016. https://creativecommons.org/licenses/by/4.0/ Open Access: This article is distributed under the terms of the Creative Commons Attribution License, which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.
title	Scalable DB+IR Technology: Processing Probabilistic Datalog with HySpirit
topic	Schwerpunktbeitrag
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5750817/ https://www.ncbi.nlm.nih.gov/pubmed/29368760 http://dx.doi.org/10.1007/s13222-015-0208-z


Similar Items