Cargando…

Exploration of Outliers in If-Then Rule-Based Knowledge Bases

The article presents both methods of clustering and outlier detection in complex data, such as rule-based knowledge bases. What distinguishes this work from others is, first, the application of clustering algorithms to rules in domain knowledge bases, and secondly, the use of outlier detection algor...

Descripción completa

Detalles Bibliográficos
Autores principales:	Nowak-Brzezińska, Agnieszka, Horyń, Czesław
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7597194/ https://www.ncbi.nlm.nih.gov/pubmed/33286864 http://dx.doi.org/10.3390/e22101096

_version_	1783602287982149632
author	Nowak-Brzezińska, Agnieszka Horyń, Czesław
author_facet	Nowak-Brzezińska, Agnieszka Horyń, Czesław
author_sort	Nowak-Brzezińska, Agnieszka
collection	PubMed
description	The article presents both methods of clustering and outlier detection in complex data, such as rule-based knowledge bases. What distinguishes this work from others is, first, the application of clustering algorithms to rules in domain knowledge bases, and secondly, the use of outlier detection algorithms to detect unusual rules in knowledge bases. The aim of the paper is the analysis of using four algorithms for outlier detection in rule-based knowledge bases: Local Outlier Factor ([Formula: see text]), Connectivity-based Outlier Factor ([Formula: see text]), K- [Formula: see text] , and [Formula: see text] [Formula: see text]. The subject of outlier mining is very important nowadays. Outliers in rules If-Then mean unusual rules, which are rare in comparing to others and should be explored by the domain expert as soon as possible. In the research, the authors use the outlier detection methods to find a given number of outliers in rules ([Formula: see text] , [Formula: see text] , [Formula: see text]), while in small groups, the number of outliers covers no more than [Formula: see text] of the rule cluster. Subsequently, the authors analyze which of seven various quality indices, which they use for all rules and after removing selected outliers, improve the quality of rule clusters. In the experimental stage, the authors use six different knowledge bases. The best results (the most often the clusters quality was improved) are achieved for two outlier detection algorithms [Formula: see text] and [Formula: see text].
format	Online Article Text
id	pubmed-7597194
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-75971942020-11-09 Exploration of Outliers in If-Then Rule-Based Knowledge Bases Nowak-Brzezińska, Agnieszka Horyń, Czesław Entropy (Basel) Article The article presents both methods of clustering and outlier detection in complex data, such as rule-based knowledge bases. What distinguishes this work from others is, first, the application of clustering algorithms to rules in domain knowledge bases, and secondly, the use of outlier detection algorithms to detect unusual rules in knowledge bases. The aim of the paper is the analysis of using four algorithms for outlier detection in rule-based knowledge bases: Local Outlier Factor ([Formula: see text]), Connectivity-based Outlier Factor ([Formula: see text]), K- [Formula: see text] , and [Formula: see text] [Formula: see text]. The subject of outlier mining is very important nowadays. Outliers in rules If-Then mean unusual rules, which are rare in comparing to others and should be explored by the domain expert as soon as possible. In the research, the authors use the outlier detection methods to find a given number of outliers in rules ([Formula: see text] , [Formula: see text] , [Formula: see text]), while in small groups, the number of outliers covers no more than [Formula: see text] of the rule cluster. Subsequently, the authors analyze which of seven various quality indices, which they use for all rules and after removing selected outliers, improve the quality of rule clusters. In the experimental stage, the authors use six different knowledge bases. The best results (the most often the clusters quality was improved) are achieved for two outlier detection algorithms [Formula: see text] and [Formula: see text]. MDPI 2020-09-29 /pmc/articles/PMC7597194/ /pubmed/33286864 http://dx.doi.org/10.3390/e22101096 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Nowak-Brzezińska, Agnieszka Horyń, Czesław Exploration of Outliers in If-Then Rule-Based Knowledge Bases
title	Exploration of Outliers in If-Then Rule-Based Knowledge Bases
title_full	Exploration of Outliers in If-Then Rule-Based Knowledge Bases
title_fullStr	Exploration of Outliers in If-Then Rule-Based Knowledge Bases
title_full_unstemmed	Exploration of Outliers in If-Then Rule-Based Knowledge Bases
title_short	Exploration of Outliers in If-Then Rule-Based Knowledge Bases
title_sort	exploration of outliers in if-then rule-based knowledge bases
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7597194/ https://www.ncbi.nlm.nih.gov/pubmed/33286864 http://dx.doi.org/10.3390/e22101096
work_keys_str_mv	AT nowakbrzezinskaagnieszka explorationofoutliersinifthenrulebasedknowledgebases AT horynczesław explorationofoutliersinifthenrulebasedknowledgebases

Exploration of Outliers in If-Then Rule-Based Knowledge Bases

Ejemplares similares