Cargando…

The Problem of Finding the Simplest Classifier Ensemble is NP-Hard – A Rough-Set-Inspired Formulation Based on Decision Bireducts

We investigate decision bireducts which extend the notion of a decision reduct developed in the theory of rough sets. For a decision table [Formula: see text], a decision bireduct is a pair (X, B), where [Formula: see text] is a subset of attributes which allows to distinguish between all pairs of o...

Descripción completa

Detalles Bibliográficos
Autores principales: Ślęzak, Dominik, Stawicki, Sebastian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7338167/
http://dx.doi.org/10.1007/978-3-030-52705-1_15
_version_ 1783554624444170240
author Ślęzak, Dominik
Stawicki, Sebastian
author_facet Ślęzak, Dominik
Stawicki, Sebastian
author_sort Ślęzak, Dominik
collection PubMed
description We investigate decision bireducts which extend the notion of a decision reduct developed in the theory of rough sets. For a decision table [Formula: see text], a decision bireduct is a pair (X, B), where [Formula: see text] is a subset of attributes which allows to distinguish between all pairs of objects in [Formula: see text] labeled with different values of decision attribute d, and where B and X cannot be made, respectively, smaller and bigger without losing this property. We refer to our earlier studies on deriving bireducts (X, B) from decision tables and utilizing them to construct families of rule-based classifiers, where [Formula: see text] is equal to total support of decision rules built using attributes in [Formula: see text]. We introduce the notion of a correct ensemble of decision bireducts [Formula: see text], where each [Formula: see text] must be validly classified by more than [Formula: see text] of the corresponding models. We show that the problem of finding a correct ensemble of bireducts with the lowest cardinalities of subsets [Formula: see text] is NP-hard.
format Online
Article
Text
id pubmed-7338167
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-73381672020-07-07 The Problem of Finding the Simplest Classifier Ensemble is NP-Hard – A Rough-Set-Inspired Formulation Based on Decision Bireducts Ślęzak, Dominik Stawicki, Sebastian Rough Sets Article We investigate decision bireducts which extend the notion of a decision reduct developed in the theory of rough sets. For a decision table [Formula: see text], a decision bireduct is a pair (X, B), where [Formula: see text] is a subset of attributes which allows to distinguish between all pairs of objects in [Formula: see text] labeled with different values of decision attribute d, and where B and X cannot be made, respectively, smaller and bigger without losing this property. We refer to our earlier studies on deriving bireducts (X, B) from decision tables and utilizing them to construct families of rule-based classifiers, where [Formula: see text] is equal to total support of decision rules built using attributes in [Formula: see text]. We introduce the notion of a correct ensemble of decision bireducts [Formula: see text], where each [Formula: see text] must be validly classified by more than [Formula: see text] of the corresponding models. We show that the problem of finding a correct ensemble of bireducts with the lowest cardinalities of subsets [Formula: see text] is NP-hard. 2020-06-10 /pmc/articles/PMC7338167/ http://dx.doi.org/10.1007/978-3-030-52705-1_15 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Ślęzak, Dominik
Stawicki, Sebastian
The Problem of Finding the Simplest Classifier Ensemble is NP-Hard – A Rough-Set-Inspired Formulation Based on Decision Bireducts
title The Problem of Finding the Simplest Classifier Ensemble is NP-Hard – A Rough-Set-Inspired Formulation Based on Decision Bireducts
title_full The Problem of Finding the Simplest Classifier Ensemble is NP-Hard – A Rough-Set-Inspired Formulation Based on Decision Bireducts
title_fullStr The Problem of Finding the Simplest Classifier Ensemble is NP-Hard – A Rough-Set-Inspired Formulation Based on Decision Bireducts
title_full_unstemmed The Problem of Finding the Simplest Classifier Ensemble is NP-Hard – A Rough-Set-Inspired Formulation Based on Decision Bireducts
title_short The Problem of Finding the Simplest Classifier Ensemble is NP-Hard – A Rough-Set-Inspired Formulation Based on Decision Bireducts
title_sort problem of finding the simplest classifier ensemble is np-hard – a rough-set-inspired formulation based on decision bireducts
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7338167/
http://dx.doi.org/10.1007/978-3-030-52705-1_15
work_keys_str_mv AT slezakdominik theproblemoffindingthesimplestclassifierensembleisnphardaroughsetinspiredformulationbasedondecisionbireducts
AT stawickisebastian theproblemoffindingthesimplestclassifierensembleisnphardaroughsetinspiredformulationbasedondecisionbireducts
AT slezakdominik problemoffindingthesimplestclassifierensembleisnphardaroughsetinspiredformulationbasedondecisionbireducts
AT stawickisebastian problemoffindingthesimplestclassifierensembleisnphardaroughsetinspiredformulationbasedondecisionbireducts