Cargando…

SAT-Based Encodings for Optimal Decision Trees with Explicit Paths

Decision trees play an important role both in Machine Learning and Knowledge Representation. They are attractive due to their immediate interpretability. In the spirit of Occam’s razor, and interpretability, it is desirable to calculate the smallest tree. This, however, has proven to be a challengin...

Descripción completa

Detalles Bibliográficos
Autores principales: Janota, Mikoláš, Morgado, António
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7326558/
http://dx.doi.org/10.1007/978-3-030-51825-7_35
_version_ 1783552370526912512
author Janota, Mikoláš
Morgado, António
author_facet Janota, Mikoláš
Morgado, António
author_sort Janota, Mikoláš
collection PubMed
description Decision trees play an important role both in Machine Learning and Knowledge Representation. They are attractive due to their immediate interpretability. In the spirit of Occam’s razor, and interpretability, it is desirable to calculate the smallest tree. This, however, has proven to be a challenging task and greedy approaches are typically used to learn trees in practice. Nevertheless, recent work showed that by the use of SAT solvers one may calculate the optimal size tree for real-world benchmarks. This paper proposes a novel SAT-based encoding that explicitly models paths in the tree, which enables us to control the tree’s depth as well as size. At the level of individual SAT calls, we investigate splitting the search space into tree topologies. Our tool outperforms the existing implementation. But also, the experimental results show that minimizing the depth first and then minimizing the number of nodes enables solving a larger set of instances.
format Online
Article
Text
id pubmed-7326558
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-73265582020-07-01 SAT-Based Encodings for Optimal Decision Trees with Explicit Paths Janota, Mikoláš Morgado, António Theory and Applications of Satisfiability Testing – SAT 2020 Article Decision trees play an important role both in Machine Learning and Knowledge Representation. They are attractive due to their immediate interpretability. In the spirit of Occam’s razor, and interpretability, it is desirable to calculate the smallest tree. This, however, has proven to be a challenging task and greedy approaches are typically used to learn trees in practice. Nevertheless, recent work showed that by the use of SAT solvers one may calculate the optimal size tree for real-world benchmarks. This paper proposes a novel SAT-based encoding that explicitly models paths in the tree, which enables us to control the tree’s depth as well as size. At the level of individual SAT calls, we investigate splitting the search space into tree topologies. Our tool outperforms the existing implementation. But also, the experimental results show that minimizing the depth first and then minimizing the number of nodes enables solving a larger set of instances. 2020-06-26 /pmc/articles/PMC7326558/ http://dx.doi.org/10.1007/978-3-030-51825-7_35 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Janota, Mikoláš
Morgado, António
SAT-Based Encodings for Optimal Decision Trees with Explicit Paths
title SAT-Based Encodings for Optimal Decision Trees with Explicit Paths
title_full SAT-Based Encodings for Optimal Decision Trees with Explicit Paths
title_fullStr SAT-Based Encodings for Optimal Decision Trees with Explicit Paths
title_full_unstemmed SAT-Based Encodings for Optimal Decision Trees with Explicit Paths
title_short SAT-Based Encodings for Optimal Decision Trees with Explicit Paths
title_sort sat-based encodings for optimal decision trees with explicit paths
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7326558/
http://dx.doi.org/10.1007/978-3-030-51825-7_35
work_keys_str_mv AT janotamikolas satbasedencodingsforoptimaldecisiontreeswithexplicitpaths
AT morgadoantonio satbasedencodingsforoptimaldecisiontreeswithexplicitpaths