Cargando…

An Empirical Investigation Into Deep and Shallow Rule Learning

Inductive rule learning is arguably among the most traditional paradigms in machine learning. Although we have seen considerable progress over the years in learning rule-based theories, all state-of-the-art learners still learn descriptions that directly relate the input features to the target conce...

Descripción completa

Detalles Bibliográficos
Autores principales: Beck, Florian, Fürnkranz, Johannes
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8570245/
https://www.ncbi.nlm.nih.gov/pubmed/34746767
http://dx.doi.org/10.3389/frai.2021.689398
_version_ 1784594803654131712
author Beck, Florian
Fürnkranz, Johannes
author_facet Beck, Florian
Fürnkranz, Johannes
author_sort Beck, Florian
collection PubMed
description Inductive rule learning is arguably among the most traditional paradigms in machine learning. Although we have seen considerable progress over the years in learning rule-based theories, all state-of-the-art learners still learn descriptions that directly relate the input features to the target concept. In the simplest case, concept learning, this is a disjunctive normal form (DNF) description of the positive class. While it is clear that this is sufficient from a logical point of view because every logical expression can be reduced to an equivalent DNF expression, it could nevertheless be the case that more structured representations, which form deep theories by forming intermediate concepts, could be easier to learn, in very much the same way as deep neural networks are able to outperform shallow networks, even though the latter are also universal function approximators. However, there are several non-trivial obstacles that need to be overcome before a sufficiently powerful deep rule learning algorithm could be developed and be compared to the state-of-the-art in inductive rule learning. In this paper, we therefore take a different approach: we empirically compare deep and shallow rule sets that have been optimized with a uniform general mini-batch based optimization algorithm. In our experiments on both artificial and real-world benchmark data, deep rule networks outperformed their shallow counterparts, which we take as an indication that it is worth-while to devote more efforts to learning deep rule structures from data.
format Online
Article
Text
id pubmed-8570245
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-85702452021-11-06 An Empirical Investigation Into Deep and Shallow Rule Learning Beck, Florian Fürnkranz, Johannes Front Artif Intell Artificial Intelligence Inductive rule learning is arguably among the most traditional paradigms in machine learning. Although we have seen considerable progress over the years in learning rule-based theories, all state-of-the-art learners still learn descriptions that directly relate the input features to the target concept. In the simplest case, concept learning, this is a disjunctive normal form (DNF) description of the positive class. While it is clear that this is sufficient from a logical point of view because every logical expression can be reduced to an equivalent DNF expression, it could nevertheless be the case that more structured representations, which form deep theories by forming intermediate concepts, could be easier to learn, in very much the same way as deep neural networks are able to outperform shallow networks, even though the latter are also universal function approximators. However, there are several non-trivial obstacles that need to be overcome before a sufficiently powerful deep rule learning algorithm could be developed and be compared to the state-of-the-art in inductive rule learning. In this paper, we therefore take a different approach: we empirically compare deep and shallow rule sets that have been optimized with a uniform general mini-batch based optimization algorithm. In our experiments on both artificial and real-world benchmark data, deep rule networks outperformed their shallow counterparts, which we take as an indication that it is worth-while to devote more efforts to learning deep rule structures from data. Frontiers Media S.A. 2021-10-22 /pmc/articles/PMC8570245/ /pubmed/34746767 http://dx.doi.org/10.3389/frai.2021.689398 Text en Copyright © 2021 Beck and Fürnkranz. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Artificial Intelligence
Beck, Florian
Fürnkranz, Johannes
An Empirical Investigation Into Deep and Shallow Rule Learning
title An Empirical Investigation Into Deep and Shallow Rule Learning
title_full An Empirical Investigation Into Deep and Shallow Rule Learning
title_fullStr An Empirical Investigation Into Deep and Shallow Rule Learning
title_full_unstemmed An Empirical Investigation Into Deep and Shallow Rule Learning
title_short An Empirical Investigation Into Deep and Shallow Rule Learning
title_sort empirical investigation into deep and shallow rule learning
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8570245/
https://www.ncbi.nlm.nih.gov/pubmed/34746767
http://dx.doi.org/10.3389/frai.2021.689398
work_keys_str_mv AT beckflorian anempiricalinvestigationintodeepandshallowrulelearning
AT furnkranzjohannes anempiricalinvestigationintodeepandshallowrulelearning
AT beckflorian empiricalinvestigationintodeepandshallowrulelearning
AT furnkranzjohannes empiricalinvestigationintodeepandshallowrulelearning