Cargando…
ToyArchitecture: Unsupervised learning of interpretable models of the environment
Research in Artificial Intelligence (AI) has focused mostly on two extremes: either on small improvements in narrow AI domains, or on universal theoretical frameworks which are often uncomputable, or lack practical implementations. In this paper we attempt to follow a big picture view while also pro...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7233548/ https://www.ncbi.nlm.nih.gov/pubmed/32421693 http://dx.doi.org/10.1371/journal.pone.0230432 |
_version_ | 1783535558684835840 |
---|---|
author | Vítků, Jaroslav Dluhoš, Petr Davidson, Joseph Nikl, Matěj Andersson, Simon Paška, Přemysl Šinkora, Jan Hlubuček, Petr Stránský, Martin Hyben, Martin Poliak, Martin Feyereisl, Jan Rosa, Marek |
author_facet | Vítků, Jaroslav Dluhoš, Petr Davidson, Joseph Nikl, Matěj Andersson, Simon Paška, Přemysl Šinkora, Jan Hlubuček, Petr Stránský, Martin Hyben, Martin Poliak, Martin Feyereisl, Jan Rosa, Marek |
author_sort | Vítků, Jaroslav |
collection | PubMed |
description | Research in Artificial Intelligence (AI) has focused mostly on two extremes: either on small improvements in narrow AI domains, or on universal theoretical frameworks which are often uncomputable, or lack practical implementations. In this paper we attempt to follow a big picture view while also providing a particular theory and its implementation to present a novel, purposely simple, and interpretable hierarchical architecture. This architecture incorporates the unsupervised learning of a model of the environment, learning the influence of one’s own actions, model-based reinforcement learning, hierarchical planning, and symbolic/sub-symbolic integration in general. The learned model is stored in the form of hierarchical representations which are increasingly more abstract, but can retain details when needed. We demonstrate the universality of the architecture by testing it on a series of diverse environments ranging from audio/visual compression to discrete and continuous action spaces, to learning disentangled representations. |
format | Online Article Text |
id | pubmed-7233548 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-72335482020-06-02 ToyArchitecture: Unsupervised learning of interpretable models of the environment Vítků, Jaroslav Dluhoš, Petr Davidson, Joseph Nikl, Matěj Andersson, Simon Paška, Přemysl Šinkora, Jan Hlubuček, Petr Stránský, Martin Hyben, Martin Poliak, Martin Feyereisl, Jan Rosa, Marek PLoS One Research Article Research in Artificial Intelligence (AI) has focused mostly on two extremes: either on small improvements in narrow AI domains, or on universal theoretical frameworks which are often uncomputable, or lack practical implementations. In this paper we attempt to follow a big picture view while also providing a particular theory and its implementation to present a novel, purposely simple, and interpretable hierarchical architecture. This architecture incorporates the unsupervised learning of a model of the environment, learning the influence of one’s own actions, model-based reinforcement learning, hierarchical planning, and symbolic/sub-symbolic integration in general. The learned model is stored in the form of hierarchical representations which are increasingly more abstract, but can retain details when needed. We demonstrate the universality of the architecture by testing it on a series of diverse environments ranging from audio/visual compression to discrete and continuous action spaces, to learning disentangled representations. Public Library of Science 2020-05-18 /pmc/articles/PMC7233548/ /pubmed/32421693 http://dx.doi.org/10.1371/journal.pone.0230432 Text en © 2020 Vítků et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Vítků, Jaroslav Dluhoš, Petr Davidson, Joseph Nikl, Matěj Andersson, Simon Paška, Přemysl Šinkora, Jan Hlubuček, Petr Stránský, Martin Hyben, Martin Poliak, Martin Feyereisl, Jan Rosa, Marek ToyArchitecture: Unsupervised learning of interpretable models of the environment |
title | ToyArchitecture: Unsupervised learning of interpretable models of the environment |
title_full | ToyArchitecture: Unsupervised learning of interpretable models of the environment |
title_fullStr | ToyArchitecture: Unsupervised learning of interpretable models of the environment |
title_full_unstemmed | ToyArchitecture: Unsupervised learning of interpretable models of the environment |
title_short | ToyArchitecture: Unsupervised learning of interpretable models of the environment |
title_sort | toyarchitecture: unsupervised learning of interpretable models of the environment |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7233548/ https://www.ncbi.nlm.nih.gov/pubmed/32421693 http://dx.doi.org/10.1371/journal.pone.0230432 |
work_keys_str_mv | AT vitkujaroslav toyarchitectureunsupervisedlearningofinterpretablemodelsoftheenvironment AT dluhospetr toyarchitectureunsupervisedlearningofinterpretablemodelsoftheenvironment AT davidsonjoseph toyarchitectureunsupervisedlearningofinterpretablemodelsoftheenvironment AT niklmatej toyarchitectureunsupervisedlearningofinterpretablemodelsoftheenvironment AT anderssonsimon toyarchitectureunsupervisedlearningofinterpretablemodelsoftheenvironment AT paskapremysl toyarchitectureunsupervisedlearningofinterpretablemodelsoftheenvironment AT sinkorajan toyarchitectureunsupervisedlearningofinterpretablemodelsoftheenvironment AT hlubucekpetr toyarchitectureunsupervisedlearningofinterpretablemodelsoftheenvironment AT stranskymartin toyarchitectureunsupervisedlearningofinterpretablemodelsoftheenvironment AT hybenmartin toyarchitectureunsupervisedlearningofinterpretablemodelsoftheenvironment AT poliakmartin toyarchitectureunsupervisedlearningofinterpretablemodelsoftheenvironment AT feyereisljan toyarchitectureunsupervisedlearningofinterpretablemodelsoftheenvironment AT rosamarek toyarchitectureunsupervisedlearningofinterpretablemodelsoftheenvironment |