Cargando…
A Unifying Framework for Reinforcement Learning and Planning
Sequential decision making, commonly formalized as optimization of a Markov Decision Process, is a key challenge in artificial intelligence. Two successful approaches to MDP optimization are reinforcement learning and planning, which both largely have their own research communities. However, if both...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9309375/ https://www.ncbi.nlm.nih.gov/pubmed/35898393 http://dx.doi.org/10.3389/frai.2022.908353 |
_version_ | 1784753147298709504 |
---|---|
author | Moerland, Thomas M. Broekens, Joost Plaat, Aske Jonker, Catholijn M. |
author_facet | Moerland, Thomas M. Broekens, Joost Plaat, Aske Jonker, Catholijn M. |
author_sort | Moerland, Thomas M. |
collection | PubMed |
description | Sequential decision making, commonly formalized as optimization of a Markov Decision Process, is a key challenge in artificial intelligence. Two successful approaches to MDP optimization are reinforcement learning and planning, which both largely have their own research communities. However, if both research fields solve the same problem, then we might be able to disentangle the common factors in their solution approaches. Therefore, this paper presents a unifying algorithmic framework for reinforcement learning and planning (FRAP), which identifies underlying dimensions on which MDP planning and learning algorithms have to decide. At the end of the paper, we compare a variety of well-known planning, model-free and model-based RL algorithms along these dimensions. Altogether, the framework may help provide deeper insight in the algorithmic design space of planning and reinforcement learning. |
format | Online Article Text |
id | pubmed-9309375 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-93093752022-07-26 A Unifying Framework for Reinforcement Learning and Planning Moerland, Thomas M. Broekens, Joost Plaat, Aske Jonker, Catholijn M. Front Artif Intell Artificial Intelligence Sequential decision making, commonly formalized as optimization of a Markov Decision Process, is a key challenge in artificial intelligence. Two successful approaches to MDP optimization are reinforcement learning and planning, which both largely have their own research communities. However, if both research fields solve the same problem, then we might be able to disentangle the common factors in their solution approaches. Therefore, this paper presents a unifying algorithmic framework for reinforcement learning and planning (FRAP), which identifies underlying dimensions on which MDP planning and learning algorithms have to decide. At the end of the paper, we compare a variety of well-known planning, model-free and model-based RL algorithms along these dimensions. Altogether, the framework may help provide deeper insight in the algorithmic design space of planning and reinforcement learning. Frontiers Media S.A. 2022-07-11 /pmc/articles/PMC9309375/ /pubmed/35898393 http://dx.doi.org/10.3389/frai.2022.908353 Text en Copyright © 2022 Moerland, Broekens, Plaat and Jonker. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Artificial Intelligence Moerland, Thomas M. Broekens, Joost Plaat, Aske Jonker, Catholijn M. A Unifying Framework for Reinforcement Learning and Planning |
title | A Unifying Framework for Reinforcement Learning and Planning |
title_full | A Unifying Framework for Reinforcement Learning and Planning |
title_fullStr | A Unifying Framework for Reinforcement Learning and Planning |
title_full_unstemmed | A Unifying Framework for Reinforcement Learning and Planning |
title_short | A Unifying Framework for Reinforcement Learning and Planning |
title_sort | unifying framework for reinforcement learning and planning |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9309375/ https://www.ncbi.nlm.nih.gov/pubmed/35898393 http://dx.doi.org/10.3389/frai.2022.908353 |
work_keys_str_mv | AT moerlandthomasm aunifyingframeworkforreinforcementlearningandplanning AT broekensjoost aunifyingframeworkforreinforcementlearningandplanning AT plaataske aunifyingframeworkforreinforcementlearningandplanning AT jonkercatholijnm aunifyingframeworkforreinforcementlearningandplanning AT moerlandthomasm unifyingframeworkforreinforcementlearningandplanning AT broekensjoost unifyingframeworkforreinforcementlearningandplanning AT plaataske unifyingframeworkforreinforcementlearningandplanning AT jonkercatholijnm unifyingframeworkforreinforcementlearningandplanning |