Cargando…

A Unifying Framework for Reinforcement Learning and Planning

Sequential decision making, commonly formalized as optimization of a Markov Decision Process, is a key challenge in artificial intelligence. Two successful approaches to MDP optimization are reinforcement learning and planning, which both largely have their own research communities. However, if both...

Descripción completa

Detalles Bibliográficos
Autores principales: Moerland, Thomas M., Broekens, Joost, Plaat, Aske, Jonker, Catholijn M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9309375/
https://www.ncbi.nlm.nih.gov/pubmed/35898393
http://dx.doi.org/10.3389/frai.2022.908353
_version_ 1784753147298709504
author Moerland, Thomas M.
Broekens, Joost
Plaat, Aske
Jonker, Catholijn M.
author_facet Moerland, Thomas M.
Broekens, Joost
Plaat, Aske
Jonker, Catholijn M.
author_sort Moerland, Thomas M.
collection PubMed
description Sequential decision making, commonly formalized as optimization of a Markov Decision Process, is a key challenge in artificial intelligence. Two successful approaches to MDP optimization are reinforcement learning and planning, which both largely have their own research communities. However, if both research fields solve the same problem, then we might be able to disentangle the common factors in their solution approaches. Therefore, this paper presents a unifying algorithmic framework for reinforcement learning and planning (FRAP), which identifies underlying dimensions on which MDP planning and learning algorithms have to decide. At the end of the paper, we compare a variety of well-known planning, model-free and model-based RL algorithms along these dimensions. Altogether, the framework may help provide deeper insight in the algorithmic design space of planning and reinforcement learning.
format Online
Article
Text
id pubmed-9309375
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-93093752022-07-26 A Unifying Framework for Reinforcement Learning and Planning Moerland, Thomas M. Broekens, Joost Plaat, Aske Jonker, Catholijn M. Front Artif Intell Artificial Intelligence Sequential decision making, commonly formalized as optimization of a Markov Decision Process, is a key challenge in artificial intelligence. Two successful approaches to MDP optimization are reinforcement learning and planning, which both largely have their own research communities. However, if both research fields solve the same problem, then we might be able to disentangle the common factors in their solution approaches. Therefore, this paper presents a unifying algorithmic framework for reinforcement learning and planning (FRAP), which identifies underlying dimensions on which MDP planning and learning algorithms have to decide. At the end of the paper, we compare a variety of well-known planning, model-free and model-based RL algorithms along these dimensions. Altogether, the framework may help provide deeper insight in the algorithmic design space of planning and reinforcement learning. Frontiers Media S.A. 2022-07-11 /pmc/articles/PMC9309375/ /pubmed/35898393 http://dx.doi.org/10.3389/frai.2022.908353 Text en Copyright © 2022 Moerland, Broekens, Plaat and Jonker. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Artificial Intelligence
Moerland, Thomas M.
Broekens, Joost
Plaat, Aske
Jonker, Catholijn M.
A Unifying Framework for Reinforcement Learning and Planning
title A Unifying Framework for Reinforcement Learning and Planning
title_full A Unifying Framework for Reinforcement Learning and Planning
title_fullStr A Unifying Framework for Reinforcement Learning and Planning
title_full_unstemmed A Unifying Framework for Reinforcement Learning and Planning
title_short A Unifying Framework for Reinforcement Learning and Planning
title_sort unifying framework for reinforcement learning and planning
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9309375/
https://www.ncbi.nlm.nih.gov/pubmed/35898393
http://dx.doi.org/10.3389/frai.2022.908353
work_keys_str_mv AT moerlandthomasm aunifyingframeworkforreinforcementlearningandplanning
AT broekensjoost aunifyingframeworkforreinforcementlearningandplanning
AT plaataske aunifyingframeworkforreinforcementlearningandplanning
AT jonkercatholijnm aunifyingframeworkforreinforcementlearningandplanning
AT moerlandthomasm unifyingframeworkforreinforcementlearningandplanning
AT broekensjoost unifyingframeworkforreinforcementlearningandplanning
AT plaataske unifyingframeworkforreinforcementlearningandplanning
AT jonkercatholijnm unifyingframeworkforreinforcementlearningandplanning