Cargando…

A Unifying Framework for Reinforcement Learning and Planning

Sequential decision making, commonly formalized as optimization of a Markov Decision Process, is a key challenge in artificial intelligence. Two successful approaches to MDP optimization are reinforcement learning and planning, which both largely have their own research communities. However, if both...

Descripción completa

Detalles Bibliográficos
Autores principales:	Moerland, Thomas M., Broekens, Joost, Plaat, Aske, Jonker, Catholijn M.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2022
Materias:	Artificial Intelligence
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9309375/ https://www.ncbi.nlm.nih.gov/pubmed/35898393 http://dx.doi.org/10.3389/frai.2022.908353

_version_	1784753147298709504
author	Moerland, Thomas M. Broekens, Joost Plaat, Aske Jonker, Catholijn M.
author_facet	Moerland, Thomas M. Broekens, Joost Plaat, Aske Jonker, Catholijn M.
author_sort	Moerland, Thomas M.
collection	PubMed
description	Sequential decision making, commonly formalized as optimization of a Markov Decision Process, is a key challenge in artificial intelligence. Two successful approaches to MDP optimization are reinforcement learning and planning, which both largely have their own research communities. However, if both research fields solve the same problem, then we might be able to disentangle the common factors in their solution approaches. Therefore, this paper presents a unifying algorithmic framework for reinforcement learning and planning (FRAP), which identifies underlying dimensions on which MDP planning and learning algorithms have to decide. At the end of the paper, we compare a variety of well-known planning, model-free and model-based RL algorithms along these dimensions. Altogether, the framework may help provide deeper insight in the algorithmic design space of planning and reinforcement learning.
format	Online Article Text
id	pubmed-9309375
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-93093752022-07-26 A Unifying Framework for Reinforcement Learning and Planning Moerland, Thomas M. Broekens, Joost Plaat, Aske Jonker, Catholijn M. Front Artif Intell Artificial Intelligence Sequential decision making, commonly formalized as optimization of a Markov Decision Process, is a key challenge in artificial intelligence. Two successful approaches to MDP optimization are reinforcement learning and planning, which both largely have their own research communities. However, if both research fields solve the same problem, then we might be able to disentangle the common factors in their solution approaches. Therefore, this paper presents a unifying algorithmic framework for reinforcement learning and planning (FRAP), which identifies underlying dimensions on which MDP planning and learning algorithms have to decide. At the end of the paper, we compare a variety of well-known planning, model-free and model-based RL algorithms along these dimensions. Altogether, the framework may help provide deeper insight in the algorithmic design space of planning and reinforcement learning. Frontiers Media S.A. 2022-07-11 /pmc/articles/PMC9309375/ /pubmed/35898393 http://dx.doi.org/10.3389/frai.2022.908353 Text en Copyright © 2022 Moerland, Broekens, Plaat and Jonker. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Artificial Intelligence Moerland, Thomas M. Broekens, Joost Plaat, Aske Jonker, Catholijn M. A Unifying Framework for Reinforcement Learning and Planning
title	A Unifying Framework for Reinforcement Learning and Planning
title_full	A Unifying Framework for Reinforcement Learning and Planning
title_fullStr	A Unifying Framework for Reinforcement Learning and Planning
title_full_unstemmed	A Unifying Framework for Reinforcement Learning and Planning
title_short	A Unifying Framework for Reinforcement Learning and Planning
title_sort	unifying framework for reinforcement learning and planning
topic	Artificial Intelligence
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9309375/ https://www.ncbi.nlm.nih.gov/pubmed/35898393 http://dx.doi.org/10.3389/frai.2022.908353
work_keys_str_mv	AT moerlandthomasm aunifyingframeworkforreinforcementlearningandplanning AT broekensjoost aunifyingframeworkforreinforcementlearningandplanning AT plaataske aunifyingframeworkforreinforcementlearningandplanning AT jonkercatholijnm aunifyingframeworkforreinforcementlearningandplanning AT moerlandthomasm unifyingframeworkforreinforcementlearningandplanning AT broekensjoost unifyingframeworkforreinforcementlearningandplanning AT plaataske unifyingframeworkforreinforcementlearningandplanning AT jonkercatholijnm unifyingframeworkforreinforcementlearningandplanning

A Unifying Framework for Reinforcement Learning and Planning

Ejemplares similares