Cargando…

Intrinsic Rewards for Maintenance, Approach, Avoidance, and Achievement Goal Types

In reinforcement learning, reward is used to guide the learning process. The reward is often designed to be task-dependent, and it may require significant domain knowledge to design a good reward function. This paper proposes general reward functions for maintenance, approach, avoidance, and achieve...

Descripción completa

Detalles Bibliográficos
Autores principales:	Dhakan, Paresh, Merrick, Kathryn, Rañó, Iñaki, Siddique, Nazmul
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2018
Materias:	Neuroscience
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6189580/ https://www.ncbi.nlm.nih.gov/pubmed/30356820 http://dx.doi.org/10.3389/fnbot.2018.00063

_version_	1783363391654461440
author	Dhakan, Paresh Merrick, Kathryn Rañó, Iñaki Siddique, Nazmul
author_facet	Dhakan, Paresh Merrick, Kathryn Rañó, Iñaki Siddique, Nazmul
author_sort	Dhakan, Paresh
collection	PubMed
description	In reinforcement learning, reward is used to guide the learning process. The reward is often designed to be task-dependent, and it may require significant domain knowledge to design a good reward function. This paper proposes general reward functions for maintenance, approach, avoidance, and achievement goal types. These reward functions exploit the inherent property of each type of goal and are thus task-independent. We also propose metrics to measure an agent's performance for learning each type of goal. We evaluate the intrinsic reward functions in a framework that can autonomously generate goals and learn solutions to those goals using a standard reinforcement learning algorithm. We show empirically how the proposed reward functions lead to learning in a mobile robot application. Finally, using the proposed reward functions as building blocks, we demonstrate how compound reward functions, reward functions to generate sequences of tasks, can be created that allow the mobile robot to learn more complex behaviors.
format	Online Article Text
id	pubmed-6189580
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-61895802018-10-23 Intrinsic Rewards for Maintenance, Approach, Avoidance, and Achievement Goal Types Dhakan, Paresh Merrick, Kathryn Rañó, Iñaki Siddique, Nazmul Front Neurorobot Neuroscience In reinforcement learning, reward is used to guide the learning process. The reward is often designed to be task-dependent, and it may require significant domain knowledge to design a good reward function. This paper proposes general reward functions for maintenance, approach, avoidance, and achievement goal types. These reward functions exploit the inherent property of each type of goal and are thus task-independent. We also propose metrics to measure an agent's performance for learning each type of goal. We evaluate the intrinsic reward functions in a framework that can autonomously generate goals and learn solutions to those goals using a standard reinforcement learning algorithm. We show empirically how the proposed reward functions lead to learning in a mobile robot application. Finally, using the proposed reward functions as building blocks, we demonstrate how compound reward functions, reward functions to generate sequences of tasks, can be created that allow the mobile robot to learn more complex behaviors. Frontiers Media S.A. 2018-10-09 /pmc/articles/PMC6189580/ /pubmed/30356820 http://dx.doi.org/10.3389/fnbot.2018.00063 Text en Copyright © 2018 Dhakan, Merrick, Rañó and Siddique. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Neuroscience Dhakan, Paresh Merrick, Kathryn Rañó, Iñaki Siddique, Nazmul Intrinsic Rewards for Maintenance, Approach, Avoidance, and Achievement Goal Types
title	Intrinsic Rewards for Maintenance, Approach, Avoidance, and Achievement Goal Types
title_full	Intrinsic Rewards for Maintenance, Approach, Avoidance, and Achievement Goal Types
title_fullStr	Intrinsic Rewards for Maintenance, Approach, Avoidance, and Achievement Goal Types
title_full_unstemmed	Intrinsic Rewards for Maintenance, Approach, Avoidance, and Achievement Goal Types
title_short	Intrinsic Rewards for Maintenance, Approach, Avoidance, and Achievement Goal Types
title_sort	intrinsic rewards for maintenance, approach, avoidance, and achievement goal types
topic	Neuroscience
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6189580/ https://www.ncbi.nlm.nih.gov/pubmed/30356820 http://dx.doi.org/10.3389/fnbot.2018.00063
work_keys_str_mv	AT dhakanparesh intrinsicrewardsformaintenanceapproachavoidanceandachievementgoaltypes AT merrickkathryn intrinsicrewardsformaintenanceapproachavoidanceandachievementgoaltypes AT ranoinaki intrinsicrewardsformaintenanceapproachavoidanceandachievementgoaltypes AT siddiquenazmul intrinsicrewardsformaintenanceapproachavoidanceandachievementgoaltypes

Intrinsic Rewards for Maintenance, Approach, Avoidance, and Achievement Goal Types

Ejemplares similares