Cargando…

A Reward Optimization Method Based on Action Subrewards in Hierarchical Reinforcement Learning

Reinforcement learning (RL) is one kind of interactive learning methods. Its main characteristics are “trial and error” and “related reward.” A hierarchical reinforcement learning method based on action subrewards is proposed to solve the problem of “curse of dimensionality,” which means that the st...

Descripción completa

Detalles Bibliográficos
Autores principales:	Fu, Yuchen, Liu, Quan, Ling, Xionghong, Cui, Zhiming
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi Publishing Corporation 2014
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3926376/ https://www.ncbi.nlm.nih.gov/pubmed/24600318 http://dx.doi.org/10.1155/2014/120760

_version_	1782303968009912320
author	Fu, Yuchen Liu, Quan Ling, Xionghong Cui, Zhiming
author_facet	Fu, Yuchen Liu, Quan Ling, Xionghong Cui, Zhiming
author_sort	Fu, Yuchen
collection	PubMed
description	Reinforcement learning (RL) is one kind of interactive learning methods. Its main characteristics are “trial and error” and “related reward.” A hierarchical reinforcement learning method based on action subrewards is proposed to solve the problem of “curse of dimensionality,” which means that the states space will grow exponentially in the number of features and low convergence speed. The method can reduce state spaces greatly and choose actions with favorable purpose and efficiency so as to optimize reward function and enhance convergence speed. Apply it to the online learning in Tetris game, and the experiment result shows that the convergence speed of this algorithm can be enhanced evidently based on the new method which combines hierarchical reinforcement learning algorithm and action subrewards. The “curse of dimensionality” problem is also solved to a certain extent with hierarchical method. All the performance with different parameters is compared and analyzed as well.
format	Online Article Text
id	pubmed-3926376
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	Hindawi Publishing Corporation
record_format	MEDLINE/PubMed
spelling	pubmed-39263762014-03-05 A Reward Optimization Method Based on Action Subrewards in Hierarchical Reinforcement Learning Fu, Yuchen Liu, Quan Ling, Xionghong Cui, Zhiming ScientificWorldJournal Research Article Reinforcement learning (RL) is one kind of interactive learning methods. Its main characteristics are “trial and error” and “related reward.” A hierarchical reinforcement learning method based on action subrewards is proposed to solve the problem of “curse of dimensionality,” which means that the states space will grow exponentially in the number of features and low convergence speed. The method can reduce state spaces greatly and choose actions with favorable purpose and efficiency so as to optimize reward function and enhance convergence speed. Apply it to the online learning in Tetris game, and the experiment result shows that the convergence speed of this algorithm can be enhanced evidently based on the new method which combines hierarchical reinforcement learning algorithm and action subrewards. The “curse of dimensionality” problem is also solved to a certain extent with hierarchical method. All the performance with different parameters is compared and analyzed as well. Hindawi Publishing Corporation 2014-01-28 /pmc/articles/PMC3926376/ /pubmed/24600318 http://dx.doi.org/10.1155/2014/120760 Text en Copyright © 2014 Yuchen Fu et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Fu, Yuchen Liu, Quan Ling, Xionghong Cui, Zhiming A Reward Optimization Method Based on Action Subrewards in Hierarchical Reinforcement Learning
title	A Reward Optimization Method Based on Action Subrewards in Hierarchical Reinforcement Learning
title_full	A Reward Optimization Method Based on Action Subrewards in Hierarchical Reinforcement Learning
title_fullStr	A Reward Optimization Method Based on Action Subrewards in Hierarchical Reinforcement Learning
title_full_unstemmed	A Reward Optimization Method Based on Action Subrewards in Hierarchical Reinforcement Learning
title_short	A Reward Optimization Method Based on Action Subrewards in Hierarchical Reinforcement Learning
title_sort	reward optimization method based on action subrewards in hierarchical reinforcement learning
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3926376/ https://www.ncbi.nlm.nih.gov/pubmed/24600318 http://dx.doi.org/10.1155/2014/120760
work_keys_str_mv	AT fuyuchen arewardoptimizationmethodbasedonactionsubrewardsinhierarchicalreinforcementlearning AT liuquan arewardoptimizationmethodbasedonactionsubrewardsinhierarchicalreinforcementlearning AT lingxionghong arewardoptimizationmethodbasedonactionsubrewardsinhierarchicalreinforcementlearning AT cuizhiming arewardoptimizationmethodbasedonactionsubrewardsinhierarchicalreinforcementlearning AT fuyuchen rewardoptimizationmethodbasedonactionsubrewardsinhierarchicalreinforcementlearning AT liuquan rewardoptimizationmethodbasedonactionsubrewardsinhierarchicalreinforcementlearning AT lingxionghong rewardoptimizationmethodbasedonactionsubrewardsinhierarchicalreinforcementlearning AT cuizhiming rewardoptimizationmethodbasedonactionsubrewardsinhierarchicalreinforcementlearning

A Reward Optimization Method Based on Action Subrewards in Hierarchical Reinforcement Learning

Ejemplares similares