Cargando…

Constructing Temporally Extended Actions through Incremental Community Detection

Hierarchical reinforcement learning works on temporally extended actions or skills to facilitate learning. How to automatically form such abstraction is challenging, and many efforts tackle this issue in the options framework. While various approaches exist to construct options from different perspe...

Descripción completa

Detalles Bibliográficos
Autores principales:	Xu, Xiao, Yang, Mei, Li, Ge
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi 2018
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5937602/ https://www.ncbi.nlm.nih.gov/pubmed/29849543 http://dx.doi.org/10.1155/2018/2085721

_version_	1783320655223062528
author	Xu, Xiao Yang, Mei Li, Ge
author_facet	Xu, Xiao Yang, Mei Li, Ge
author_sort	Xu, Xiao
collection	PubMed
description	Hierarchical reinforcement learning works on temporally extended actions or skills to facilitate learning. How to automatically form such abstraction is challenging, and many efforts tackle this issue in the options framework. While various approaches exist to construct options from different perspectives, few of them concentrate on options' adaptability during learning. This paper presents an algorithm to create options and enhance their quality online. Both aspects operate on detected communities of the learning environment's state transition graph. We first construct options from initial samples as the basis of online learning. Then a rule-based community revision algorithm is proposed to update graph partitions, based on which existing options can be continuously tuned. Experimental results in two problems indicate that options from initial samples may perform poorly in more complex environments, and our presented strategy can effectively improve options and get better results compared with flat reinforcement learning.
format	Online Article Text
id	pubmed-5937602
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Hindawi
record_format	MEDLINE/PubMed
spelling	pubmed-59376022018-05-30 Constructing Temporally Extended Actions through Incremental Community Detection Xu, Xiao Yang, Mei Li, Ge Comput Intell Neurosci Research Article Hierarchical reinforcement learning works on temporally extended actions or skills to facilitate learning. How to automatically form such abstraction is challenging, and many efforts tackle this issue in the options framework. While various approaches exist to construct options from different perspectives, few of them concentrate on options' adaptability during learning. This paper presents an algorithm to create options and enhance their quality online. Both aspects operate on detected communities of the learning environment's state transition graph. We first construct options from initial samples as the basis of online learning. Then a rule-based community revision algorithm is proposed to update graph partitions, based on which existing options can be continuously tuned. Experimental results in two problems indicate that options from initial samples may perform poorly in more complex environments, and our presented strategy can effectively improve options and get better results compared with flat reinforcement learning. Hindawi 2018-04-23 /pmc/articles/PMC5937602/ /pubmed/29849543 http://dx.doi.org/10.1155/2018/2085721 Text en Copyright © 2018 Xiao Xu et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Article Xu, Xiao Yang, Mei Li, Ge Constructing Temporally Extended Actions through Incremental Community Detection
title	Constructing Temporally Extended Actions through Incremental Community Detection
title_full	Constructing Temporally Extended Actions through Incremental Community Detection
title_fullStr	Constructing Temporally Extended Actions through Incremental Community Detection
title_full_unstemmed	Constructing Temporally Extended Actions through Incremental Community Detection
title_short	Constructing Temporally Extended Actions through Incremental Community Detection
title_sort	constructing temporally extended actions through incremental community detection
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5937602/ https://www.ncbi.nlm.nih.gov/pubmed/29849543 http://dx.doi.org/10.1155/2018/2085721
work_keys_str_mv	AT xuxiao constructingtemporallyextendedactionsthroughincrementalcommunitydetection AT yangmei constructingtemporallyextendedactionsthroughincrementalcommunitydetection AT lige constructingtemporallyextendedactionsthroughincrementalcommunitydetection

Constructing Temporally Extended Actions through Incremental Community Detection

Ejemplares similares