Cargando…

Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules

This paper proposes Cooperative and competitive Reinforcement And Imitation Learning (CRAIL) for selecting an appropriate policy from a set of multiple heterogeneous modules and training all of them in parallel. Each learning module has its own network architecture and improves the policy based on a...

Descripción completa

Detalles Bibliográficos
Autor principal: Uchibe, Eiji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6170616/
https://www.ncbi.nlm.nih.gov/pubmed/30319389
http://dx.doi.org/10.3389/fnbot.2018.00061
_version_ 1783360684316164096
author Uchibe, Eiji
author_facet Uchibe, Eiji
author_sort Uchibe, Eiji
collection PubMed
description This paper proposes Cooperative and competitive Reinforcement And Imitation Learning (CRAIL) for selecting an appropriate policy from a set of multiple heterogeneous modules and training all of them in parallel. Each learning module has its own network architecture and improves the policy based on an off-policy reinforcement learning algorithm and behavior cloning from samples collected by a behavior policy that is constructed by a combination of all the policies. Since the mixing weights are determined by the performance of the module, a better policy is automatically selected based on the learning progress. Experimental results on a benchmark control task show that CRAIL successfully achieves fast learning by allowing modules with complicated network structures to exploit task-relevant samples for training.
format Online
Article
Text
id pubmed-6170616
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-61706162018-10-12 Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules Uchibe, Eiji Front Neurorobot Robotics and AI This paper proposes Cooperative and competitive Reinforcement And Imitation Learning (CRAIL) for selecting an appropriate policy from a set of multiple heterogeneous modules and training all of them in parallel. Each learning module has its own network architecture and improves the policy based on an off-policy reinforcement learning algorithm and behavior cloning from samples collected by a behavior policy that is constructed by a combination of all the policies. Since the mixing weights are determined by the performance of the module, a better policy is automatically selected based on the learning progress. Experimental results on a benchmark control task show that CRAIL successfully achieves fast learning by allowing modules with complicated network structures to exploit task-relevant samples for training. Frontiers Media S.A. 2018-09-27 /pmc/articles/PMC6170616/ /pubmed/30319389 http://dx.doi.org/10.3389/fnbot.2018.00061 Text en Copyright © 2018 Uchibe. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Robotics and AI
Uchibe, Eiji
Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules
title Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules
title_full Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules
title_fullStr Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules
title_full_unstemmed Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules
title_short Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules
title_sort cooperative and competitive reinforcement and imitation learning for a mixture of heterogeneous learning modules
topic Robotics and AI
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6170616/
https://www.ncbi.nlm.nih.gov/pubmed/30319389
http://dx.doi.org/10.3389/fnbot.2018.00061
work_keys_str_mv AT uchibeeiji cooperativeandcompetitivereinforcementandimitationlearningforamixtureofheterogeneouslearningmodules