Cargando…
Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules
This paper proposes Cooperative and competitive Reinforcement And Imitation Learning (CRAIL) for selecting an appropriate policy from a set of multiple heterogeneous modules and training all of them in parallel. Each learning module has its own network architecture and improves the policy based on a...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6170616/ https://www.ncbi.nlm.nih.gov/pubmed/30319389 http://dx.doi.org/10.3389/fnbot.2018.00061 |
_version_ | 1783360684316164096 |
---|---|
author | Uchibe, Eiji |
author_facet | Uchibe, Eiji |
author_sort | Uchibe, Eiji |
collection | PubMed |
description | This paper proposes Cooperative and competitive Reinforcement And Imitation Learning (CRAIL) for selecting an appropriate policy from a set of multiple heterogeneous modules and training all of them in parallel. Each learning module has its own network architecture and improves the policy based on an off-policy reinforcement learning algorithm and behavior cloning from samples collected by a behavior policy that is constructed by a combination of all the policies. Since the mixing weights are determined by the performance of the module, a better policy is automatically selected based on the learning progress. Experimental results on a benchmark control task show that CRAIL successfully achieves fast learning by allowing modules with complicated network structures to exploit task-relevant samples for training. |
format | Online Article Text |
id | pubmed-6170616 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-61706162018-10-12 Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules Uchibe, Eiji Front Neurorobot Robotics and AI This paper proposes Cooperative and competitive Reinforcement And Imitation Learning (CRAIL) for selecting an appropriate policy from a set of multiple heterogeneous modules and training all of them in parallel. Each learning module has its own network architecture and improves the policy based on an off-policy reinforcement learning algorithm and behavior cloning from samples collected by a behavior policy that is constructed by a combination of all the policies. Since the mixing weights are determined by the performance of the module, a better policy is automatically selected based on the learning progress. Experimental results on a benchmark control task show that CRAIL successfully achieves fast learning by allowing modules with complicated network structures to exploit task-relevant samples for training. Frontiers Media S.A. 2018-09-27 /pmc/articles/PMC6170616/ /pubmed/30319389 http://dx.doi.org/10.3389/fnbot.2018.00061 Text en Copyright © 2018 Uchibe. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Robotics and AI Uchibe, Eiji Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules |
title | Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules |
title_full | Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules |
title_fullStr | Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules |
title_full_unstemmed | Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules |
title_short | Cooperative and Competitive Reinforcement and Imitation Learning for a Mixture of Heterogeneous Learning Modules |
title_sort | cooperative and competitive reinforcement and imitation learning for a mixture of heterogeneous learning modules |
topic | Robotics and AI |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6170616/ https://www.ncbi.nlm.nih.gov/pubmed/30319389 http://dx.doi.org/10.3389/fnbot.2018.00061 |
work_keys_str_mv | AT uchibeeiji cooperativeandcompetitivereinforcementandimitationlearningforamixtureofheterogeneouslearningmodules |