Cargando…
End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation
Developing artificial intelligence (AI) agents is challenging for efficient exploration in visually rich and complex environments. In this study, we formulate the exploration question as a reinforcement learning problem and rely on intrinsic motivation to guide exploration behavior. Such intrinsic m...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8702337/ https://www.ncbi.nlm.nih.gov/pubmed/34956359 http://dx.doi.org/10.1155/2021/9945044 |
_version_ | 1784621224630943744 |
---|---|
author | Ruan, Xiaogang Li, Peng Zhu, Xiaoqing Yu, Hejie Yu, Naigong |
author_facet | Ruan, Xiaogang Li, Peng Zhu, Xiaoqing Yu, Hejie Yu, Naigong |
author_sort | Ruan, Xiaogang |
collection | PubMed |
description | Developing artificial intelligence (AI) agents is challenging for efficient exploration in visually rich and complex environments. In this study, we formulate the exploration question as a reinforcement learning problem and rely on intrinsic motivation to guide exploration behavior. Such intrinsic motivation is driven by curiosity and is calculated based on episode memory. To distribute the intrinsic motivation, we use a count-based method and temporal distance to generate it synchronously. We tested our approach in 3D maze-like environments and validated its performance in exploration tasks through extensive experiments. The experimental results show that our agent can learn exploration ability from raw sensory input and accomplish autonomous exploration across different mazes. In addition, the learned policy is not biased by stochastic objects. We also analyze the effects of different training methods and driving forces on exploration policy. |
format | Online Article Text |
id | pubmed-8702337 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-87023372021-12-24 End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation Ruan, Xiaogang Li, Peng Zhu, Xiaoqing Yu, Hejie Yu, Naigong Comput Intell Neurosci Research Article Developing artificial intelligence (AI) agents is challenging for efficient exploration in visually rich and complex environments. In this study, we formulate the exploration question as a reinforcement learning problem and rely on intrinsic motivation to guide exploration behavior. Such intrinsic motivation is driven by curiosity and is calculated based on episode memory. To distribute the intrinsic motivation, we use a count-based method and temporal distance to generate it synchronously. We tested our approach in 3D maze-like environments and validated its performance in exploration tasks through extensive experiments. The experimental results show that our agent can learn exploration ability from raw sensory input and accomplish autonomous exploration across different mazes. In addition, the learned policy is not biased by stochastic objects. We also analyze the effects of different training methods and driving forces on exploration policy. Hindawi 2021-12-16 /pmc/articles/PMC8702337/ /pubmed/34956359 http://dx.doi.org/10.1155/2021/9945044 Text en Copyright © 2021 Xiaogang Ruan et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Ruan, Xiaogang Li, Peng Zhu, Xiaoqing Yu, Hejie Yu, Naigong End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation |
title | End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation |
title_full | End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation |
title_fullStr | End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation |
title_full_unstemmed | End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation |
title_short | End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation |
title_sort | end-to-end autonomous exploration with deep reinforcement learning and intrinsic motivation |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8702337/ https://www.ncbi.nlm.nih.gov/pubmed/34956359 http://dx.doi.org/10.1155/2021/9945044 |
work_keys_str_mv | AT ruanxiaogang endtoendautonomousexplorationwithdeepreinforcementlearningandintrinsicmotivation AT lipeng endtoendautonomousexplorationwithdeepreinforcementlearningandintrinsicmotivation AT zhuxiaoqing endtoendautonomousexplorationwithdeepreinforcementlearningandintrinsicmotivation AT yuhejie endtoendautonomousexplorationwithdeepreinforcementlearningandintrinsicmotivation AT yunaigong endtoendautonomousexplorationwithdeepreinforcementlearningandintrinsicmotivation |