Cargando…

End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation

Developing artificial intelligence (AI) agents is challenging for efficient exploration in visually rich and complex environments. In this study, we formulate the exploration question as a reinforcement learning problem and rely on intrinsic motivation to guide exploration behavior. Such intrinsic m...

Descripción completa

Detalles Bibliográficos
Autores principales: Ruan, Xiaogang, Li, Peng, Zhu, Xiaoqing, Yu, Hejie, Yu, Naigong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8702337/
https://www.ncbi.nlm.nih.gov/pubmed/34956359
http://dx.doi.org/10.1155/2021/9945044
_version_ 1784621224630943744
author Ruan, Xiaogang
Li, Peng
Zhu, Xiaoqing
Yu, Hejie
Yu, Naigong
author_facet Ruan, Xiaogang
Li, Peng
Zhu, Xiaoqing
Yu, Hejie
Yu, Naigong
author_sort Ruan, Xiaogang
collection PubMed
description Developing artificial intelligence (AI) agents is challenging for efficient exploration in visually rich and complex environments. In this study, we formulate the exploration question as a reinforcement learning problem and rely on intrinsic motivation to guide exploration behavior. Such intrinsic motivation is driven by curiosity and is calculated based on episode memory. To distribute the intrinsic motivation, we use a count-based method and temporal distance to generate it synchronously. We tested our approach in 3D maze-like environments and validated its performance in exploration tasks through extensive experiments. The experimental results show that our agent can learn exploration ability from raw sensory input and accomplish autonomous exploration across different mazes. In addition, the learned policy is not biased by stochastic objects. We also analyze the effects of different training methods and driving forces on exploration policy.
format Online
Article
Text
id pubmed-8702337
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-87023372021-12-24 End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation Ruan, Xiaogang Li, Peng Zhu, Xiaoqing Yu, Hejie Yu, Naigong Comput Intell Neurosci Research Article Developing artificial intelligence (AI) agents is challenging for efficient exploration in visually rich and complex environments. In this study, we formulate the exploration question as a reinforcement learning problem and rely on intrinsic motivation to guide exploration behavior. Such intrinsic motivation is driven by curiosity and is calculated based on episode memory. To distribute the intrinsic motivation, we use a count-based method and temporal distance to generate it synchronously. We tested our approach in 3D maze-like environments and validated its performance in exploration tasks through extensive experiments. The experimental results show that our agent can learn exploration ability from raw sensory input and accomplish autonomous exploration across different mazes. In addition, the learned policy is not biased by stochastic objects. We also analyze the effects of different training methods and driving forces on exploration policy. Hindawi 2021-12-16 /pmc/articles/PMC8702337/ /pubmed/34956359 http://dx.doi.org/10.1155/2021/9945044 Text en Copyright © 2021 Xiaogang Ruan et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Ruan, Xiaogang
Li, Peng
Zhu, Xiaoqing
Yu, Hejie
Yu, Naigong
End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation
title End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation
title_full End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation
title_fullStr End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation
title_full_unstemmed End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation
title_short End-to-End Autonomous Exploration with Deep Reinforcement Learning and Intrinsic Motivation
title_sort end-to-end autonomous exploration with deep reinforcement learning and intrinsic motivation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8702337/
https://www.ncbi.nlm.nih.gov/pubmed/34956359
http://dx.doi.org/10.1155/2021/9945044
work_keys_str_mv AT ruanxiaogang endtoendautonomousexplorationwithdeepreinforcementlearningandintrinsicmotivation
AT lipeng endtoendautonomousexplorationwithdeepreinforcementlearningandintrinsicmotivation
AT zhuxiaoqing endtoendautonomousexplorationwithdeepreinforcementlearningandintrinsicmotivation
AT yuhejie endtoendautonomousexplorationwithdeepreinforcementlearningandintrinsicmotivation
AT yunaigong endtoendautonomousexplorationwithdeepreinforcementlearningandintrinsicmotivation