Cargando…

Heuristic Q-learning based on experience replay for three-dimensional path planning of the unmanned aerial vehicle

In order to solve the problem that the existing reinforcement learning algorithm is difficult to converge due to the excessive state space of the three-dimensional path planning of the unmanned aerial vehicle, this article proposes a reinforcement learning algorithm based on the heuristic function a...

Descripción completa

Detalles Bibliográficos
Autores principales: Xie, Ronglei, Meng, Zhijun, Zhou, Yaoming, Ma, Yunpeng, Wu, Zhe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10453672/
https://www.ncbi.nlm.nih.gov/pubmed/31829875
http://dx.doi.org/10.1177/0036850419879024