Cargando…

A hierarchical reinforcement learning method for missile evasion and guidance

This paper proposes an algorithm for missile manoeuvring based on a hierarchical proximal policy optimization (PPO) reinforcement learning algorithm, which enables a missile to guide to a target and evade an interceptor at the same time. Based on the idea of task hierarchy, the agent has a two-layer...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Mengda, Yang, Rennong, Zhang, Ying, Yue, Longfei, Hu, Dongyuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9640633/
https://www.ncbi.nlm.nih.gov/pubmed/36344598
http://dx.doi.org/10.1038/s41598-022-21756-6