An Improved Distributed Sampling PPO Algorithm Based on Beta Policy for Continuous Global Path Planning Scheme

Traditional path planning is mainly utilized for path planning in discrete action space, which results in incomplete ship navigation power propulsion strategies during the path search process. Moreover, reinforcement learning experiences low success rates due to its unbalanced sample collection and...

Bibliographic Details
Main Authors: Xiao, Qianhao; Jiang, Li; Wang, Manman; Zhang, Xin
Format: Online Article Text
Language: English
Published: MDPI 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10346433/
https://www.ncbi.nlm.nih.gov/pubmed/37447949
http://dx.doi.org/10.3390/s23136101