Cargando…

Transformer Decoder-Based Enhanced Exploration Method to Alleviate Initial Exploration Problems in Reinforcement Learning

In reinforcement learning, the epsilon (ε)-greedy strategy is commonly employed as an exploration technique This method, however, leads to extensive initial exploration and prolonged learning periods. Existing approaches to mitigate this issue involve constraining the exploration range using expert...

Descripción completa

Detalles Bibliográficos
Autores principales: Kyoung, Dohyun, Sung, Yunsick
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10490608/
https://www.ncbi.nlm.nih.gov/pubmed/37687867
http://dx.doi.org/10.3390/s23177411