Cargando…

Evolving Robust Policy Coverage Sets in Multi-Objective Markov Decision Processes Through Intrinsically Motivated Self-Play

Many real-world decision-making problems involve multiple conflicting objectives that can not be optimized simultaneously without a compromise. Such problems are known as multi-objective Markov decision processes and they constitute a significant challenge for conventional single-objective reinforce...

Descripción completa

Detalles Bibliográficos
Autores principales: Abdelfattah, Sherif, Kasmarik, Kathryn, Hu, Jiankun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6189603/
https://www.ncbi.nlm.nih.gov/pubmed/30356836
http://dx.doi.org/10.3389/fnbot.2018.00065