
Evolving Robust Policy Coverage Sets in Multi-Objective Markov Decision Processes Through Intrinsically Motivated Self-Play

Many real-world decision-making problems involve multiple conflicting objectives that cannot be optimized simultaneously without a compromise. Such problems are known as multi-objective Markov decision processes, and they constitute a significant challenge for conventional single-objective reinforce...


Bibliographic Details
Main Authors:	Abdelfattah, Sherif; Kasmarik, Kathryn; Hu, Jiankun
Format:	Online Article Text
Language:	English
Published:	Frontiers Media S.A. 2018
Subjects:	Neuroscience
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6189603/ https://www.ncbi.nlm.nih.gov/pubmed/30356836 http://dx.doi.org/10.3389/fnbot.2018.00065
