Cargando…

Simple Strategies in Multi-Objective MDPs

We consider the verification of multiple expected reward objectives at once on Markov decision processes (MDPs). This enables a trade-off analysis among multiple objectives by obtaining a Pareto front. We focus on strategies that are easy to employ and implement. That is, strategies that are pure (n...

Descripción completa

Detalles Bibliográficos
Autores principales:	Delgrange, Florent, Katoen, Joost-Pieter, Quatmann, Tim, Randour, Mickael
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7439746/ http://dx.doi.org/10.1007/978-3-030-45190-5_19

_version_	1783573042981502976
author	Delgrange, Florent Katoen, Joost-Pieter Quatmann, Tim Randour, Mickael
author_facet	Delgrange, Florent Katoen, Joost-Pieter Quatmann, Tim Randour, Mickael
author_sort	Delgrange, Florent
collection	PubMed
description	We consider the verification of multiple expected reward objectives at once on Markov decision processes (MDPs). This enables a trade-off analysis among multiple objectives by obtaining a Pareto front. We focus on strategies that are easy to employ and implement. That is, strategies that are pure (no randomization) and have bounded memory. We show that checking whether a point is achievable by a pure stationary strategy is NP-complete, even for two objectives, and we provide an MILP encoding to solve the corresponding problem. The bounded memory case is treated by a product construction. Experimental results using Storm and Gurobi show the feasibility of our algorithms.
format	Online Article Text
id	pubmed-7439746
institution	National Center for Biotechnology Information
language	English
publishDate	2020
record_format	MEDLINE/PubMed
spelling	pubmed-74397462020-08-21 Simple Strategies in Multi-Objective MDPs Delgrange, Florent Katoen, Joost-Pieter Quatmann, Tim Randour, Mickael Tools and Algorithms for the Construction and Analysis of Systems Article We consider the verification of multiple expected reward objectives at once on Markov decision processes (MDPs). This enables a trade-off analysis among multiple objectives by obtaining a Pareto front. We focus on strategies that are easy to employ and implement. That is, strategies that are pure (no randomization) and have bounded memory. We show that checking whether a point is achievable by a pure stationary strategy is NP-complete, even for two objectives, and we provide an MILP encoding to solve the corresponding problem. The bounded memory case is treated by a product construction. Experimental results using Storm and Gurobi show the feasibility of our algorithms. 2020-03-13 /pmc/articles/PMC7439746/ http://dx.doi.org/10.1007/978-3-030-45190-5_19 Text en © The Author(s) 2020 Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made. The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
spellingShingle	Article Delgrange, Florent Katoen, Joost-Pieter Quatmann, Tim Randour, Mickael Simple Strategies in Multi-Objective MDPs
title	Simple Strategies in Multi-Objective MDPs
title_full	Simple Strategies in Multi-Objective MDPs
title_fullStr	Simple Strategies in Multi-Objective MDPs
title_full_unstemmed	Simple Strategies in Multi-Objective MDPs
title_short	Simple Strategies in Multi-Objective MDPs
title_sort	simple strategies in multi-objective mdps
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7439746/ http://dx.doi.org/10.1007/978-3-030-45190-5_19
work_keys_str_mv	AT delgrangeflorent simplestrategiesinmultiobjectivemdps AT katoenjoostpieter simplestrategiesinmultiobjectivemdps AT quatmanntim simplestrategiesinmultiobjectivemdps AT randourmickael simplestrategiesinmultiobjectivemdps

Simple Strategies in Multi-Objective MDPs

Ejemplares similares