Cargando…

Simple Strategies in Multi-Objective MDPs

We consider the verification of multiple expected reward objectives at once on Markov decision processes (MDPs). This enables a trade-off analysis among multiple objectives by obtaining a Pareto front. We focus on strategies that are easy to employ and implement. That is, strategies that are pure (n...

Descripción completa

Detalles Bibliográficos
Autores principales: Delgrange, Florent, Katoen, Joost-Pieter, Quatmann, Tim, Randour, Mickael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7439746/
http://dx.doi.org/10.1007/978-3-030-45190-5_19
_version_ 1783573042981502976
author Delgrange, Florent
Katoen, Joost-Pieter
Quatmann, Tim
Randour, Mickael
author_facet Delgrange, Florent
Katoen, Joost-Pieter
Quatmann, Tim
Randour, Mickael
author_sort Delgrange, Florent
collection PubMed
description We consider the verification of multiple expected reward objectives at once on Markov decision processes (MDPs). This enables a trade-off analysis among multiple objectives by obtaining a Pareto front. We focus on strategies that are easy to employ and implement. That is, strategies that are pure (no randomization) and have bounded memory. We show that checking whether a point is achievable by a pure stationary strategy is NP-complete, even for two objectives, and we provide an MILP encoding to solve the corresponding problem. The bounded memory case is treated by a product construction. Experimental results using Storm and Gurobi show the feasibility of our algorithms.
format Online
Article
Text
id pubmed-7439746
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-74397462020-08-21 Simple Strategies in Multi-Objective MDPs Delgrange, Florent Katoen, Joost-Pieter Quatmann, Tim Randour, Mickael Tools and Algorithms for the Construction and Analysis of Systems Article We consider the verification of multiple expected reward objectives at once on Markov decision processes (MDPs). This enables a trade-off analysis among multiple objectives by obtaining a Pareto front. We focus on strategies that are easy to employ and implement. That is, strategies that are pure (no randomization) and have bounded memory. We show that checking whether a point is achievable by a pure stationary strategy is NP-complete, even for two objectives, and we provide an MILP encoding to solve the corresponding problem. The bounded memory case is treated by a product construction. Experimental results using Storm and Gurobi show the feasibility of our algorithms. 2020-03-13 /pmc/articles/PMC7439746/ http://dx.doi.org/10.1007/978-3-030-45190-5_19 Text en © The Author(s) 2020 Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made. The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
spellingShingle Article
Delgrange, Florent
Katoen, Joost-Pieter
Quatmann, Tim
Randour, Mickael
Simple Strategies in Multi-Objective MDPs
title Simple Strategies in Multi-Objective MDPs
title_full Simple Strategies in Multi-Objective MDPs
title_fullStr Simple Strategies in Multi-Objective MDPs
title_full_unstemmed Simple Strategies in Multi-Objective MDPs
title_short Simple Strategies in Multi-Objective MDPs
title_sort simple strategies in multi-objective mdps
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7439746/
http://dx.doi.org/10.1007/978-3-030-45190-5_19
work_keys_str_mv AT delgrangeflorent simplestrategiesinmultiobjectivemdps
AT katoenjoostpieter simplestrategiesinmultiobjectivemdps
AT quatmanntim simplestrategiesinmultiobjectivemdps
AT randourmickael simplestrategiesinmultiobjectivemdps