Cargando…
Simple Strategies in Multi-Objective MDPs
We consider the verification of multiple expected reward objectives at once on Markov decision processes (MDPs). This enables a trade-off analysis among multiple objectives by obtaining a Pareto front. We focus on strategies that are easy to employ and implement. That is, strategies that are pure (n...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7439746/ http://dx.doi.org/10.1007/978-3-030-45190-5_19 |
_version_ | 1783573042981502976 |
---|---|
author | Delgrange, Florent Katoen, Joost-Pieter Quatmann, Tim Randour, Mickael |
author_facet | Delgrange, Florent Katoen, Joost-Pieter Quatmann, Tim Randour, Mickael |
author_sort | Delgrange, Florent |
collection | PubMed |
description | We consider the verification of multiple expected reward objectives at once on Markov decision processes (MDPs). This enables a trade-off analysis among multiple objectives by obtaining a Pareto front. We focus on strategies that are easy to employ and implement. That is, strategies that are pure (no randomization) and have bounded memory. We show that checking whether a point is achievable by a pure stationary strategy is NP-complete, even for two objectives, and we provide an MILP encoding to solve the corresponding problem. The bounded memory case is treated by a product construction. Experimental results using Storm and Gurobi show the feasibility of our algorithms. |
format | Online Article Text |
id | pubmed-7439746 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
record_format | MEDLINE/PubMed |
spelling | pubmed-74397462020-08-21 Simple Strategies in Multi-Objective MDPs Delgrange, Florent Katoen, Joost-Pieter Quatmann, Tim Randour, Mickael Tools and Algorithms for the Construction and Analysis of Systems Article We consider the verification of multiple expected reward objectives at once on Markov decision processes (MDPs). This enables a trade-off analysis among multiple objectives by obtaining a Pareto front. We focus on strategies that are easy to employ and implement. That is, strategies that are pure (no randomization) and have bounded memory. We show that checking whether a point is achievable by a pure stationary strategy is NP-complete, even for two objectives, and we provide an MILP encoding to solve the corresponding problem. The bounded memory case is treated by a product construction. Experimental results using Storm and Gurobi show the feasibility of our algorithms. 2020-03-13 /pmc/articles/PMC7439746/ http://dx.doi.org/10.1007/978-3-030-45190-5_19 Text en © The Author(s) 2020 Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made. The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. |
spellingShingle | Article Delgrange, Florent Katoen, Joost-Pieter Quatmann, Tim Randour, Mickael Simple Strategies in Multi-Objective MDPs |
title | Simple Strategies in Multi-Objective MDPs |
title_full | Simple Strategies in Multi-Objective MDPs |
title_fullStr | Simple Strategies in Multi-Objective MDPs |
title_full_unstemmed | Simple Strategies in Multi-Objective MDPs |
title_short | Simple Strategies in Multi-Objective MDPs |
title_sort | simple strategies in multi-objective mdps |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7439746/ http://dx.doi.org/10.1007/978-3-030-45190-5_19 |
work_keys_str_mv | AT delgrangeflorent simplestrategiesinmultiobjectivemdps AT katoenjoostpieter simplestrategiesinmultiobjectivemdps AT quatmanntim simplestrategiesinmultiobjectivemdps AT randourmickael simplestrategiesinmultiobjectivemdps |