Cargando…
Strategy improvement for concurrent reachability and turn-based stochastic safety games()()
We consider concurrent games played on graphs. At every round of a game, each player simultaneously and independently selects a move; the moves jointly determine the transition to a successor state. Two basic objectives are the safety objective to stay forever in a given set of states, and its dual,...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier B.V
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4579905/ https://www.ncbi.nlm.nih.gov/pubmed/26516289 http://dx.doi.org/10.1016/j.jcss.2012.12.001 |
_version_ | 1782391338624352256 |
---|---|
author | Chatterjee, Krishnendu de Alfaro, Luca Henzinger, Thomas A. |
author_facet | Chatterjee, Krishnendu de Alfaro, Luca Henzinger, Thomas A. |
author_sort | Chatterjee, Krishnendu |
collection | PubMed |
description | We consider concurrent games played on graphs. At every round of a game, each player simultaneously and independently selects a move; the moves jointly determine the transition to a successor state. Two basic objectives are the safety objective to stay forever in a given set of states, and its dual, the reachability objective to reach a given set of states. First, we present a simple proof of the fact that in concurrent reachability games, for all [Formula: see text] , memoryless ε-optimal strategies exist. A memoryless strategy is independent of the history of plays, and an ε-optimal strategy achieves the objective with probability within ε of the value of the game. In contrast to previous proofs of this fact, our proof is more elementary and more combinatorial. Second, we present a strategy-improvement (a.k.a. policy-iteration) algorithm for concurrent games with reachability objectives. Finally, we present a strategy-improvement algorithm for turn-based stochastic games (where each player selects moves in turns) with safety objectives. Our algorithms yield sequences of player-1 strategies which ensure probabilities of winning that converge monotonically (from below) to the value of the game. |
format | Online Article Text |
id | pubmed-4579905 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Elsevier B.V |
record_format | MEDLINE/PubMed |
spelling | pubmed-45799052015-10-27 Strategy improvement for concurrent reachability and turn-based stochastic safety games()() Chatterjee, Krishnendu de Alfaro, Luca Henzinger, Thomas A. J Comput Syst Sci Article We consider concurrent games played on graphs. At every round of a game, each player simultaneously and independently selects a move; the moves jointly determine the transition to a successor state. Two basic objectives are the safety objective to stay forever in a given set of states, and its dual, the reachability objective to reach a given set of states. First, we present a simple proof of the fact that in concurrent reachability games, for all [Formula: see text] , memoryless ε-optimal strategies exist. A memoryless strategy is independent of the history of plays, and an ε-optimal strategy achieves the objective with probability within ε of the value of the game. In contrast to previous proofs of this fact, our proof is more elementary and more combinatorial. Second, we present a strategy-improvement (a.k.a. policy-iteration) algorithm for concurrent games with reachability objectives. Finally, we present a strategy-improvement algorithm for turn-based stochastic games (where each player selects moves in turns) with safety objectives. Our algorithms yield sequences of player-1 strategies which ensure probabilities of winning that converge monotonically (from below) to the value of the game. Elsevier B.V 2013-08 /pmc/articles/PMC4579905/ /pubmed/26516289 http://dx.doi.org/10.1016/j.jcss.2012.12.001 Text en © 2013 Elsevier Inc. https://creativecommons.org/licenses/by-nc-nd/3.0/This is an open access article under the CC BY NC ND license (https://creativecommons.org/licenses/by-nc-nd/3.0/). |
spellingShingle | Article Chatterjee, Krishnendu de Alfaro, Luca Henzinger, Thomas A. Strategy improvement for concurrent reachability and turn-based stochastic safety games()() |
title | Strategy improvement for concurrent reachability and turn-based stochastic safety games()() |
title_full | Strategy improvement for concurrent reachability and turn-based stochastic safety games()() |
title_fullStr | Strategy improvement for concurrent reachability and turn-based stochastic safety games()() |
title_full_unstemmed | Strategy improvement for concurrent reachability and turn-based stochastic safety games()() |
title_short | Strategy improvement for concurrent reachability and turn-based stochastic safety games()() |
title_sort | strategy improvement for concurrent reachability and turn-based stochastic safety games()() |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4579905/ https://www.ncbi.nlm.nih.gov/pubmed/26516289 http://dx.doi.org/10.1016/j.jcss.2012.12.001 |
work_keys_str_mv | AT chatterjeekrishnendu strategyimprovementforconcurrentreachabilityandturnbasedstochasticsafetygames AT dealfaroluca strategyimprovementforconcurrentreachabilityandturnbasedstochasticsafetygames AT henzingerthomasa strategyimprovementforconcurrentreachabilityandturnbasedstochasticsafetygames |