Cargando…

Strategy improvement for concurrent reachability and turn-based stochastic safety games()()

We consider concurrent games played on graphs. At every round of a game, each player simultaneously and independently selects a move; the moves jointly determine the transition to a successor state. Two basic objectives are the safety objective to stay forever in a given set of states, and its dual,...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chatterjee, Krishnendu, de Alfaro, Luca, Henzinger, Thomas A.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Elsevier B.V 2013
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4579905/ https://www.ncbi.nlm.nih.gov/pubmed/26516289 http://dx.doi.org/10.1016/j.jcss.2012.12.001

_version_	1782391338624352256
author	Chatterjee, Krishnendu de Alfaro, Luca Henzinger, Thomas A.
author_facet	Chatterjee, Krishnendu de Alfaro, Luca Henzinger, Thomas A.
author_sort	Chatterjee, Krishnendu
collection	PubMed
description	We consider concurrent games played on graphs. At every round of a game, each player simultaneously and independently selects a move; the moves jointly determine the transition to a successor state. Two basic objectives are the safety objective to stay forever in a given set of states, and its dual, the reachability objective to reach a given set of states. First, we present a simple proof of the fact that in concurrent reachability games, for all [Formula: see text] , memoryless ε-optimal strategies exist. A memoryless strategy is independent of the history of plays, and an ε-optimal strategy achieves the objective with probability within ε of the value of the game. In contrast to previous proofs of this fact, our proof is more elementary and more combinatorial. Second, we present a strategy-improvement (a.k.a. policy-iteration) algorithm for concurrent games with reachability objectives. Finally, we present a strategy-improvement algorithm for turn-based stochastic games (where each player selects moves in turns) with safety objectives. Our algorithms yield sequences of player-1 strategies which ensure probabilities of winning that converge monotonically (from below) to the value of the game.
format	Online Article Text
id	pubmed-4579905
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	Elsevier B.V
record_format	MEDLINE/PubMed
spelling	pubmed-45799052015-10-27 Strategy improvement for concurrent reachability and turn-based stochastic safety games()() Chatterjee, Krishnendu de Alfaro, Luca Henzinger, Thomas A. J Comput Syst Sci Article We consider concurrent games played on graphs. At every round of a game, each player simultaneously and independently selects a move; the moves jointly determine the transition to a successor state. Two basic objectives are the safety objective to stay forever in a given set of states, and its dual, the reachability objective to reach a given set of states. First, we present a simple proof of the fact that in concurrent reachability games, for all [Formula: see text] , memoryless ε-optimal strategies exist. A memoryless strategy is independent of the history of plays, and an ε-optimal strategy achieves the objective with probability within ε of the value of the game. In contrast to previous proofs of this fact, our proof is more elementary and more combinatorial. Second, we present a strategy-improvement (a.k.a. policy-iteration) algorithm for concurrent games with reachability objectives. Finally, we present a strategy-improvement algorithm for turn-based stochastic games (where each player selects moves in turns) with safety objectives. Our algorithms yield sequences of player-1 strategies which ensure probabilities of winning that converge monotonically (from below) to the value of the game. Elsevier B.V 2013-08 /pmc/articles/PMC4579905/ /pubmed/26516289 http://dx.doi.org/10.1016/j.jcss.2012.12.001 Text en © 2013 Elsevier Inc. https://creativecommons.org/licenses/by-nc-nd/3.0/This is an open access article under the CC BY NC ND license (https://creativecommons.org/licenses/by-nc-nd/3.0/).
spellingShingle	Article Chatterjee, Krishnendu de Alfaro, Luca Henzinger, Thomas A. Strategy improvement for concurrent reachability and turn-based stochastic safety games()()
title	Strategy improvement for concurrent reachability and turn-based stochastic safety games()()
title_full	Strategy improvement for concurrent reachability and turn-based stochastic safety games()()
title_fullStr	Strategy improvement for concurrent reachability and turn-based stochastic safety games()()
title_full_unstemmed	Strategy improvement for concurrent reachability and turn-based stochastic safety games()()
title_short	Strategy improvement for concurrent reachability and turn-based stochastic safety games()()
title_sort	strategy improvement for concurrent reachability and turn-based stochastic safety games()()
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4579905/ https://www.ncbi.nlm.nih.gov/pubmed/26516289 http://dx.doi.org/10.1016/j.jcss.2012.12.001
work_keys_str_mv	AT chatterjeekrishnendu strategyimprovementforconcurrentreachabilityandturnbasedstochasticsafetygames AT dealfaroluca strategyimprovementforconcurrentreachabilityandturnbasedstochasticsafetygames AT henzingerthomasa strategyimprovementforconcurrentreachabilityandturnbasedstochasticsafetygames

Strategy improvement for concurrent reachability and turn-based stochastic safety games()()

Ejemplares similares