MPI-INF Logo
Campus Event Calendar

Event Entry

New for: D3

What and Who

Stochastic games with perfect information: the existence of canonical form

Khaled Elbassioni
Max-Planck-Institut für Informatik - D1
AG1 Mittagsseminar (own work)
AG 1, AG 4, RG1, MMCI, AG 3, AG 5, SWS  
AG Audience
English

Date, Time and Location

Tuesday, 24 November 2009
13:15
30 Minutes
E1 4
024
Saarbrücken

Abstract

We consider the well-known class of two-player, zero-sum, turn-based, stochastic games with perfect information introduced in the classical work of Gillette (1957). In such a game, there are two players White or maximizer and Black or minimizer, which control a finite set of states $V$, partitioned into two sets, each controlled by one of the players. At each state $u\in V$, the controlling player chooses one of a finite set of possible actions, according to which a real-valued reward is paid by Black to White, and then a new state is reached with a certain probability which depends on the action. The play continues forever, and White's (Black's) objective is to maximize (respectively, to minimize) her limiting average payoff.


The fact that a saddle point exists in pure positional strategies was proved by Gillette in 1957 and Liggett and Lippman in 1969
by considering the discounted version, in which the payoff of White is discounted by a factor $\gb^i$ at step $i$ and then proceeding to the limit as the discount factor $\gb\in[0,1)$ goes to $1$.

This class of games lies in the complexity class NP$\cap$CoNP and contains the well-known cyclic games (when $V_R$ is empty) and
Markov decision processes (when $V_B$ or $V_W$ is empty).

We show that every such game can be reduced by a potential transformation to a canonical form in which locally optimal strategies are globally optimal, and hence the value for every initial position and the optimal strategies of both players are obvious.

In the discounted case the transformed game values are also ergodic (i.e. do not depend on the initial position) and the corresponding potentials can be found in polynomial time.

Yet, this time tends to infinity, as $\gb \rightarrow 1^-$.


This is joint work with Endre Boros, Vladimir Gurvich and Kazuhisa Makino.

Contact

Khaled Elbassioni
--email hidden
passcode not visible
logged in users only

Khaled Elbassioni, 11/23/2009 19:39
Khaled Elbassioni, 10/19/2009 10:15
Khaled Elbassioni, 10/17/2009 14:05 -- Created document.