TY - GEN
T1 - Stochastic search methods for Nash equilibrium approximation in simulation-based games
AU - Vorobeychik, Yevgeniy
AU - Wellman, Michael P.
PY - 2008
Y1 - 2008
N2 - We define the class of games called simulation-based games, in which the payoffs are available as an output of an oracle (simulator), rather than specified analytically or using a payoff matrix. We then describe a convergent algorithm based on a hierarchical application of simulated annealing for estimating Nash equilibria in simulation-based games with finite-dimensional strategy sets. Additionally, we present alternative algorithms for best response and Nash equilibrium estimation, with a particular focus on one-shot infinite games of incomplete information. Our experimental results demonstrate that all the approaches we introduce are efficacious, albeit some more so than others. We show, for example, that while iterative best response dynamics has relatively weak convergence guarantees, it outperforms our convergent method experimentally. Additionally, we provide considerable evidence that a method based on random search outperforms gradient descent in our setting.
AB - We define the class of games called simulation-based games, in which the payoffs are available as an output of an oracle (simulator), rather than specified analytically or using a payoff matrix. We then describe a convergent algorithm based on a hierarchical application of simulated annealing for estimating Nash equilibria in simulation-based games with finite-dimensional strategy sets. Additionally, we present alternative algorithms for best response and Nash equilibrium estimation, with a particular focus on one-shot infinite games of incomplete information. Our experimental results demonstrate that all the approaches we introduce are efficacious, albeit some more so than others. We show, for example, that while iterative best response dynamics has relatively weak convergence guarantees, it outperforms our convergent method experimentally. Additionally, we provide considerable evidence that a method based on random search outperforms gradient descent in our setting.
KW - Approximate equilibria
KW - Empirical game
KW - Heuristic search
UR - https://www.scopus.com/pages/publications/84899992081
M3 - Conference contribution
AN - SCOPUS:84899992081
SN - 9781605604701
T3 - Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS
SP - 1037
EP - 1044
BT - 7th International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS 2008
PB - International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS)
T2 - 7th International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS 2008
Y2 - 12 May 2008 through 16 May 2008
ER -