TY - GEN
T1 - Multi-Agent Learning with Policy Prediction
AU - Zhang, Chongjie
AU - Lesser, Victor
N1 - Publisher Copyright:
© 2010, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
PY - 2010/7/15
Y1 - 2010/7/15
N2 - Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting the basic gradient ascent approach with policy prediction. We prove that this augmentation results in a stronger notion of convergence than the basic gradient ascent, that is, strategies converge to a Nash equilibrium within a restricted class of iterated games. Motivated by this augmentation, we then propose a new practical multi-agent reinforcement learning (MARL) algorithm exploiting approximate policy prediction. Empirical results show that it converges faster and in a wider variety of situations than state-of-the-art MARL algorithms.
AB - Due to the non-stationary environment, learning in multi-agent systems is a challenging problem. This paper first introduces a new gradient-based learning algorithm, augmenting the basic gradient ascent approach with policy prediction. We prove that this augmentation results in a stronger notion of convergence than the basic gradient ascent, that is, strategies converge to a Nash equilibrium within a restricted class of iterated games. Motivated by this augmentation, we then propose a new practical multi-agent reinforcement learning (MARL) algorithm exploiting approximate policy prediction. Empirical results show that it converges faster and in a wider variety of situations than state-of-the-art MARL algorithms.
UR - https://www.scopus.com/pages/publications/85099723578
U2 - 10.1609/aaai.v24i1.7639
DO - 10.1609/aaai.v24i1.7639
M3 - Conference contribution
AN - SCOPUS:85099723578
T3 - Proceedings of the 24th AAAI Conference on Artificial Intelligence, AAAI 2010
SP - 927
EP - 934
BT - Proceedings of the 24th AAAI Conference on Artificial Intelligence, AAAI 2010
PB - AAAI press
T2 - 24th AAAI Conference on Artificial Intelligence, AAAI 2010
Y2 - 11 July 2010 through 15 July 2010
ER -