Combining dynamic reward shaping and action shaping for coordinating multi-agent learning

  • Xiangbin Zhu
  • , Chongjie Zhang
  • , Victor Lesser

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Coordinating multi-agent reinforcement learning provides a promising approach to scaling learning in large cooperative multi-agent systems. It allows agents to learn local decision policies based on their local observations and rewards, and, meanwhile, coordinates agents' learning processes to ensure the global learning performance. One key question is that how coordination mechanisms impact learning algorithms so that agents' learning processes are guided and coordinated. This paper presents a new shaping approach that effectively integrates coordination mechanisms into local learning processes. This shaping approach uses two-level agent organization structures and combines reward shaping and action shaping. The higher-level agents dynamically and periodically produce the shaping heuristic knowledge based on the learning status of the lower-level agents. The lower-level agents then uses this knowledge to coordinate their local learning processes with other agents. Experimental results show our approach effectively speeds up the convergence of multi-agent learning in large systems.

Original languageEnglish
Title of host publicationProceedings - 2013 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, IAT 2013
PublisherIEEE Computer Society
Pages321-328
Number of pages8
ISBN (Print)9781479929023
DOIs
StatePublished - 2013
Event2013 12th IEEE/WIC/ACM International Conference on Intelligent Agent Technology, IAT 2013 - Atlanta, GA, United States
Duration: Nov 17 2013Nov 20 2013

Publication series

NameProceedings - 2013 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, IAT 2013
Volume2

Conference

Conference2013 12th IEEE/WIC/ACM International Conference on Intelligent Agent Technology, IAT 2013
Country/TerritoryUnited States
CityAtlanta, GA
Period11/17/1311/20/13

Keywords

  • Action shaping
  • Multi-agent learning
  • Organization control
  • Reward shaping
  • Supervision

Fingerprint

Dive into the research topics of 'Combining dynamic reward shaping and action shaping for coordinating multi-agent learning'. Together they form a unique fingerprint.

Cite this