Arrow Research search
Back to AAMAS

AAMAS 2022

A Simulation Based Online Planning Algorithm for Multi-Agent Cooperative Environments

Conference Paper Extended Abstracts Autonomous Agents and Multiagent Systems

Abstract

Multi-agent Markov Decision Process (MMDP) has been an effective way of modelling sequential decision making algorithms for multi-agent cooperative environments. However, challenges such as exponential size of action space and dynamic changes limit the efficacy of proposed solutions. This paper propose a scalable and robust algorithm that can effectively solve MMDPs in real time. Simulation, pruning, and prediction are the three key components of the algorithm. The simulation component enables real time solutions by using a novel iterative pruning technique which in turn makes use of the prediction component trained with self play data. The algorithm is self-sustained as it generates new training data from simulation and gradually becomes better. Furthermore, we show empirical results demonstrating the capabilities of the algorithm and compare them with existing MMDP solvers.

Authors

Keywords

  • Cooperation Learning
  • Multi-agent Markov Decision Process
  • Transfer Learning
  • Graph Convolutional Network

Context

Venue
International Conference on Autonomous Agents and Multiagent Systems
Archive span
2002-2025
Indexed papers
7403
Paper id
651189448553454125