A Simulation Based Online Planning Algorithm for Multi-Agent Cooperative Environments

Rafid Ameer Mahmud; Fahim Faisal; Saaduddin Mahmud; Md. Mosaddek Khan

Back to AAMAS

AAMAS 2022

A Simulation Based Online Planning Algorithm for Multi-Agent Cooperative Environments

Conference Paper Extended Abstracts Autonomous Agents and Multiagent Systems

PDF

Abstract

Multi-agent Markov Decision Process (MMDP) has been an effective way of modelling sequential decision making algorithms for multi-agent cooperative environments. However, challenges such as exponential size of action space and dynamic changes limit the efficacy of proposed solutions. This paper propose a scalable and robust algorithm that can effectively solve MMDPs in real time. Simulation, pruning, and prediction are the three key components of the algorithm. The simulation component enables real time solutions by using a novel iterative pruning technique which in turn makes use of the prediction component trained with self play data. The algorithm is self-sustained as it generates new training data from simulation and gradually becomes better. Furthermore, we show empirical results demonstrating the capabilities of the algorithm and compare them with existing MMDP solvers.

Authors

Keywords

Cooperation Learning
Multi-agent Markov Decision Process
Transfer Learning
Graph Convolutional Network

Context

Venue: International Conference on Autonomous Agents and Multiagent Systems
Archive span: 2002-2025
Indexed papers: 7403
Paper id: 651189448553454125