
AAMAS 2023

Sequential Cooperative Multi-Agent Reinforcement Learning

Conference Paper · Session 2A: Multiagent Reinforcement Learning II · Autonomous Agents and Multiagent Systems

Abstract

Cooperative multi-agent reinforcement learning (MARL) aims to coordinate the actions of multiple agents via a shared team reward. The complex interactions among agents make this problem extremely difficult. Mainstream MARL methods often implicitly learn an inexplicable value decomposition from the shared reward into individual utilities, which gives no insight into how well each agent acts and offers no direct guidance for policy optimization. This paper presents a sequential MARL framework that factorizes the complex interaction analysis into a simpler sequential evaluation process for more effective and efficient learning. We formulate this factorization explicitly via a novel sequential advantage function that evaluates each agent's actions, which achieves an explicable credit assignment and substantially facilitates policy optimization. We realize this sequential credit assignment (SeCA) by dynamically adjusting the evaluation sequence in light of each agent's contribution to the team. Extensive experiments on a challenging set of StarCraft II micromanagement tasks verify SeCA's effectiveness.
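
The abstract does not spell out the sequential advantage formula, so the sketch below is only one plausible instantiation of the idea, not the paper's method: agents are evaluated one at a time along an ordering, each agent's credit is the change in a joint value when its actual action replaces a default action given the actions already fixed earlier in the sequence, and the ordering is re-sorted by past contributions. The names joint_q and sequential_advantages, the toy critic, and the contribution-sorted reordering are all illustrative assumptions.

```python
import numpy as np

N_AGENTS, N_ACTIONS = 4, 5
rng = np.random.default_rng(0)

# Toy stand-in for a learned centralized critic Q(s, a_1..a_n): fixed per-agent
# action qualities plus small pairwise interaction terms, so that an agent's
# marginal contribution genuinely depends on what the other agents do.
ACTION_QUALITY = rng.standard_normal((N_AGENTS, N_ACTIONS))
PAIR_BONUS = 0.3 * rng.standard_normal((N_AGENTS, N_AGENTS))

def joint_q(actions):
    """Toy joint action value; in SeCA this would be a trained critic."""
    base = sum(ACTION_QUALITY[i, a] for i, a in enumerate(actions))
    interaction = sum(PAIR_BONUS[i, j] * (actions[i] == actions[j])
                      for i in range(N_AGENTS) for j in range(i + 1, N_AGENTS))
    return base + interaction

def sequential_advantages(actions, order, default_action=0):
    """Evaluate agents one at a time along `order`.

    Agent i's credit is the change in the joint value when its actual action
    replaces a default action, with agents earlier in the sequence already
    fixed to their actual actions -- a marginal-contribution reading of the
    sequential advantage, not necessarily the paper's exact formulation.
    """
    current = [default_action] * N_AGENTS
    prev_q = joint_q(current)
    credit = {}
    for i in order:
        current[i] = actions[i]          # commit agent i's actual action
        new_q = joint_q(current)
        credit[i] = new_q - prev_q       # its marginal contribution
        prev_q = new_q
    return credit

# Dynamically adjust the sequence: agents with larger contributions on the
# previous sweep are evaluated first on the next one (one plausible reading
# of "adjusting the sequence in light of agents' contributions").
actions = [int(rng.integers(N_ACTIONS)) for _ in range(N_AGENTS)]
order = list(range(N_AGENTS))
for sweep in range(3):
    credit = sequential_advantages(actions, order)
    rounded = {i: round(c, 3) for i, c in credit.items()}
    print(f"sweep {sweep}: order={order} credit={rounded}")
    order = sorted(credit, key=credit.get, reverse=True)
```

Whatever ordering is chosen, the per-agent credits telescope: they sum exactly to Q(actual joint action) minus Q(all-default baseline), so the decomposition loses nothing while still attributing an explicit share of the team value to each agent.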

Authors

Keywords

  • Cooperative Multi-Agent Reinforcement Learning
  • Sequential Credit Assignment
  • Sequential Evaluation

Context

Venue
International Conference on Autonomous Agents and Multiagent Systems