Continuous strategy replicator dynamics for multi-agent Q-learning

Aram Galstyan

doi:10.1007/s10458-011-9181-6

Back to JAAMAS

JAAMAS 2011

Continuous strategy replicator dynamics for multi-agent Q-learning

Journal Article OriginalPaper Artificial Intelligence · Multi-Agent Systems

Details DOI

Abstract

Abstract The problem of multi-agent learning and adaptation has attracted a great deal of attention in recent years. It has been suggested that the dynamics of multi agent learning can be studied using replicator equations from population biology. Most existing studies so far have been limited to discrete strategy spaces with a small number of available actions. In many cases, however, the choices available to agents are better characterized by continuous spectra. This paper suggests a generalization of the replicator framework that allows to study the adaptive dynamics of Q -learning agents with continuous strategy spaces. Instead of probability vectors, agents’ strategies are now characterized by probability measures over continuous variables. As a result, the ordinary differential equations for the discrete case are replaced by a system of coupled integral-differential replicator equations that describe the mutual evolution of individual agent strategies. We derive a set of functional equations describing the steady state of the replicator dynamics, examine their solutions for several two-player games, and confirm our analytical results using simulations.

Authors

Aram Galstyan USC Information Sciences Institute

Keywords

Multi-agent reinforcement learning
Replicator dynamics
Continuous strategies

Context

Venue: Autonomous Agents and Multi-Agent Systems
Archive span: 2005-2026
Indexed papers: 940
Paper id: 622388993986174030