AAMAS 2021
Deep Interactive Bayesian Reinforcement Learning via Meta-Learning
Abstract
Agents that interact with other agents often do not know a priori what the other agents' strategies are, but have to maximise their own online return while interacting with and learning about others. The optimal adaptive behaviour under uncertainty over the other agents' strategies w.r.t. some prior can in principle be computed using the Interactive Bayesian Reinforcement Learning framework. Unfortunately, doing so is intractable in most settings, and existing approximation methods are restricted to small tasks. To overcome this, we propose to meta-learn (alongside the policy) approximate belief inference by combining sequential and hierarchical VAEs. We show empirically that our approach can learn a factorised belief model that separates the other agent's permanent and temporal structure, and outperforms methods that sample from the approximate posterior or do not have this hierarchical structure. A full version of this work can be found in Zintgraf et al. [30].
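The core idea of the factorised belief model can be illustrated with a toy sketch: one "permanent" latent is inferred once per interaction (the other agent's underlying strategy, refined as evidence accumulates) while a "temporal" latent is inferred per timestep, conditioned on the permanent one. This is an illustrative stdlib-only analogue, not the paper's actual VAE architecture; all class and function names (`HierarchicalBelief`, `reparameterise`, the linear update rules) are hypothetical simplifications.

```python
import math
import random

random.seed(0)


def reparameterise(mu, log_var):
    # Reparameterisation trick: z = mu + sigma * eps, eps ~ N(0, 1).
    return [m + math.exp(0.5 * lv) * random.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]


class HierarchicalBelief:
    """Toy factorised belief: a 'permanent' latent m aggregated over the
    whole interaction, and a 'temporal' latent z_t inferred each step
    conditioned on m. (Hypothetical stand-in for the learned VAE.)"""

    def __init__(self, latent_dim=2):
        self.latent_dim = latent_dim
        self.obs_sum = [0.0] * latent_dim  # running statistics for m
        self.obs_count = 0

    def update_permanent(self, obs):
        # q(m | history): mean of all observations so far; the posterior
        # variance shrinks as more of the interaction is observed.
        for i, o in enumerate(obs):
            self.obs_sum[i] += o
        self.obs_count += 1
        mu = [s / self.obs_count for s in self.obs_sum]
        log_var = [-math.log(1 + self.obs_count)] * self.latent_dim
        return mu, log_var

    def temporal_posterior(self, m_sample, obs):
        # q(z_t | m, o_t): temporal latent conditioned on the permanent one.
        mu = [0.5 * m + 0.5 * o for m, o in zip(m_sample, obs)]
        log_var = [0.0] * self.latent_dim
        return mu, log_var


belief = HierarchicalBelief()
trajectory = [[1.0, -1.0], [0.8, -1.2], [1.2, -0.8]]
for obs in trajectory:
    mu_m, lv_m = belief.update_permanent(obs)
    m = reparameterise(mu_m, lv_m)          # sample permanent latent
    mu_z, lv_z = belief.temporal_posterior(m, obs)
    z = reparameterise(mu_z, lv_z)          # sample temporal latent

print(round(mu_m[0], 2))  # permanent-latent mean converges toward 1.0
```

The separation mirrors the abstract's claim: the permanent latent captures structure that is stable across the interaction, while the temporal latent tracks step-to-step variation; in the paper both posteriors are meta-learned jointly with the policy rather than hand-coded as here.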
Context
- Venue: International Conference on Autonomous Agents and Multiagent Systems
- Archive span: 2002-2025
- Indexed papers: 7403
- Paper id: 400229957138164699