
NeurIPS 2025

A Bayesian Fast-Slow Framework to Mitigate Interference in Non-Stationary Reinforcement Learning

Conference Paper · Main Conference Track · Artificial Intelligence · Machine Learning

Abstract

Given the ever-changing nature of the world and its inhabitants, agents must possess the ability to adapt and evolve over time. Recent research on non-stationary MDPs has focused on addressing this challenge, providing algorithms inspired by task-inference techniques. However, these methods ignore the detrimental effects of interference, which particularly harms performance on contradictory tasks and leads to low efficiency in some environments. To address this issue, we propose a Bayesian Fast-Slow Framework (BFSF) that tackles both cross-task generalization and resistance to cross-task interference. Our framework consists of two components: a 'fast' policy, learned from recent data, and a 'slow' policy, learned through meta-reinforcement learning (meta-RL) using data from all previous tasks. A Bayesian estimation mechanism determines the current choice of 'fast' or 'slow' policy, balancing exploration and exploitation. Additionally, in the 'fast' policy, we introduce a dual-reset mechanism and a data relabeling technique to further accelerate convergence when encountering new tasks. Experiments demonstrate that our algorithm effectively mitigates interference and outperforms baseline approaches.
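The abstract does not detail the Bayesian estimation mechanism that arbitrates between the 'fast' and 'slow' policies. As a rough illustration only (not the authors' method), one standard way to make such a choice while balancing exploration and exploitation is Thompson sampling over Beta posteriors of each policy's success rate; all class and variable names below are hypothetical.

```python
import random


class BayesianPolicySelector:
    """Illustrative Thompson-sampling arbiter between two policies.

    Maintains a Beta posterior over each policy's probability of
    success and, at each step, samples from both posteriors and acts
    with the policy whose sample is larger. Uncertain policies are
    thus explored; reliably better ones are exploited.
    """

    def __init__(self):
        # Beta(1, 1) uniform priors: {policy: [alpha, beta]}.
        self.posteriors = {"fast": [1, 1], "slow": [1, 1]}

    def select(self):
        # One posterior draw per policy; pick the higher draw.
        draws = {name: random.betavariate(a, b)
                 for name, (a, b) in self.posteriors.items()}
        return max(draws, key=draws.get)

    def update(self, name, success):
        # Conjugate Beta update: bump alpha on success, beta on failure.
        self.posteriors[name][0 if success else 1] += 1
```

In a sketch like this, the selector would gradually concentrate on whichever policy performs better in the current task, while a change in which policy succeeds (e.g. after a task switch) shifts the posteriors and the selection back.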

Authors

Keywords

No keywords are indexed for this paper.

Context

Venue
Annual Conference on Neural Information Processing Systems
Archive span
1987-2025
Indexed papers
30776
Paper id
777529241317096454