Exploration Potential

Jan Leike

Back to EWRL

EWRL 2016

Exploration Potential

Workshop Paper Accepted Paper Artificial Intelligence · Machine Learning · Reinforcement Learning

PDF Details

Abstract

We introduce exploration potential, a quantity that measures how much a reinforcement learning agent has explored its environment class. In contrast to information gain, exploration potential takes the problem’s reward structure into account. This leads to an exploration criterion that is both necessary and sufficient for asymptotic optimality (learning to act optimally across the entire environment class). Our experiments in multi-armed bandits use exploration potential to illustrate how different algorithms make the tradeoff between exploration and exploitation.

Authors

Jan Leike

Keywords

No keywords are indexed for this paper.

Context

Venue: European Workshop on Reinforcement Learning
Archive span: 2008-2025
Indexed papers: 649
Paper id: 58879771857147702