Auto-Exploratory Average Reward Reinforcement Learning

DoKyeong Ok

Back to AAAI

AAAI 1996

Auto-Exploratory Average Reward Reinforcement Learning

Conference Paper Reinforcement Learning Artificial Intelligence

PDF Details

Abstract

We introduce a model-based average reward Reinforcement Learning method called H-learning and compare it with its discounted counterpart, Adaptive Real-Time Dynamic Programming, in a simulated robot scheduling task. We also introduce an extension to H-learning, which automatically explores the unexplored parts of the state space, while always choosing greedy actions with respect to the current value function. We show that this “Auto-exploratory H-learning” performs better than the original H-learning under previously studied exploration methods such as random, recency-based, or counter-based exploration.

Authors

DoKyeong Ok Prasad Tadepalli

Keywords

No keywords are indexed for this paper.

Context

Venue: AAAI Conference on Artificial Intelligence
Archive span: 1980-2026
Indexed papers: 28718
Paper id: 593897426517572926