Arrow Research search
Back to AAMAS

AAMAS 2018

Eligibility Traces for Options

Conference Paper Session 25: Learning and Adaptation 3 Autonomous Agents and Multiagent Systems

Abstract

Temporally extended actions not only represent knowledge in the hierarchical setup in reinforcement learning, they also improve exploration while reducing the complexity of choosing actions. The option framework provides a concrete way to implement and reason about temporal abstraction. This work attempts to test the utility of eligibility traces with options and find good ways of doing multi-step intra-option updates. Three algorithms, based on offpolicy methods - importance sampling, tree-backup and retrace, are proposed for using eligibility traces with options.

Authors

Keywords

  • Temporal abstraction
  • Option framework
  • Off-policy methods
  • Intraoption updates
  • Eligibility traces

Context

Venue
International Conference on Autonomous Agents and Multiagent Systems
Archive span
2002-2025
Indexed papers
7403
Paper id
870146504036005726