Online Multi-Task Learning for Policy Gradient Methods

Haitham Bou-Ammar; Eric Eaton; Paul Ruvolo; Matthew E. Taylor

Back to ICML

ICML 2014

Online Multi-Task Learning for Policy Gradient Methods

Conference Paper Cycle 2 Papers Artificial Intelligence · Machine Learning

Details

Abstract

Policy gradient algorithms have shown considerable recent success in solving high-dimensional sequential decision making tasks, particularly in robotics. However, these methods often require extensive experience in a domain to achieve high performance. To make agents more sample-efficient, we developed a multi-task policy gradient method to learn decision making tasks consecutively, transferring knowledge between tasks to accelerate learning. Our approach provides robust theoretical guarantees, and we show empirically that it dramatically accelerates learning on a variety of dynamical systems, including an application to quadrotor control.

Authors

Keywords

No keywords are indexed for this paper.

Context

Venue: International Conference on Machine Learning
Archive span: 1993-2025
Indexed papers: 16471
Paper id: 165019778673159717