ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search

Shangtong Zhang; Hengshuai Yao

Back to AAAI

AAAI 2019

ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search

Conference Paper AAAI Technical Track: Machine Learning Artificial Intelligence

PDF Details

Abstract

In this paper, we propose an actor ensemble algorithm, named ACE, for continuous control with a deterministic policy in reinforcement learning. In ACE, we use actor ensemble (i. e. , multiple actors) to search the global maxima of the critic. Besides the ensemble perspective, we also formulate ACE in the option framework by extending the option-critic architecture with deterministic intra-option policies, revealing a relationship between ensemble and options. Furthermore, we perform a look-ahead tree search with those actors and a learned value prediction model, resulting in a refined value estimation. We demonstrate a significant performance boost of ACE over DDPG and its variants in challenging physical robot simulators.

Authors

Keywords

No keywords are indexed for this paper.

Context

Venue: AAAI Conference on Artificial Intelligence
Archive span: 1980-2026
Indexed papers: 28718
Paper id: 534024186110074211