Stable Dual Dynamic Programming

Tao Wang; Michael Bowling; Dale Schuurmans; Daniel Lizotte

Back to NeurIPS

NeurIPS 2007

Stable Dual Dynamic Programming

Conference Paper Artificial Intelligence · Machine Learning

PDF Details

Abstract

Recently, we have introduced a novel approach to dynamic programming and re- inforcement learning that is based on maintaining explicit representations of sta- tionary distributions instead of value functions. In this paper, we investigate the convergence properties of these dual algorithms both theoretically and empirically, and show how they can be scaled up by incorporating function approximation.

Authors

Keywords

No keywords are indexed for this paper.

Context

Venue: Annual Conference on Neural Information Processing Systems
Archive span: 1987-2025
Indexed papers: 30776
Paper id: 502945964157063347