Value-Aware Loss Function for Model Learning in Reinforcement Learning

Amir-massoud Farahmand; Andre Barreto; Daniel Nikovski

Back to EWRL

EWRL 2016

Value-Aware Loss Function for Model Learning in Reinforcement Learning

Workshop Paper Accepted Paper Artificial Intelligence · Machine Learning · Reinforcement Learning

PDF Details

Abstract

We consider the problem of estimating the transition probability kernel to be used by a model-based reinforcement learning (RL) algorithm. We argue that estimating a generative model that minimizes a probabilistic loss, such as the log-loss, might be an overkill because such a probabilistic loss does not take into account the underlying structure of the decision problem and the RL algorithm that intends to solve it. We introduce a loss function that takes the structure of the value function into account. We provide a finite-sample upper bound for the loss function showing the dependence of the error on model approximation error and the number of samples.

Authors

Keywords

No keywords are indexed for this paper.

Context

Venue: European Workshop on Reinforcement Learning
Archive span: 2008-2025
Indexed papers: 649
Paper id: 904536075545472684