Cross-Validated Off-Policy Evaluation

Matej Cief; Branislav Kveton; Michal Kompan

doi:10.1609/aaai.v39i15.33765

AAAI 2025

Conference Paper; AAAI Technical Track on Machine Learning I; Artificial Intelligence

Abstract

We study estimator selection and hyper-parameter tuning in off-policy evaluation. Although cross-validation is the most popular method for model selection in supervised learning, off-policy evaluation relies mostly on theory, which provides only limited guidance to practitioners. We show how to use cross-validation for off-policy evaluation. This challenges a popular belief that cross-validation in off-policy evaluation is not feasible. We evaluate our method empirically and show that it addresses a variety of use cases.
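
The abstract does not spell out the procedure, so what follows is only a minimal sketch of one way cross-validation might be used for estimator selection in off-policy evaluation. It is not the authors' method. It assumes a context-free bandit log of (action, reward, logging propensity) tuples, treats the clipping threshold of an inverse propensity scoring (IPS) estimator as the hyper-parameter to tune, and uses unclipped IPS on the held-out half as an unbiased but noisy stand-in for the unknown policy value. All names (`ips`, `cv_select`, `simulate`) and the toy problem are hypothetical.

```python
import random


def ips(data, target, clip=float("inf")):
    """IPS estimate of the target policy's value, with weights clipped at `clip`."""
    total = 0.0
    for action, reward, propensity in data:
        weight = min(target[action] / propensity, clip)
        total += weight * reward
    return total / len(data)


def cv_select(data, target, clips, n_splits=20, seed=0):
    """Pick the clipping threshold with the lowest cross-validated squared error.

    Assumption: unclipped IPS on the validation half serves as the reference
    value, since the true value of the target policy is not observable.
    """
    rng = random.Random(seed)
    errors = {c: 0.0 for c in clips}
    for _ in range(n_splits):
        shuffled = data[:]
        rng.shuffle(shuffled)
        half = len(shuffled) // 2
        train, valid = shuffled[:half], shuffled[half:]
        reference = ips(valid, target)  # unbiased, high-variance proxy
        for c in clips:
            errors[c] += (ips(train, target, c) - reference) ** 2
    best = min(clips, key=lambda c: errors[c])
    return best, errors


def simulate(n, logging, mean_reward, seed=0):
    """Generate a toy log: actions drawn from `logging`, Bernoulli rewards."""
    rng = random.Random(seed)
    actions = list(range(len(logging)))
    data = []
    for _ in range(n):
        a = rng.choices(actions, weights=logging)[0]
        r = 1.0 if rng.random() < mean_reward[a] else 0.0
        data.append((a, r, logging[a]))
    return data


logging = [0.7, 0.2, 0.1]      # logging policy (hypothetical)
target = [0.1, 0.2, 0.7]       # policy to evaluate (hypothetical)
mean_reward = [0.2, 0.5, 0.8]  # per-action reward probabilities (hypothetical)
data = simulate(2000, logging, mean_reward)
clips = [1.0, 2.0, 5.0, float("inf")]
best_clip, cv_errors = cv_select(data, target, clips)
print(best_clip, cv_errors)
```

The same loop could in principle compare different estimator families rather than clipping thresholds. The 50/50 split and the choice of reference estimator are arbitrary here and would need care in practice, because the reference itself is noisy.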

Context

Venue: AAAI Conference on Artificial Intelligence
Archive span: 1980-2026
Indexed papers: 28718
Paper id: 22542548474958213