Cross-validation Confidence Intervals for Test Error

Pierre Bayle; Alexandre Bayle; Lucas Janson; Lester Mackey

Back to NeurIPS

NeurIPS 2020

Cross-validation Confidence Intervals for Test Error

Conference Paper Artificial Intelligence · Machine Learning

PDF Details

Abstract

This work develops central limit theorems for cross-validation and consistent estimators of the asymptotic variance under weak stability conditions on the learning algorithm. Together, these results provide practical, asymptotically-exact confidence intervals for k-fold test error and valid, powerful hypothesis tests of whether one learning algorithm has smaller k-fold test error than another. These results are also the first of their kind for the popular choice of leave-one-out cross-validation. In our experiments with diverse learning algorithms, the resulting intervals and tests outperform the most popular alternative methods from the literature.

Authors

Keywords

No keywords are indexed for this paper.

Context

Venue: Annual Conference on Neural Information Processing Systems
Archive span: 1987-2025
Indexed papers: 30776
Paper id: 686527571191919225