
NeurIPS 2025

Corrector Sampling in Language Models

Conference Paper · Main Conference Track · Artificial Intelligence · Machine Learning

Abstract

Autoregressive language models accumulate errors due to their fixed, irrevocable left-to-right token generation. To address this, we propose a new sampling method called Resample-Previous-Tokens (RPT). RPT mitigates error accumulation by iteratively revisiting and potentially replacing tokens in a window of previously generated text. Fine-tuning a pretrained 8B-parameter model with RPT for only 100B tokens yielded ~10% relative improvements on reasoning and coding benchmarks compared to standard sampling.
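The abstract describes the sampling loop only at a high level. The toy Python below sketches the general shape of such a corrector loop: emit a token, then revisit a trailing window of previously generated tokens and potentially replace them. All names here (sample_next, rpt_generate, window, revisit_prob, toy_model) are illustrative, not from the paper, and resampling each revisited token from its left context alone is a simplification: the actual method fine-tunes the model so it can propose replacements for previous tokens, which this sketch does not capture.

import random

def sample_next(model, prefix):
    # `model(prefix)` is assumed to return a next-token
    # distribution as a dict {token: probability}.
    dist = model(prefix)
    tokens, probs = zip(*dist.items())
    return random.choices(tokens, weights=probs, k=1)[0]

def rpt_generate(model, prompt, max_new_tokens, window=4, revisit_prob=0.5):
    # Toy corrector loop in the spirit of RPT (our reading of the
    # abstract, not the paper's algorithm): after appending each new
    # token, revisit positions in a trailing window of generated text
    # and potentially resample them. Here the resample conditions only
    # on the left context; the paper's fine-tuned model is what makes
    # principled replacement of previous tokens possible.
    seq = list(prompt)
    for _ in range(max_new_tokens):
        seq.append(sample_next(model, seq))
        # Revisit up to `window` previously generated tokens,
        # never touching the prompt itself.
        start = max(len(prompt), len(seq) - 1 - window)
        for i in range(start, len(seq) - 1):
            if random.random() < revisit_prob:
                seq[i] = sample_next(model, seq[:i])  # potentially replace
    return seq

# Usage with a stand-in "model": a fixed distribution over a tiny vocabulary.
toy_model = lambda prefix: {"a": 0.5, "b": 0.3, "c": 0.2}
random.seed(0)
print("".join(rpt_generate(toy_model, list("ab"), max_new_tokens=8)))

The window size and revisit probability above are placeholder constants; in the paper these choices, and how replacements are scored, follow from the fine-tuned model rather than from fixed hyperparameters of a sampling wrapper.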


Context

Venue
Annual Conference on Neural Information Processing Systems