Beyond Value Functions: Single-Loop Bilevel Optimization under Flatness Conditions

Liuyuan Jiang; Quan Xiao; Lisha Chen; Tianyi Chen

Back to NeurIPS

NeurIPS 2025

Beyond Value Functions: Single-Loop Bilevel Optimization under Flatness Conditions

Conference Paper Main Conference Track Artificial Intelligence · Machine Learning

PDF Details

Abstract

Bilevel optimization, a hierarchical optimization paradigm, has gained significant attention in a wide range of practical applications, notably in the fine-tuning of generative models. However, due to the nested problem structure, most existing algorithms require either the Hessian vector calculation or the nested loop updates, which are computationally inefficient in large language model (LLM) fine-tuning. In this paper, building upon the fully first-order penalty-based approach, we propose an efficient value function-free (\textsf{PBGD-Free}) algorithm that eliminates the loop of solving the lower-level problem and admits fully single-loop updates. Inspired by the landscape analysis of representation learning-based LLM fine-tuning problem, we propose a relaxed flatness condition for the upper-level function and prove the convergence of the proposed value-function-free algorithm. We test the performance of the proposed algorithm in various applications and demonstrate its superior computational efficiency over the state-of-the-art bilevel methods.

Beyond Value Functions: Single-Loop Bilevel Optimization under Flatness Conditions

Abstract

Authors

Keywords

Context