DISC: Dynamic Decomposition Improves LLM Inference Scaling

Jonathan Li; Wei Cheng; Benjamin Riviere; Yue Wu; Masafumi Oyamada; Mengdi Wang; Yisong Yue; Santiago Paternain; Haifeng Chen

NeurIPS 2025

Conference Paper · Main Conference Track · Artificial Intelligence · Machine Learning

Abstract

Inference scaling methods for LLMs often rely on decomposing problems into steps (or groups of tokens), then sampling candidate next steps and selecting the best one. However, these steps and their sizes are often predetermined or manually designed based on domain knowledge. We propose dynamic decomposition, a method that adaptively and automatically partitions solution and reasoning traces into manageable steps during inference. By allocating compute more effectively, particularly by subdividing challenging steps and prioritizing their sampling, dynamic decomposition significantly improves inference efficiency. Experiments on benchmarks such as APPS, MATH, and LiveCodeBench demonstrate that dynamic decomposition outperforms static approaches, including token-level, sentence-level, and single-step decompositions, reducing the pass@10 error rate by 5.0%, 6.7%, and 10.5%, respectively. These findings highlight the potential of dynamic decomposition to improve a wide range of inference scaling techniques.
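The abstract describes the idea only at a high level, so the following is a minimal toy sketch of adaptive step sizing, not the paper's algorithm. Everything in it is an assumption made for illustration: the names `dynamic_decompose`, `toy_sampler`, and `toy_score`, the halving and doubling rule, the acceptance test, and the character-level toy task standing in for an LLM and a reward model. It shows one way a search could spend more samples on hard regions by shrinking the step whenever no sampled candidate is good enough.

```python
import random
from typing import Callable, List, Tuple

# Toy task (hypothetical): reproduce TARGET; positions in HARD are error-prone.
TARGET = "abcdefghijklmnopqrstuvwx"
HARD = set(range(8, 12))


def toy_sampler(prefix: str, size: int, rng: random.Random) -> str:
    """Stand-in for an LLM: emit `size` characters, each correct with some probability."""
    out = []
    for i in range(len(prefix), len(prefix) + size):
        p_correct = 0.3 if i in HARD else 0.95
        out.append(TARGET[i] if rng.random() < p_correct else "?")
    return "".join(out)


def toy_score(text: str) -> float:
    """Stand-in for a reward model: negative count of mismatched characters."""
    return -float(sum(1 for a, b in zip(text, TARGET) if a != b))


def dynamic_decompose(
    sample_step: Callable[[str, int, random.Random], str],
    score: Callable[[str], float],
    target_len: int,
    budget: int,
    init_step: int = 8,
    n_samples: int = 4,
    seed: int = 0,
) -> Tuple[str, List[int]]:
    """Build a solution step by step, halving the step size when a step looks hard.

    Returns the final text and the list of accepted step sizes.
    """
    rng = random.Random(seed)
    prefix, step, sizes = "", init_step, []
    best = score(prefix)
    while len(prefix) < target_len and budget > 0:
        size = min(step, target_len - len(prefix))
        k = min(n_samples, budget)
        candidates = [prefix + sample_step(prefix, size, rng) for _ in range(k)]
        budget -= k
        top = max(candidates, key=score)
        if score(top) >= best or size == 1:
            # Easy step (or already minimal): accept it and try a larger step next.
            prefix, best = top, score(top)
            sizes.append(size)
            step = min(init_step, size * 2)
        else:
            # Hard step: subdivide it, so more samples are spent on this region.
            step = max(1, size // 2)
    return prefix, sizes


if __name__ == "__main__":
    text, sizes = dynamic_decompose(toy_sampler, toy_score, len(TARGET), budget=10_000)
    print(text, sizes)
```

In this toy setting, accepted step sizes tend to shrink around the error-prone positions and grow again afterwards, which mirrors the abstract's description of subdividing challenging steps and prioritizing their sampling. A real system would replace `toy_sampler` with model sampling and `toy_score` with a task-specific reward or verifier.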
