Distribution-Aware Algorithm Design with LLM Agents
Quick Answer
The study introduces a distribution-aware algorithm design using LLM code agents, achieving a mean normalized quality of 0.971 across 21 combinatorial optimization tasks.
Quick Take
The study introduces a distribution-aware algorithm design using LLM code agents, achieving a mean normalized quality of 0.971 across 21 combinatorial optimization tasks. The synthesized solvers outperform existing heuristics significantly, being up to 342.8 times faster than Gurobi while maintaining correctness across all tested instances.
Key Points
- Synthesized solvers improve by +0.224 over average heuristics and +0.098 over the best heuristic.
- Empirical results show solvers are 336.9x faster than the quality-best heuristic.
- The framework leverages reusable solver hints inferred from task distribution samples.
- All synthesized solvers validated on 100 PACE 2025 Dominating Set graphs.
- Performance gains stem from optimized computation tailored to specific distributions.
Paper Resources
Article Content
From source RSS / original summaryarXiv:2605. 14141v1 Announce Type: new Abstract: We study learning when the learned object is executable solver code rather than a predictor. In this setting, correctness is not enough: two solvers may both return valid solutions on the deployment distribution while differing substantially in runtime. Given samples from an unknown task distribution, the learner returns code evaluated on fresh instances by both solution quality and execution time.
Our central abstraction is a \emph{solver hint}: reusable structure inferred from samples and compiled into specialized solver code. We prove that the empirically fastest sample-consistent solver from a fixed library generalizes in both correctness and runtime, and that statistically identifiable hints can be recovered and compiled from polynomially many samples.
Empirically, we instantiate the framework with LLM code agents on \(21\) structured combinatorial-optimization target distributions across seven problem classes. The synthesized solvers reach mean normalized quality \(0. 971\), improve by \(+0. 224\) over the average heuristic pool and by \(+0. 098\) over the highest-quality heuristic, and are \(336. 9\times\), \(342. 8\times\), and \(16. 1\times\) faster than the quality-best heuristic, Gurobi, and the selected time-limited exact backend, respectively.
On released PACE 2025 Dominating Set private instances, the synthesized solver is valid on all \(100\) graphs and runs about two orders of magnitude faster than top competition solvers, with a moderate quality gap. Inspection shows that many gains come from changing the computational scale: replacing ambient exponential search or general-purpose optimization with compiled distribution-specific computation.
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.AI
See more →The Verification Horizon: No Silver Bullet for Coding Agent Rewards
As coding agents evolve, verifying solutions becomes more challenging than generating them, necessitating a focus on scalable, faithful, and robust verification methods. The study reveals that no fixed reward function can sustain effectiveness as model capabilities advance, emphasizing the need for verification to evolve alongside solution generation.