Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Relation Reasoning with LLMs in Expensive Optimization

About

Expensive optimization problems (EOPs) are black-box tasks with costly objective evaluations and no gradient access, making the evaluation budget the key bottleneck. Surrogate-assisted evolutionary algorithms (SAEAs) reduce evaluations via surrogate predictions, but conventional surrogates often require frequent retraining as populations evolve, incurring overhead. This paper proposes R2SAEA, a reinforcement-trained relation-based large language model (LLM) surrogate assisted evolutionary algorithm. We cast relation-based surrogate modeling as an in-context pairwise reasoning task. To enable efficient inference in evolutionary loops, we develop an anchor-based iterative context construction strategy that reduces prompt complexity from quadratic to linear in population size, and a voting-based aggregation scheme that converts predicted relations into scores for offspring selection. We further build an RL pipeline from evolutionary trajectories and fine-tune Qwen2.5 with GRPO. Experiments on single- and multi-objective benchmarks show improved relation prediction and state-of-the-art optimization performance over strong SAEA baselines and general LLMs. Quantization also enables efficient edge deployment, supporting a zero-shot surrogate paradigm without per-generation retraining. Code and models are available at https://github.com/Septend9/R2SAEA.

Ye Lu, Bingdong Li, Aimin Zhou, Hao Hao• 2026

Related benchmarks

TaskDatasetResultRank
Multi-Objective OptimizationDTLZ5
IGD0.0076
63
Multi-Objective OptimizationDTLZ7
IGD0.8391
57
Multi-Objective OptimizationDTLZ2
IGD0.0954
48
Multi-Objective OptimizationDTLZ3
IGD0.3122
48
Multi-Objective OptimizationDTLZ6
IGD0.0074
48
Multi-Objective OptimizationDTLZ1
IGD0.1328
48
Multi-Objective OptimizationDTLZ4
IGD0.3786
48
Single Objective OptimizationLZG and YLL suites D=5, 10, 20 Combined (test)
Mean Rank2.69
7
Single Objective OptimizationLZG01-04 D=5 (test)
Mean Objective Value0.3462
7
Showing 9 of 9 rows

Other info

Follow for update