Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

OR-Agent: Bridging Evolutionary Search and Structured Research for Automated Algorithm Discovery

About

Automating scientific discovery in complex, experiment-driven domains requires more than iterative mutation of programs; it demands structured hypothesis management, environment interaction, and principled reflection. We present OR-Agent, a configurable multi-agent research framework designed for automated exploration in rich experimental environments. OR-Agent organizes research as a structured tree-based workflow that explicitly models branching hypothesis generation and systematic backtracking, enabling controlled management of research trajectories beyond simple mutation-crossover loops. At its core, we introduce an evolutionary-systematic ideation mechanism that unifies evolutionary selection of research starting points, comprehensive research plan generation, and coordinated exploration within a research tree. We introduce a hierarchical optimization-inspired reflection system in which short-term reflections act as verbal gradients, long-term reflections as verbal momentum, and memory compression as semantic weight decay, collectively forming a principled mechanism for governing research dynamics. We conduct extensive experiments across classical combinatorial optimization benchmarks as well as simulation-based cooperative driving scenarios. Results demonstrate that OR-Agent outperforms strong evolutionary baselines while providing a general, extensible, and inspectable framework for AI-assisted scientific discovery. All code and experimental data are publicly available at https://github.com/qiliuchn/OR-Agent.

Qi Liu, Ruochen Hao, Can Li, Wanjing Ma• 2026

Related benchmarks

TaskDatasetResultRank
Bin Packing ProblemBPP-Offline-ACO
Normalized Score1
5
Capacitated Vehicle Routing ProblemCVRP-LEHD
Normalized Score1
5
Multi-dimensional Knapsack ProblemMKP-ACO
Normalized Score100
5
Traveling Salesman ProblemTSP-ACO
Normalized Score1
5
Traveling Salesman ProblemTSP-LEHD
Normalized Score1
5
Bin Packing ProblemBPP Online
Normalized Score94.8
5
Capacitated Vehicle Routing ProblemCVRP-ACO
Normalized Score0.68
5
Capacitated Vehicle Routing ProblemCVRP-POMO
Normalized Score98.6
5
DPP (Optimization Problem)DPP-GA
Normalized Score0.787
5
Traveling Salesman ProblemTSP-Constructive
Normalized Score0.959
5
Showing 10 of 15 rows

Other info

Follow for update