Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TodoEvolve: Learning to Architect Agent Planning Systems

About

Planning has become a central capability for contemporary agent systems in navigating complex, long-horizon tasks, yet existing approaches predominantly rely on fixed, hand-crafted planning structures that lack the flexibility to adapt to the structural diversity of open-ended problems. To address this limitation, we introduce TodoEvolve, a meta-planning paradigm that autonomously synthesizes and dynamically revises task-specific planning architectures. Specifically, we first construct PlanFactory, a modular design space that standardizes diverse planning paradigms within a unified codebase encompassing topology, initialization, adaptation, and navigation, thereby providing a common interface for heterogeneous planning patterns. Leveraging PlanFactory, we collect high-quality planning trajectories and train Todo-14B via \textit{Impedance-Guided Preference Optimization} (IGPO), a multi-objective reinforcement learning objective that encourages the generation of planning systems that are performant, stable, and token-efficient across arbitrary tasks and agent backbones. Empirical evaluations on five agentic benchmarks demonstrate that TodoEvolve consistently surpasses carefully engineered planning modules while maintaining economical API costs and runtime overhead.

Jiaxi Liu, Yanzuo Jiang, Guibin Zhang, Zihan Zhang, Heng Chang, Zhenfei Yin, Qibing Ren, Junchi Yan• 2026

Related benchmarks

TaskDatasetResultRank
General AI Assistant TasksGAIA
Avg Performance72.12
54
Data Science Agent tasksxBench-DS
Pass@10.75
13
Web Navigation Question AnsweringWebWalker QA
Accuracy70
13
Web-based Agent QAWebWalkerQA
Pass@173.53
13
Agent Planning and ExecutionTaskCraft
pass@10.7267
12
General AI Assistant TasksGAIA level2 Text-only
Accuracy57.14
8
Information retrieval and Question AnsweringDeepSearch-QA
Accuracy42
8
Showing 7 of 7 rows

Other info

GitHub

Follow for update