Share your thoughts, 1 month free Claude Pro on usSee more

Agent Planning and Execution on TaskCraft

0.7533pass@1

Agent KB

Updated 4mo ago

Evaluation Results

Method	Links
Agent KB 2026.02		0.7533
Agent KB 2026.02		0.7267
TodoEvolve + Smolagents 2026.02		0.7267
TodoEvolve + Smolagents 2026.02		0.7133
Flash-Searcher 2026.02		0.6967
Flash-Searcher 2026.02		0.6933
TodoEvolve + Smolagents 2026.02		0.6933
Cognitive Kernel-Pro 2026.02		0.66
Smolagents 2026.02		0.64
Agent KB 2026.02		0.6167
OWL Workforce 2026.02		0.5833
Flash-Searcher 2026.02		0.58