Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

OpenSage: Self-programming Agent Generation Engine

About

Agent development kits (ADKs) provide effective platforms and tooling for constructing agents, and their designs are critical to the constructed agents' performance, especially the functionality for agent topology, tools, and memory. However, current ADKs either lack sufficient functional support or rely on humans to manually design these components, limiting agents' generalizability and overall performance. We propose OpenSage, the first ADK that enables LLMs to automatically create agents with self-generated topology and toolsets while providing comprehensive and structured memory support. OpenSage offers effective functionality for agents to create and manage their own sub-agents and toolkits. It also features a hierarchical, graph-based memory system for efficient management and a specialized toolkit tailored to software engineering tasks. Extensive experiments across three state-of-the-art benchmarks with various backbone models demonstrate the advantages of OpenSage over existing ADKs. We also conduct rigorous ablation studies to demonstrate the effectiveness of our design for each component. We believe OpenSage can pave the way for the next generation of agent development, shifting the focus from human-centered to AI-centered paradigms.

Hongwei Li, Zhun Wang, Qinrun Dai, Yuzhou Nie, Jinjun Peng, Ruitong Liu, Jingyang Zhang, Kaijie Zhu, Jingxuan He, Lun Wang, Yangruibo Ding, Yueqi Chen, Wenbo Guo, Dawn Song• 2026

Related benchmarks

TaskDatasetResultRank
Question AnsweringLocomo
Single Hop F163.21
22
Terminal-based task executionTerminal-bench 2.0
Resolved %65.2
5
Security AnalysisCyberGym
Resolved Percentage60.2
4
Software EngineeringSWE-Bench Python subset Pro
Resolution Rate59
3
Showing 4 of 4 rows

Other info

Follow for update