SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence
About
The rapid progress of Large Language Models has advanced agentic systems in decision-making, coordination, and task execution. Yet, existing agentic system generation frameworks lack full autonomy, missing from-scratch agent generation, self-optimizing agent functionality, and collaboration, limiting adaptability and scalability. We propose SwarmAgentic, a framework for fully automated agentic system generation that constructs agentic systems from scratch and jointly optimizes agent functionality and collaboration as interdependent components through language-driven exploration. To enable efficient search over system-level structures, SwarmAgentic maintains a population of candidate systems and evolves them via feedback-guided updates, drawing inspiration from Particle Swarm Optimization (PSO). We evaluate our method on six real-world, open-ended, and exploratory tasks involving high-level planning, system-level coordination, and creative reasoning. Given only a task description and an objective function, SwarmAgentic outperforms all baselines, achieving a +261.8% relative improvement over ADAS on the TravelPlanner benchmark, highlighting the effectiveness of full automation in structurally unconstrained tasks. This framework marks a significant step toward scalable and autonomous agentic system design, bridging swarm intelligence with fully automated system multi-agent generation. Our code is publicly released at https://yaoz720.github.io/SwarmAgentic/.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Mathematical Reasoning | AIME 2024 | Pass@1 Accuracy46.67 | 165 | |
| Mathematical Reasoning | AIME 2025 | Pass@1 Accuracy40 | 118 | |
| Embodied Task | AlfWorld | -- | 96 | |
| Long horizon planning | TravelPlanner | Delivery Rate100 | 13 | |
| General Assistant | GAIA | Pass@1 (L1)28.3 | 13 | |
| Planning | TravelPlanner | Pass@159.14 | 12 | |
| Multi-hop Question Answering | 2WikiMultihopQA | Pass@180 | 12 | |
| Multi-hop Question Answering | HotpotQA | Pass@156 | 12 | |
| Web Interaction | Webshop | Pass@134.12 | 12 |