Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MermaidFlow: Redefining Agentic Workflow Generation via Safety-Constrained Evolutionary Programming

About

Despite the promise of autonomous agentic reasoning, existing workflow generation methods frequently produce fragile, unexecutable plans due to unconstrained LLM-driven construction. We introduce MermaidFlow, a framework that redefines the agentic search space through safety-constrained graph evolution. At its core, MermaidFlow represent workflows as a verifiable intermediate representation using Mermaid, a structured and human-interpretable graph language. We formulate domain-aware evolutionary operators, i.e., crossover, mutation, insertion, and deletion, to preserve semantic correctness while promoting structural diversity, enabling efficient exploration of a high-quality, statically verifiable workflow space. Without modifying task settings or evaluation protocols, MermaidFlow achieves consistent improvements in success rates and faster convergence to executable plans on the agent reasoning benchmark. The experimental results demonstrate that safety-constrained graph evolution offers a scalable, modular foundation for robust and interpretable agentic reasoning systems.

Chengqi Zheng, Jianda Chen, Yueming Lyu, Wen Zheng Terence Ng, Haopeng Zhang, Yew-Soon Ong, Ivor Tsang, Haiyan Yin• 2025

Related benchmarks

TaskDatasetResultRank
Tool LearningRestBench TMDB
Success Rate47
50
Tool UseToolHop
Answer Correctness47.39
18
Search-based Question AnsweringSearchQA
Hotpot Score32.67
18
API UseAPI-Bank
Success Rate57.02
18
Showing 4 of 4 rows

Other info

Follow for update