| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| CTF-3 | Environment-Grounded Multi-Agent Workflow | Success Count5 | 8 | 24d ago | |
| CTF-2 | Environment-Grounded Multi-Agent Workflow | Success Count5 | 8 | 24d ago | |
| CTF 1 | Environment-Grounded Multi-Agent Workflow | Success Count5 | 8 | 24d ago | |
| CTF-0 1.0 (test) | Environment-Grounded Multi-Agent Workflow | Success Count5 | 8 | 24d ago |