Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Collaborative software engineering on MultiAgentBench Coding Graph
Loading...
57.41
Task Performance
QWEN
50.962
52.636
54.31
55.984
Apr 21, 2026
Task Performance
Coordination
Updated 1mo ago
Evaluation Results
Method
Method
Links
Task Performance
Coordination
QWEN
Trait Source=none
2026.04
57.41
74.29
ETI
Agent=QWEN, Trait Sour...
2026.04
56.82
86.46
ETI
Agent=QWEN, Trait Sour...
2026.04
56.44
84.43
ETI
Agent=GPT, Trait Sourc...
2026.04
53.31
73.52
ETI
Agent=GPT, Trait Sourc...
2026.04
52.84
74.38
GPT
Trait Source=none
2026.04
51.21
57.01
Feedback
Search any
task
Search any
task