Code Generation on LiveCode Bench (Performance % and Total Tokens)
[Chart: Total Tokens and Performance over time. Highlighted point: AgentFmw, Total Tokens 1.37, Apr 1, 2026.]
Updated 16d ago
Evaluation Results
| Method | Backbone | Date | Total Tokens | Performance |
| --- | --- | --- | --- | --- |
| AgentFmw | GPT-OSS:120B | 2026.04 | 1.37 | - |
| TopoDIM | GPT-OSS:120B | 2026.04 | 1.39 | - |
| G-Designer | GPT-OSS:120B | 2026.04 | 2.63 | - |
| LongGraph | GPT-OSS:120B | 2026.04 | 2.93 | - |
| GTD | GPT-OSS:120B | 2026.04 | 3.12 | - |
| AutoGen | GPT-OSS:120B | 2026.04 | 3.42 | - |
| GPTSwarm | GPT-OSS:120B | 2026.04 | 4.9 | - |
| LLM-Debate | GPT-OSS:120B | 2026.04 | 8.81 | - |
| Agent Q-Mix | GPT-OSS:120B | 2026.04 | 312 | - |
| Lobster | GPT-OSS:120B | 2026.04 | 827 | - |