Multi-Agent Game on KuhnPoker vs. NE Bot
[Chart: Normalized Score (First Move) and Normalized Score (Second Move) over time. Highlighted point: Strat-Reasoner-4B, Normalized Score (First Move) 94.04, May 6, 2026.]
Updated 27d ago
Evaluation Results
Method             Source category  Date     Normalized Score (First Move)  Normalized Score (Second Move)
Strat-Reasoner-4B  Open-s...        2026.05  94.04                          90.47
Gemini-2.5-flash   Closed...        2026.05  92.44                          87.79
GPT-5-mini         Closed...        2026.05  77.72                          86.56
MARSHAL-4B         Open-s...        2026.05  75.05                          73.94
Qwen3-32B          Open-s...        2026.05  74.95                          76.06
Qwen3-4B           Open-s...        2026.05  70.71                          70.15
Qwen3-8B           Open-s...        2026.05  68.26                          70.05
Gemma3-12B         Open-s...        2026.05  66.71                          65.01
SPIRAL-4B          Open-s...        2026.05  58.01                          69.38