Game playing on Simple Negotiation
[Chart: Win Rate over time. Top entry: Instruct Model, Win Rate 46.2 (Jun 30, 2025).]
Evaluation Results
Method                          Setting                     Date      Win Rate
Instruct Model                  Opponent=Gemini-2.0-Flash   2025.06   46.2
Simple Negotiation Specialist   Opponent=Gemini-2.0-Flash   2025.06   39.1
Multi-Game Model                Opponent=Gemini-2.0-Flash   2025.06   33.2
TicTacToe Specialist            Opponent=Gemini-2.0-Flash   2025.06   30.5
Kuhn Poker Specialist           Opponent=Gemini-2.0-Flash   2025.06   28.7
Base Model                      Opponent=Gemini-2.0-Flash   2025.06   15.6
Random Policy                   Opponent=Gemini-2.0-Flash   2025.06   8.2