Opinion Recovery on Board Game Playtesting Dataset
[Chart: Op-Rec by date. Top result: MeepleLM, 69.77 Op-Rec, Jan 12, 2026]
Evaluation Results
| Method | Ablation | Date | Op-Rec |
| --- | --- | --- | --- |
| MeepleLM | | 2026.01 | 69.77 |
| GPT-5.1 | | 2026.01 | 63.44 |
| Gemini3-Pro | | 2026.01 | 57.74 |
| MeepleLM | w/o MDA | 2026.01 | 55.35 |
| Qwen3-235B | | 2026.01 | 54.27 |
| MeepleLM | w/o Persona | 2026.01 | 53.84 |
| Qwen3-8B | | 2026.01 | 11.39 |
| MeepleLM | w/o Rulebook | 2026.01 | 9.99 |
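
The ablation rows above are easiest to read as a drop from the full MeepleLM score. The sketch below computes that drop using only the Op-Rec values listed in the results. The function and variable names are illustrative and do not come from the benchmark itself.

```python
# Op-Rec scores copied from the results above (names are illustrative).
FULL_MODEL = 69.77
ABLATIONS = {
    "w/o MDA": 55.35,
    "w/o Persona": 53.84,
    "w/o Rulebook": 9.99,
}


def ablation_drops(full, ablations):
    """Return the Op-Rec points lost by each ablation relative to the full model."""
    return {name: round(full - score, 2) for name, score in ablations.items()}


drops = ablation_drops(FULL_MODEL, ABLATIONS)

# Print the ablations from largest to smallest drop.
for name, drop in sorted(drops.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: -{drop:.2f}")
```

By this arithmetic, removing the rulebook costs the most (59.78 points), while removing Persona or MDA costs 15.93 and 14.42 points respectively.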