Social Deduction on Spy
[Chart: Accuracy by date. Labeled point: GPT-4o, 70, Jan 9, 2026.]
Updated 4d ago
Evaluation Results
Method          Access          Date      Accuracy
GPT-4o          Closed-source   2026.01   70
DeepSeek-V3.2   Closed-source   2026.01   69
Gemini3-Flash   Closed-source   2026.01   16