Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Theory of Mind on ToMATO
Loading...
82.2
Accuracy
GPT-4o
64.104
68.802
73.5
78.198
Feb 11, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
GPT-4o
Model Family=GPT Famil...
2026.02
82.2
GPT-o3
Model Family=GPT Famil...
2026.02
81.7
GPT-o4-mini
Model Family=GPT Famil...
2026.02
79.2
DeepSeek-V3
Model Family=DeepSeek...
2026.02
78.2
DeepSeek-R1
Model Family=DeepSeek...
2026.02
74.9
Qwen3-32B
Model Family=Qwen3-32B...
2026.02
73.2
Qwen3-32B-Reasoning
Model Family=Qwen3-32B...
2026.02
71.4
Qwen3-8B
Model Family=Qwen3-8B,...
2026.02
70.5
Qwen3-8B-Reasoning
Model Family=Qwen3-8B,...
2026.02
64.8
Feedback
Search any
task
Search any
task