Theory of Mind on HiToM
[Chart: Accuracy. Best result: GPT-o3, 74.7 (Feb 11, 2026).]
Evaluation Results
Method               Details                    Date     Accuracy
GPT-o3               Model Family=GPT Famil...  2026.02  74.7
DeepSeek-V3          Model Family=DeepSeek...   2026.02  69.4
Qwen3-32B-Reasoning  Model Family=Qwen3-32B...  2026.02  68
GPT-4o               Model Family=GPT Famil...  2026.02  60.7
Qwen3-32B            Model Family=Qwen3-32B...  2026.02  58.6
Qwen3-8B             Model Family=Qwen3-8B,...  2026.02  55.8
DeepSeek-R1          Model Family=DeepSeek...   2026.02  54.9
GPT-o4-mini          Model Family=GPT Famil...  2026.02  54.7
Qwen3-8B-Reasoning   Model Family=Qwen3-8B,...  2026.02  48.1