Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Context Understanding on CulturalBench
Loading...
0.904
Easy Accuracy
Gemini-3-Pro
0.50776
0.61063
0.7135
0.81637
Feb 2, 2026
Easy Accuracy
Hard Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Easy Accuracy
Hard Accuracy
Gemini-3-Pro
Category=MLLM, Note=di...
2026.02
0.904
0.9
GPT-5.2
Category=MLLM, Note=di...
2026.02
0.883
0.844
TwC (Ours) - Img & Txt
Category=Think with Co...
2026.02
0.883
0.822
Claude-Sonnet 4.5
Category=MLLM, Note=di...
2026.02
0.872
0.765
DeepSeek-R1
Category=Reasoning LLM...
2026.02
0.872
0.851
Qwen3-235B-A22B
Category=Reasoning LLM...
2026.02
0.831
0.825
TwC (Ours) - Only Image
Category=Think with Co...
2026.02
0.7
0.805
TWI-1-Generated Photo
Category=Think with Im...
2026.02
0.697
0.714
Sora 2
Category=Think with Vi...
2026.02
0.6
0.7
DREAMLLM
Category=Think with Im...
2026.02
0.523
0.428
Feedback
Search any
task
Search any
task