Share your thoughts, 1 month free Claude Pro on usSee more

Context Understanding on CulturalBench

0.904Easy Accuracy

Gemini-3-Pro

Updated 4mo ago

Evaluation Results

Method	Links
Gemini-3-Pro 2026.02		0.904	0.9
GPT-5.2 2026.02		0.883	0.844
TwC (Ours) - Img & Txt 2026.02		0.883	0.822
Claude-Sonnet 4.5 2026.02		0.872	0.765
DeepSeek-R1 2026.02		0.872	0.851
Qwen3-235B-A22B 2026.02		0.831	0.825
TwC (Ours) - Only Image 2026.02		0.7	0.805
TWI-1-Generated Photo 2026.02		0.697	0.714
Sora 2 2026.02		0.6	0.7
DREAMLLM 2026.02		0.523	0.428