Cultural commonsense reasoning on CultureAtlas (All Culture)

95.8 Precision

GPT-4

Updated 4mo ago

Evaluation Results

Method	Links	Precision	Recall	F1
GPT-4 2026.01		95.8	90.6	93.1
CALM 2026.01		93.6	87.7	89.1
LLaMA-2 2026.01		84.2	42.1	56.1
Vicuna 2026.01		79.6	56.8	66.3
Vicuna 2026.01		67.4	81.2	73.7
LLaMA-2 2026.01		63.6	77.1	69.7