Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Cultural Alignment on GlobalOpinionQA

60.08Accuracy

GPT-5 + Role-Play

45.717649.446353.17556.9037Apr 9, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.04
60.08100
2025.04
59.7485.7143
2025.04
59.2292.8571
2025.04
58.3385.7143
2025.04
58.2592.8571
2025.04
58.1571.4286
2025.04
57.78100
2025.04
57.44100
2025.04
57.1185.7143
2025.04
56.76100
2025.04
56.6492.8571
2025.04
56.5978.5714
2025.04
56.4778.5714
2025.04
56.4392.8571
2025.04
56.2385.7143
2025.04
55.8964.2857
2025.04
55.8392.8571
2025.04
54.8471.4286
2025.04
53.7664.2857
2025.04
53.28-
2025.04
52.8342.8571
2025.04
52.69-
2025.04
51.83-
2025.04
46.27-