Commonsense Reasoning on Winogrande first 1000 examples (standard)

Accuracy: 48.9

k=64 + sink

[Chart: Accuracy, axis range 48.796 to 48.877, Apr 20, 2026]
Updated 1mo ago

Evaluation Results

Date     Accuracy  Method  Links
2026.04  48.9
2026.04  48.8
2026.04  48.8