Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Commonsense Reasoning on Winogrande first 1000 examples (standard)
Loading...
48.9
Accuracy
k=64 + sink
48.796
48.823
48.85
48.877
Apr 20, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
k=64 + sink
Model size=300M, Token...
2026.04
48.9
k=0 (baseline)
Model size=300M, Token...
2026.04
48.8
k=64 (DR only)
Model size=300M, Token...
2026.04
48.8
Feedback
Search any
task
Search any
task