Zero-shot Commonsense Reasoning Suite (PIQA, HellaSwag, WinoGrande, ARC-E/C, SIQA, BoolQ, LAMBADA)
[Figure: PIQA Accuracy (Zero-shot). Plotted point: OSDN-APF, 73.3 (May 13, 2026).]
Updated 20d ago
Evaluation Results
All scores are zero-shot accuracy (%).

| Method | Details | Date | PIQA | HellaSwag | WinoGrande | ARC-Easy | ARC-Challenge | SIQA | BoolQ | LAMBADA | Average |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| OSDN-APF | Model Size=1.3B, Train... | 2026.05 | 73.3 | 59.2 | 60.6 | 69 | 40.4 | 41.7 | 60.5 | 48 | 56.6 |
| DeltaNet | Model Size=1.3B, Train... | 2026.05 | 72.7 | 58.1 | 59.8 | 68.7 | 41.8 | 41.5 | 57.9 | 47.3 | 56 |
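
The Average Accuracy figures can be sanity-checked against the per-task scores. The sketch below assumes the average is the unweighted mean of the eight task accuracies, rounded to one decimal. The page does not state this, but the assumption reproduces both reported averages. The names `scores` and `mean_accuracy` are illustrative only.

```python
# Per-task zero-shot accuracies copied from the results above, in the order
# PIQA, HellaSwag, WinoGrande, ARC-Easy, ARC-Challenge, SIQA, BoolQ, LAMBADA.
scores = {
    "OSDN-APF": [73.3, 59.2, 60.6, 69.0, 40.4, 41.7, 60.5, 48.0],
    "DeltaNet": [72.7, 58.1, 59.8, 68.7, 41.8, 41.5, 57.9, 47.3],
}


def mean_accuracy(task_scores):
    """Unweighted mean over tasks (an assumption about how the average is computed)."""
    return sum(task_scores) / len(task_scores)


# Round to one decimal to match the precision shown on the leaderboard.
averages = {method: round(mean_accuracy(s), 1) for method, s in scores.items()}

for method, avg in averages.items():
    print(f"{method}: {avg}")
```

Under that assumption, OSDN-APF leads DeltaNet on six of the eight tasks (all except ARC-Challenge) and on the average.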