Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Zero-shot Common Sense Reasoning on (PIQA, HellaSwag, WSC, BoolQ, RACE-H)
Loading...
71.22
PIQA
LLMPruner
66.0096
67.3623
68.715
70.0677
Jan 29, 2026
PIQA
HellaSwag
WSC
BoolQ
RACE-H
Average Accuracy (Zero-Shot)
Updated 4d ago
Evaluation Results
Method
Method
Links
PIQA
HellaSwag
WSC
BoolQ
RACE-H
Average Accuracy (Zero-Shot)
LLMPruner
Backbone=Llama-2-7B, P...
2026.01
71.22
56.46
36.54
55.2
22.56
48.4
LaCo
Backbone=Llama-2-7B, P...
2026.01
69.8
55.69
40.38
64.07
22.61
50.51
TAPPA
Backbone=Llama-2-7B, P...
2026.01
66.76
55.97
68.13
62.17
33.88
57.38
ShortGPT
Backbone=Llama-2-7B, P...
2026.01
66.43
53.02
52.46
74.71
32.25
55.77
SliceGPT
Backbone=Llama-2-7B, P...
2026.01
66.21
50.27
36.54
38.32
21.07
42.48
Feedback
Search any
task
Search any
task