Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Zero-shot Common Sense Reasoning on (PIQA, HellaSwag, WSC, BoolQ, RACE-H)

71.22PIQA

LLMPruner

66.009667.362368.71570.0677Jan 29, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
71.2256.4636.5455.222.5648.4
2026.01
69.855.6940.3864.0722.6150.51
2026.01
66.7655.9768.1362.1733.8857.38
2026.01
66.4353.0252.4674.7132.2555.77
2026.01
66.2150.2736.5438.3221.0742.48