
Zero-shot Commonsense Reasoning on CSQA

83.19 PIQA

SLEB-pruned LLaMA2-7B

Feb 11, 2026
Updated 4d ago

Evaluation Results

Date     Method  #1     #2     #3     #4     #5     #6     #7     #8     Links
2026.02  -       83.19  86.82  72.01  79.97  61.86  68.9   77     75.68  -
2026.02  -       80.2   86.88  71.03  76.68  58.53  62.43  74.6   72.91  -
2026.02  -       76.77  75.07  67.2   73.53  52.73  60.62  65     67.27  -
2026.02  -       72.85  72.32  65.66  68.94  47.87  55.56  60.2   63.34  -
2026.02  -       71.16  77.15  64.48  64.73  45.73  56.75  55.2   62.17  -
2026.02  -       53.97  48.55  49.08  44.02  30.38  54.38  25.2   43.65  -
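
Reading each results row above as eight scores, the last value appears to be the mean of the seven before it, rounded to two decimals. The page itself does not label the columns, so this is an assumption rather than a documented fact. A minimal Python sketch that checks it (the names `ROWS` and `mean_matches_last` are hypothetical, introduced only for illustration):

```python
# Scores as read from the six result rows above, eight values per row.
# Assumption: the eighth value is an average of the first seven.
ROWS = [
    [83.19, 86.82, 72.01, 79.97, 61.86, 68.9, 77, 75.68],
    [80.2, 86.88, 71.03, 76.68, 58.53, 62.43, 74.6, 72.91],
    [76.77, 75.07, 67.2, 73.53, 52.73, 60.62, 65, 67.27],
    [72.85, 72.32, 65.66, 68.94, 47.87, 55.56, 60.2, 63.34],
    [71.16, 77.15, 64.48, 64.73, 45.73, 56.75, 55.2, 62.17],
    [53.97, 48.55, 49.08, 44.02, 30.38, 54.38, 25.2, 43.65],
]


def mean_matches_last(row, tol=0.005 + 1e-9):
    """Return True if the last value equals the mean of the others within tol.

    The default tolerance covers rounding of the mean to two decimals.
    """
    *scores, reported = row
    return abs(sum(scores) / len(scores) - reported) <= tol


if __name__ == "__main__":
    for row in ROWS:
        print(row[-1], mean_matches_last(row))
```

If the check holds for every row, that supports splitting each run of digits into seven task scores plus one average, though it does not say which task each column belongs to.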