Commonsense Reasoning on SIQA (test)
Best accuracy: 40.28, PGM 6 / 6 (1024)
[Chart: Accuracy by date; data point at May 24, 2025]
Updated 4d ago
Evaluation Results
Method              Distillation    Date      Accuracy
PGM 6 / 6 (1024)    Before          2025.05   40.28
PGM 8 / 8           Before          2025.05   39.97
PGM 8 / 8           After Dis...    2025.05   39.61
PGM 6 / 6 (1024)    After Dis...    2025.05   39.25
MDLM                Before          2025.05   38.84
MDLM                After Dis...    2025.05   38.79