Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Commonsense Reasoning on SIQA (test)

40.28Accuracy

PGM 6 / 6 (1024)

38.730439.132739.53539.9373May 24, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.05
40.28
2025.05
39.97
2025.05
39.61
2025.05
39.25
2025.05
38.84
2025.05
38.79