Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

General Reasoning on AGIEval (Accuracy)

46.03Accuracy

Qwen2.5-7B

1.93413.38224.8336.278Feb 4, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
46.03
2026.02
39.82
2026.02
16.12
2026.02
3.63