Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Skill Learning on Medical and Multi-Task Suite (Hellaswag, Humaneval, IFeval, MMLU, TruthfulQA, Winogrande)

40.2Medical Score

SDFT

29.69632.42335.1537.877Jan 27, 2026
Updated 3d ago

Evaluation Results

MethodLinks
2026.01
40.261.467.772.371.547.371.965.4
2026.01
36.261.964.674.671.640.171.364
35.661.563.167.67042.371.462.6
2026.01
35.559.562.156.670.539.872.960.2
2026.01
30.16265.874.371.747.971.165.5