Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Medical and Previous Task Suite

Benchmarks

Task NameDataset NameSOTA ResultTrend
Skill LearningMedical and Previous Task Suite (Hellaswag, Humaneval, IFeval, MMLU, TruthfulQA, Winogrande)
Medical Score40.2
5
Showing 1 of 1 rows