Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

WizardLM

Benchmarks

Task NameDataset NameSOTA ResultTrend
Instruction FollowingWizardLM (test)
Score6.87
13
Refusal behavior defenseWizardLM (test)
BadNet CACC90.4
12
Toxic behavior defenseWizardLM (test)
BadNet CACC0.904
12
Instruction FollowingWizardLM low-resource
Win Rate (bn)62.8
7
Instruction Following EvaluationWizardLM
Score72.06
5
GenerationWizardLM (test)
LLM-as-a-Judge Score48.37
2
Showing 6 of 6 rows