Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Instruction Following Evaluation Suite

Benchmarks

Task NameDataset NameSOTA ResultTrend
Instruction FollowingInstruction Following Evaluation Suite
Self-Instruct Score25.39
33
Instruction-followingInstruction-following Evaluation Suite (MMLU, DROP, HEval, BBH) (test)
MMLU79.67
11
Showing 2 of 2 rows