Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

IFBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Instruction FollowingIFBench
Pass@1 (Strict)68.1
68
Instruction FollowingIFBench
Accuracy67.77
25
Reward ModelingIFBench
Accuracy69.3
17
Reward ModelingIFBench Hard
Accuracy78
16
Reward ModelingIFBench Normal
Accuracy80.5
16
Reward ModelingIFBench Simple
Accuracy87.2
16
Instruction FollowingIFBench
IFBench Score43.28
12
AlignmentIFBench
pass@141.7
7
Reward ModelingIFBench (test)
Accuracy57.9
7
Instruction FollowingIFBench (test)
Score38.61
5
Instruction FollowingIFBench Strict
Avg@1031.5
2
Showing 11 of 11 rows