Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FollowBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Instruction FollowingFollowBench
HSR66.9
39
Instruction FollowingFollowBench In-Domain
HSR70.4
23
Constraint FollowingFollowBench SSR
Success Rate (Level 1)84.7
18
Reward ModelingFollowBench
Accuracy89.9
17
Showing 4 of 4 rows