ReasonIF

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Instruction Following | ReasonIF synthesized v1.0 | IFS: 96.3 | 55
Reasoning | ReasonIF | Error Rate (ERR): 6.7 | 11