Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

ReasonIF

Benchmarks

Task NameDataset NameSOTA ResultTrend
Instruction FollowingReasonIF synthesized v1.0
IFS96.3
55
ReasoningReasonIF
Error Rate (ERR)6.7
11
Showing 2 of 2 rows