Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Instruction Following on IFBench Strict
Loading...
31.5
Avg@10
Qwen3-4B RL finetuned on HanabiRewards
30.876
31.038
31.2
31.362
Jan 26, 2026
Avg@10
Pass@10
Updated 4d ago
Evaluation Results
Method
Method
Links
Avg@10
Pass@10
Qwen3-4B RL finetuned on HanabiRewards
Backbone=Qwen3-4B, Var...
2026.01
31.5
44.6
Qwen3-4B-Instruct-2507
Backbone=Qwen3-4B, Var...
2026.01
30.9
42.9
Feedback
Search any
task
Search any
task