Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Instruction Following on IFBench, IFEval, IFEval-ko
Loading...
86.1
Accuracy
GLM-4.6
39.716
51.758
63.8
75.842
Jan 14, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
GLM-4.6
Thinking Mode=true, Pa...
2026.01
86.1
GLM-4.6
Thinking Mode=true, Pa...
2026.01
85.8
DeepSeek-V3.1
Thinking Mode=true, Pa...
2026.01
84.4
A.X K1
Thinking Mode=true, Pa...
2026.01
81
A.X K1
Thinking Mode=true, Pa...
2026.01
80.4
DeepSeek-V3.1
Thinking Mode=true, Pa...
2026.01
79.2
A.X K1
Thinking Mode=true, Pa...
2026.01
64.7
GLM-4.6
Thinking Mode=true, Pa...
2026.01
43.4
DeepSeek-V3.1
Thinking Mode=true, Pa...
2026.01
41.5
Feedback
Search any
task
Search any
task