Knowledge & Instruction Following on IFEval (Accuracy)
[Chart: Accuracy over time. Top entry: Qwen3.5-9B, 91.31 (Jun 1, 2026).]
Updated 22h ago
Evaluation Results
| Method | Details (truncated in source) | Date | Accuracy |
|---|---|---|---|
| Qwen3.5-9B | Params=9B, Sampling mo... | 2026.06 | 91.31 |
| Qwen3.5 | Params=4B, Sampling mo... | 2026.06 | 90.02 |
| LLaDA-2.1-flash | Params=100B-A5B, Sampl... | 2026.06 | 83.36 |
| LLaDA-2.0-flash | Params=100B-A5B, Sampl... | 2026.06 | 81.7 |
| LLaDA-2.1-mini | Params=16B-A1B, Sampli... | 2026.06 | 81.33 |
| LLaDA-2.0-mini | Params=16B-A1B, Sampli... | 2026.06 | 80.78 |
| Qwen3.5 | Params=2B, Sampling mo... | 2026.06 | 79.48 |
| FLARE | Params=4B, Sampling mo... | 2026.06 | 73.57 |
| FLARE | Params=4B, Sampling mo... | 2026.06 | 73.2 |
| FLARE-9B | Params=9B, Sampling mo... | 2026.06 | 71.35 |
| FLARE | Params=2B, Sampling mo... | 2026.06 | 68.95 |
| FLARE-9B | Params=9B, Sampling mo... | 2026.06 | 63.22 |
| FLARE | Params=2B, Sampling mo... | 2026.06 | 62.66 |
| SDAR | Params=8B, Sampling mo... | 2026.06 | 61.4 |
| SDAR 30B-A3B | Params=30B-A3B, Sampli... | 2026.06 | 60.6 |
| SDAR | Params=4B, Sampling mo... | 2026.06 | 56.6 |
| SDAR | Params=1.7B, Sampling... | 2026.06 | 43.4 |