Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Instruction Following on IFEval EN
Loading...
87.64
Score
Qwen3-8B
30.7104
45.4902
60.27
75.0498
Jan 26, 2026
Jan 31, 2026
Feb 6, 2026
Feb 12, 2026
Feb 18, 2026
Feb 24, 2026
Mar 2, 2026
Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Score
Qwen3-8B
backbone=Qwen3-8B
2026.01
87.64
Llama-EstLLM-8B-Instruct-CV
parameters=8B, chat_ve...
2026.03
81.7
Llama-3.1-8B-Instruct
parameters=8B
2026.03
81.1
Qwen2.5-7B-Instruct
parameters=7B
2026.03
79.5
Typhoon-S-8B
training=SFT+OPD with...
2026.01
79.28
Apertus-8B-Instruct-2509
parameters=8B
2026.03
78.1
Llama-EstLLM-8B-Instruct
parameters=8B, chat_ve...
2026.03
75.3
EuroLLM-9B-Instruct
parameters=9B
2026.03
70
Ministral-3-8B-Instruct-2512
parameters=8B
2026.03
68.5
Apertus-EstLLM-8B-Instruct
parameters=8B
2026.03
66.4
Llammas
2026.03
43.7
salamandra-7b-instruct
parameters=7B
2026.03
32.9
Feedback
Search any
task
Search any
task