Instruction Following on Ko-IFEval
Overall Score: 93.2 (gpt-oss-120b)
[Chart: Overall Score (y-axis) by date (x-axis), Jan 11, 2026 to Mar 19, 2026]
Updated 1mo ago
Evaluation Results
Method                        Settings                   Date      Overall Score
gpt-oss-120b                  Parameter Count=117B,...   2026.01   93.2
Qwen-3-30B-A3B                Reasoning=On               2026.03   93.2
K-EXAONE-236B-A23B            Reasoning=On               2026.03   91
Solar-open-100B               Reasoning=On               2026.03   88.19
Solar Open                    Parameter Count=102B       2026.01   87.5
K-EXAONE-236B-A23B            Reasoning=Off              2026.03   87.45
gpt-oss-120b                  Parameter Count=117B,...   2026.01   86.7
Mi:dm K 2.5 Pro (March '26)   Reasoning=On               2026.03   85.6
Qwen-3-30B-A3B                Reasoning=Off              2026.03   85.13
HyperCLOVAX-SEED-Think-32B    Reasoning=On               2026.03   84.2
Mi:dm K 2.5 Pro (March '26)   Reasoning=Off              2026.03   81.03
HyperCLOVAX-SEED-Think-32B    Reasoning=Off              2026.03   79.74
GLM-4.5-Air                   Parameter Count=110B       2026.01   79.5