Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Conversational on MixEval
Loading...
76
Score
Qwen3.5-9B
62.896
66.298
69.7
73.102
May 29, 2026
Score
Updated 2d ago
Evaluation Results
Method
Method
Links
Score
Qwen3.5-9B
Parameters=9B, Variant...
2026.05
76
Qwen3.5-4B
Parameters=4B, Variant...
2026.05
71.9
Ministral-3-14B
Parameters=14B, Varian...
2026.05
70.8
OLMo-3-7B
Parameters=7B, Variant...
2026.05
67
Mellum 2 (RL)
Post-training Stage=RL...
2026.05
66.9
Mellum 2 (SFT)
Post-training Stage=SF...
2026.05
63.4
Feedback
Search any
task
Search any
task