Conversational on JetBrains pairwise
[Chart: Score by date. Top entry: Mellum 2 (RL), Score 69.5, May 29, 2026]
Evaluation Results

Method            Details                    Date     Score
Mellum 2 (RL)     Post-training Stage=RL...  2026.05  69.5
Mellum 2 (SFT)    Post-training Stage=SF...  2026.05  64.4
Ministral-3-14B   Parameters=14B, Varian...  2026.05  63.8
Qwen3.5-9B        Parameters=9B, Variant...  2026.05  56.7
Qwen3.5-4B        Parameters=4B, Variant...  2026.05  40.5
OLMo-3-7B         Parameters=7B, Variant...  2026.05  32.2