Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Natural Language Understanding on EuroEval English
Loading...
56.8
EuroEval Score
EuroLLM-9B (Base)
-2.272
13.064
28.4
43.736
May 25, 2026
EuroEval Score
Updated 7d ago
Evaluation Results
Method
Method
Links
EuroEval Score
EuroLLM-9B (Base)
Language model=EuroLLM-9B
2026.05
56.8
aya-3B (Base)
Language model=aya-3B
2026.05
54.1
aya-3B (Multilingual All Lang SFT)
Regime=Multilingual, P...
2026.05
7.9
aya-3B (Multilingual Max-R SFT)
Regime=Multilingual, P...
2026.05
7
aya-3B (Monolingual Max-R SFT)
Regime=Monolingual, Po...
2026.05
6.3
EuroLLM-9B (Multilingual All Lang SFT)
Regime=Multilingual, P...
2026.05
3.9
ICR (8B)
Parameters=8B, Compari...
2026.05
3.6
aya-3B (Monolingual In-lang SFT)
Regime=Monolingual, Po...
2026.05
2.8
MAPO (13B)
Parameters=13B, Compar...
2026.05
2.6
ICR (8B)
Baseline Model=ICR, Pa...
2026.05
1
EuroLLM-9B (Monolingual Max-R SFT)
Regime=Monolingual, Po...
2026.05
0.5
aya-3B (Multilingual Paired DPO)
Regime=Multilingual, P...
2026.05
0.4
EuroLLM-9B (Monolingual Paired DPO)
Regime=Monolingual, Po...
2026.05
0.4
EuroLLM-9B (Multilingual Paired DPO)
Regime=Multilingual, P...
2026.05
0.3
aya-3B (Monolingual Paired DPO)
Regime=Monolingual, Po...
2026.05
0.2
EuroLLM-9B (Multilingual Max-R SFT)
Regime=Multilingual, P...
2026.05
0.2
EuroLLM-9B (Monolingual In-lang SFT)
Regime=Monolingual, Po...
2026.05
0.1
MAPO (13B)
Baseline Model=MAPO, P...
2026.05
0
Feedback
Search any
task
Search any
task