Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Low-resource language evaluation on MiLi-Eval
Loading...
54.8
BOD
TRIMIX
22.976
31.238
39.5
47.762
Apr 20, 2026
BOD
UIG
KAZ
MVF
Average Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
BOD
UIG
KAZ
MVF
Average Score
TRIMIX
Selection Strategy=PPL...
2026.04
54.8
55.9
51.8
29.4
48
12B-ins
Model Size=12B, Stage=...
2026.04
49.7
57.6
50.8
24.1
45.6
Proxy Tuning
Strategy=Proxy Tuning,...
2026.04
49.6
54.4
48.5
24.6
44.3
4B-cpt
Model Size=4B, Stage=c...
2026.04
35.7
36
33.2
19.1
31
4B-base
Model Size=4B, Stage=b...
2026.04
24.2
32
24.7
17.2
24.5
Feedback
Search any
task
Search any
task