Share your thoughts, 1 month free Claude Pro on usSee more

Low-resource language evaluation on MiLi-Eval

54.8BOD

TRIMIX

Updated 3mo ago

Evaluation Results

Method	Links
TRIMIX 2026.04		54.8	55.9	51.8	29.4	48
12B-ins 2026.04		49.7	57.6	50.8	24.1	45.6
Proxy Tuning 2026.04		49.6	54.4	48.5	24.6	44.3
4B-cpt 2026.04		35.7	36	33.2	19.1	31
4B-base 2026.04		24.2	32	24.7	17.2	24.5