Long-context Evaluation on Ruler (Average rank)

Average Rank: 2

Ministral-3-8B

Updated 2mo ago

Evaluation Results

Method	Links	Average Rank
Ministral-3-8B 2026.05		2
Qwen3-8B 2026.05		2.3
Llama-3.1-8B 2026.05		2.7
Qwen3-4B 2026.05		4.3
gemma-3-12b-it 2026.05		5
GPT-5 nano 2026.05		6
Llama-3.2-3B 2026.05		6.7
gemma-3-4b-it 2026.05		9
gpt-oss-20b 2026.05		9.3
Moonlight-16B-A3B 2026.05		10.3
EngGPT2-16B-A3B 2026.05		11.3
LLaMAntino-3-8B 2026.05		11.3
FastwebMIIA-7B 2026.05		12
Velvet-14B 2026.05		13.7
deepseek-moe-16b 2026.05		14.3
Minerva-7B 2026.05		15.7