Share your thoughts, 1 month free Claude Pro on usSee more

Long-context capability evaluation on RULER 8192 length

93.75Accuracy

Qwen3-30B A3B-Instruct

Updated 4mo ago

Evaluation Results

Method	Links
Qwen3-30B A3B-Instruct 2026.02		93.75
QUOKA 2026.02		92.77
Qwen3-4B 2026.02		91.68
QUOKA 2026.02		91.35
Smollm3 2026.02		83.46
Qwen2.5-3B 2026.02		81.99
QUOKA 2026.02		81.45
Llama3.2-3B 2026.02		81.33
QUOKA 2026.02		80.66
QUOKA 2026.02		79.78
QUOKA 2026.02		79.72
GPT-OSS-20B 2026.02		79.32