Long-context Evaluation on RULER ultra-long context official (Context Length Sweep)

96 Accuracy (128K)

Qwen3-Next-80B-A3B-Instruct

Updated 3mo ago

Evaluation Results

Method	Links	Accuracy (128K)
Qwen3-Next-80B-A3B-Instruct 2026.02		96	86.9	80.3	-
Qwen3-235B-A22B-Instruct-2507 2026.02		93.9	90.9	84.5	-
MiniCPM-SALA 2026.02		89.4	87.1	86.3	81.6
Qwen3-30B-A3B-Instruct-2507 2026.02		89.1	78.4	72.8	-