
Long-context Evaluation on RULER ultra-long context official (Context Length Sweep)

96 Accuracy (128K)

Qwen3-Next-80B-A3B-Instruct

[Chart: Accuracy (128K) by date; data point at Feb 12, 2026]
Updated 4d ago

Evaluation Results

Method     Links
2026.02    96      86.9    80.3    -
2026.02    93.9    90.9    84.5    -
2026.02    89.4    87.1    86.3    81.6
2026.02    89.1    78.4    72.8    -