Long-context Evaluation on RULER ultra-long context official (Context Length Sweep)
[Chart: Accuracy (128K). Highlighted point: Qwen3-Next-80B-A3B-Instruct, 96 (Feb 12, 2026). Selectable metrics: Accuracy (128K), Accuracy (512K), Accuracy (1000K), Accuracy (2048K).]
Updated 4d ago
Evaluation Results
| Method | Details | Date | Accuracy (128K) | Accuracy (512K) | Accuracy (1000K) | Accuracy (2048K) |
| --- | --- | --- | --- | --- | --- | --- |
| Qwen3-Next-80B-A3B-Instruct | Parameters=80B | 2026.02 | 96 | 86.9 | 80.3 | - |
| Qwen3-235B-A22B-Instruct-2507 | Parameters=235B, Versi... | 2026.02 | 93.9 | 90.9 | 84.5 | - |
| MiniCPM-SALA | Parameters=9B | 2026.02 | 89.4 | 87.1 | 86.3 | 81.6 |
| Qwen3-30B-A3B-Instruct-2507 | Parameters=30B, Versio... | 2026.02 | 89.1 | 78.4 | 72.8 | - |
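The results above can be compared by how much accuracy each model loses as the context grows. The following is a minimal sketch, not part of the leaderboard itself: the scores are copied from the results listed above, and the names `CONTEXTS`, `RESULTS`, and `accuracy_drop` are hypothetical helpers introduced only for illustration. It computes the absolute drop in accuracy points between 128K and 1000K, the longest context for which every listed model has a score.

```python
# Scores copied from the evaluation results; None marks a missing entry ("-").
CONTEXTS = ["128K", "512K", "1000K", "2048K"]
RESULTS = {
    "Qwen3-Next-80B-A3B-Instruct": [96.0, 86.9, 80.3, None],
    "Qwen3-235B-A22B-Instruct-2507": [93.9, 90.9, 84.5, None],
    "MiniCPM-SALA": [89.4, 87.1, 86.3, 81.6],
    "Qwen3-30B-A3B-Instruct-2507": [89.1, 78.4, 72.8, None],
}


def accuracy_drop(scores, start="128K", end="1000K"):
    """Return the drop in accuracy points from `start` to `end`,
    or None if either score is missing."""
    first = scores[CONTEXTS.index(start)]
    last = scores[CONTEXTS.index(end)]
    if first is None or last is None:
        return None
    return round(first - last, 1)


# Drop per model between 128K and 1000K (all four models report both).
drops = {name: accuracy_drop(scores) for name, scores in RESULTS.items()}

# Print models from smallest to largest drop.
for name, drop in sorted(drops.items(), key=lambda item: item[1]):
    print(f"{name}: {drop} points lost from 128K to 1000K")
```

Under this measure MiniCPM-SALA loses the fewest points between 128K and 1000K, even though it does not have the highest score at 128K. An absolute difference is only one possible choice; a ratio of the two scores would work equally well for this kind of comparison.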