
Long-context performance evaluation on RULER

Accuracy: 95.31

Dense

[Accuracy chart; axis ticks: 80.9476, 84.6763, 88.405, 92.1337; date label: Feb 7, 2026]
Updated 4d ago

Evaluation Results

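Method | Links
2026.02 | 95.31 | - | - | - | - | - | -
2026.02 | 95.31 | - | - | - | - | - | -
2026.02 | 95 | - | - | - | - | - | -
2026.02 | 94.06 | - | - | - | - | - | -
2026.02 | 93.64 | - | - | - | - | - | -
2026.02 | 93.13 | - | - | - | - | - | -
2026.02 | 89.69 | - | - | - | - | - | -
2026.02 | 87.5 | - | - | - | - | - | -
2026.02 | 82.81 | - | - | - | - | - | -
2026.02 | 81.5 | - | - | - | - | - | -
2026.02 | - | 91.44 | - | - | - | - | -
2026.02 | - | 90.15 | - | - | - | - | -
2025.02 | - | 86 | 90.1 | 95 | 83.4 | 85.5 | 76.3
2025.02 | - | 18.6 | 15 | 21.5 | 24.4 | 16.9 | 15
2025.02 | - | - | 27.1 | - | - | - | -
2025.02 | - | 73.6 | 75.6 | 76.8 | 72.9 | 75 | 67.7
2025.02 | - | - | 66.5 | - | - | - | -
2025.02 | - | 69.6 | 69.7 | 68.2 | 70.4 | 69.8 | 69.8
2025.02 | - | 75.6 | 77.8 | 77.3 | 77.2 | 77.4 | 68.2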