Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Retrieval on RULER 128K context

66.71Accuracy

gpt-oss-puzzle-88B

44.173250.024155.87561.7259Feb 12, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
66.71-------------
2026.02
66.7-------------
2026.02
59.8-------------
2026.02
56.11-------------
2026.02
56.02-------------
2026.02
55.01-------------
2026.02
53.77-------------
2026.02
52.61-------------
2026.02
50.89-------------
2026.02
47.28-------------
2026.02
45.82-------------
2026.02
45.04-------------
2025.06
-10097.9210097.9277.0894.2796.0951.0475.3578.1240.6282.68-
2025.06
-98.9693.7596.8896.8859.3891.6794.795057.9976.0435.2977.8-
2025.06
-10069.7947.9297.3553.1278.9190.161.2563.1973.9638.5472.55-
2025.06
-94.794.1751.0465.6220.8344.7957.8141.6757.9967.7138.5455.51-
2025.06
-1.042.083.125.213.124.695.730.6267.751.0434.3816.44-
2025.06
-36.4688.543.124.172.081.821.5623.3348.6148.9633.3318.68-
2025.06
-98.9698.968.3389.583.1213.2852.3452.2938.8978.1239.5851.18-
2025.06
-98.9698.9610096.8869.7989.0694.5350.2171.1876.0440.6280.57-
2026.01
-------------50.1
2026.01
-------------53.2