Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Long-context understanding on RULER (dev)

96.1Accuracy (4K Context)

Olmo 3 32B

84.015287.152690.2993.4274Dec 15, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
96.194.5790.4286.2279.7
2025.12
96.0595.0693.7792.4288.8
2025.12
96.0394.5295.0792.6780.73
2025.12
95.5894.193.7890.29-
2025.12
95.5692.7693.1391.4386.88
2025.12
95.3193.0991.5889.0185.13
2025.12
94.8991.2184.1478.7967.96
2025.12
94.6390.8788.6887.2667.3
2025.12
94.3393.4592.5389.28-
2025.12
91.9885.6982.778.1367.62
2025.12
91.5284.2680.5476.8260.33
2025.12
90.4782.4874.4369.0559.89
2025.12
84.4884.285.3687.0684.59