Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context understanding on RULER 64K

92.37Accuracy

RetroInfer

37.530851.767966.00580.2421Nov 18, 2025Dec 17, 2025Jan 16, 2026Feb 15, 2026Mar 17, 2026Apr 16, 2026May 16, 2026
Updated 14d ago

Evaluation Results

MethodLinks
2025.11
92.37---
2025.11
92.35---
2025.11
92.29---
92.26---
2025.11
92.24---
2026.05
90.56---
2025.11
89.74---
2026.05
88.92---
2026.05
88.43---
2026.05
87.73---
2025.12
84.961001001
2025.12
84.2462611.05
2026.03
84.15---
2026.05
84.12---
2026.05
83.96---
2026.05
83.94---
2026.05
83.82---
2026.05
83.52---
2025.12
83.3944451.09
2026.05
83.27---
2025.12
83.0835361.13
2025.12
83.03--1.03
2026.04
80.26---
2026.05
79.99---
2026.05
79.57---
2025.11
76.28---
2025.11
76.26---
2025.11
76.26---
2025.11
75.94---
2025.12
75.07--1.1
2025.11
72.96---
2026.04
72.12---
2026.03
69.45---
2025.12
66.04--1.19
2026.03
64.1---
2026.03
64.02---
2026.03
39.64---