Long-context Language Modeling on Composite Suite (MRCR v2, GraphWalks, LongBench v2, RULER, AA-LCR)
[Chart: Average Score over time. Top entry: IndexCache, 78.7 (Mar 12, 2026). Selectable metrics: Average Score, MRCR v2 Score, GraphWalks Score, LongBench v2 Score, RULER Score, AA-LCR Score.]
Evaluation Results
| Method | Configuration | Date | Average Score | MRCR v2 Score | GraphWalks Score | LongBench v2 Score | RULER Score | AA-LCR Score |
|---|---|---|---|---|---|---|---|---|
| IndexCache | Backbone=GLM-5 (744B),... | 2026.03 | 78.7 | 72.3 | 90.8 | 66 | 97.3 | 67.2 |
| Original DSA | Backbone=GLM-5 (744B) | 2026.03 | 78.4 | 71.1 | 92.7 | 64.5 | 97.7 | 66.2 |
| IndexCache | Backbone=GLM-5 (744B),... | 2026.03 | 78.1 | 72.8 | 90.2 | 65.1 | 97.6 | 64.6 |
| IndexCache | Backbone=GLM-5 (744B),... | 2026.03 | 78 | 70.8 | 90.3 | 63.7 | 97.6 | 67.6 |
| IndexCache | Backbone=GLM-5 (744B),... | 2026.03 | 72.7 | 65.8 | 74.9 | 62.2 | 96.2 | 64.6 |
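
The page does not say how Average Score is computed. The listed values are consistent, to one decimal place, with an unweighted mean of the five per-benchmark scores (MRCR v2, GraphWalks, LongBench v2, RULER, AA-LCR), so the sketch below checks that assumption against the rows above. The helper names `mean_score` and `matches_reported` are hypothetical and exist only for this illustration.

```python
# Scores copied from the results table, in the order:
# MRCR v2, GraphWalks, LongBench v2, RULER, AA-LCR.
# Each entry is (method, reported Average Score, per-benchmark scores).
rows = [
    ("IndexCache", 78.7, [72.3, 90.8, 66.0, 97.3, 67.2]),
    ("Original DSA", 78.4, [71.1, 92.7, 64.5, 97.7, 66.2]),
    ("IndexCache", 78.1, [72.8, 90.2, 65.1, 97.6, 64.6]),
    ("IndexCache", 78.0, [70.8, 90.3, 63.7, 97.6, 67.6]),
    ("IndexCache", 72.7, [65.8, 74.9, 62.2, 96.2, 64.6]),
]


def mean_score(scores):
    """Unweighted arithmetic mean (assumed aggregation, not stated by the page)."""
    return sum(scores) / len(scores)


def matches_reported(reported, scores, tol=0.05):
    """True if the mean agrees with the reported value within rounding to 0.1."""
    # The small epsilon absorbs floating-point noise at the rounding boundary.
    return abs(mean_score(scores) - reported) <= tol + 1e-9


for method, reported, scores in rows:
    status = "ok" if matches_reported(reported, scores) else "mismatch"
    print(f"{method:<13} reported={reported:5.1f} mean={mean_score(scores):6.2f} {status}")
```

If a future row fails this check, the aggregation is probably weighted or includes a benchmark not shown here, and the unweighted-mean assumption should be dropped.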