Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long-context language understanding on LongBench (Specific Metric Subset)

87.77TriviaQA Score

MemDLM (Train & Inference)

53.990862.760471.5380.2996Mar 23, 2026
Updated 25d ago

Evaluation Results

MethodLinks
2026.03
87.7754.6987.3831.9743.2822.6122.3426.77116.1464.2555.23
2026.03
87.7454.3686.2931.4442.8722.4322.1926.3570.515.864.255.05
2026.03
55.2950.3274.529.2242.4721.8522.1323.527013.5962.5554.65